• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RefBool:一种基于参考的基因表达数据离散化算法。

RefBool: a reference-based algorithm for discretizing gene expression data.

机构信息

Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, Luxembourg.

出版信息

Bioinformatics. 2017 Jul 1;33(13):1953-1962. doi: 10.1093/bioinformatics/btx111.

DOI:10.1093/bioinformatics/btx111
PMID:28334101
Abstract

MOTIVATION

The identification of genes or molecular regulatory mechanisms implicated in biological processes often requires the discretization, and in particular booleanization, of gene expression measurements. However, currently used methods mostly classify each measurement into an active or inactive state regardless of its statistical support possibly leading to downstream analysis conclusions based on spurious booleanization results.

RESULTS

In order to overcome the lack of certainty inherent in current methodologies and to improve the process of discretization, we introduce RefBool, a reference-based algorithm for discretizing gene expression data. Instead of requiring each measurement to be classified as active or inactive, RefBool allows for the classification of a third state that can be interpreted as an intermediate expression of genes. Furthermore, each measurement is associated to a p- and q-value indicating the significance of each classification. Validation of RefBool on a neuroepithelial differentiation study and subsequent qualitative and quantitative comparison against 10 currently used methods supports its advantages and shows clear improvements of resulting clusterings.

AVAILABILITY AND IMPLEMENTATION

The software is available as MATLAB files in the Supplementary Information and as an online repository ( https://github.com/saschajung/RefBool ).

CONTACT

antonio.delsol@uni.lu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

在生物过程中,识别基因或分子调控机制通常需要对基因表达测量进行离散化,特别是布尔化。然而,目前使用的方法大多将每个测量值分类为活动或不活动状态,而不考虑其统计支持,这可能导致基于虚假布尔化结果的下游分析结论。

结果

为了克服当前方法中固有的不确定性,并改进离散化过程,我们引入了 RefBool,这是一种用于离散化基因表达数据的基于参考的算法。RefBool 不要求对每个测量值进行分类为活动或不活动,而是允许对可解释为基因中间表达的第三种状态进行分类。此外,每个测量值都与 p 值和 q 值相关联,指示每个分类的显著性。在神经上皮分化研究中对 RefBool 的验证,以及随后与 10 种当前使用的方法进行定性和定量比较,支持了它的优势,并显示了聚类结果的明显改进。

可用性和实现

该软件以 MATLAB 文件的形式在补充信息中提供,并作为在线存储库(https://github.com/saschajung/RefBool)提供。

联系方式

antonio.delsol@uni.lu。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
RefBool: a reference-based algorithm for discretizing gene expression data.RefBool:一种基于参考的基因表达数据离散化算法。
Bioinformatics. 2017 Jul 1;33(13):1953-1962. doi: 10.1093/bioinformatics/btx111.
2
Feature specific quantile normalization enables cross-platform classification of molecular subtypes using gene expression data.特征特异性分位数归一化可使用基因表达数据对分子亚型进行跨平台分类。
Bioinformatics. 2018 Jun 1;34(11):1868-1874. doi: 10.1093/bioinformatics/bty026.
3
ST Pipeline: an automated pipeline for spatial mapping of unique transcripts.ST 管道:用于独特转录本空间映射的自动化管道。
Bioinformatics. 2017 Aug 15;33(16):2591-2593. doi: 10.1093/bioinformatics/btx211.
4
pulseR: Versatile computational analysis of RNA turnover from metabolic labeling experiments.pulseR:基于代谢标记实验的 RNA 周转的多功能计算分析。
Bioinformatics. 2017 Oct 15;33(20):3305-3307. doi: 10.1093/bioinformatics/btx368.
5
powsimR: power analysis for bulk and single cell RNA-seq experiments.powsimR:用于批量和单细胞 RNA-seq 实验的功效分析。
Bioinformatics. 2017 Nov 1;33(21):3486-3488. doi: 10.1093/bioinformatics/btx435.
6
Robust and sparse correlation matrix estimation for the analysis of high-dimensional genomics data.稳健稀疏相关矩阵估计在高通量基因组学数据分析中的应用
Bioinformatics. 2018 Feb 15;34(4):625-634. doi: 10.1093/bioinformatics/btx642.
7
A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm.一种使用层次聚类算法净化从头转录组的新方法。
Bioinformatics. 2017 May 1;33(9):1293-1300. doi: 10.1093/bioinformatics/btw793.
8
anexVis: visual analytics framework for analysis of RNA expression.anexVis:用于分析 RNA 表达的可视化分析框架。
Bioinformatics. 2018 Jul 15;34(14):2510-2512. doi: 10.1093/bioinformatics/bty122.
9
switchde: inference of switch-like differential expression along single-cell trajectories.switchde:沿单细胞轨迹推断类似开关的差异表达
Bioinformatics. 2017 Apr 15;33(8):1241-1242. doi: 10.1093/bioinformatics/btw798.
10
AltHapAlignR: improved accuracy of RNA-seq analyses through the use of alternative haplotypes.AltHapAlignR:通过使用替代单倍型提高 RNA-seq 分析的准确性。
Bioinformatics. 2018 Jul 15;34(14):2401-2408. doi: 10.1093/bioinformatics/bty125.

引用本文的文献

1
Detecting expressed genes in cell populations at the single-cell level with scGeneXpress.单细胞水平细胞群体中表达基因的检测 scGeneXpress
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae494.
2
scBoolSeq: Linking scRNA-seq statistics and Boolean dynamics.scBoolSeq:将 scRNA-seq 统计与布尔动力学联系起来。
PLoS Comput Biol. 2024 Jul 8;20(7):e1011620. doi: 10.1371/journal.pcbi.1011620. eCollection 2024 Jul.
3
A computer-guided design tool to increase the efficiency of cellular conversions.一种用于提高细胞转化效率的计算机辅助设计工具。
Nat Commun. 2021 Mar 12;12(1):1659. doi: 10.1038/s41467-021-21801-4.
4
Personalization of Logical Models With Multi-Omics Data Allows Clinical Stratification of Patients.利用多组学数据对逻辑模型进行个性化分析可实现患者的临床分层。
Front Physiol. 2019 Jan 24;9:1965. doi: 10.3389/fphys.2018.01965. eCollection 2018.