• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

测量基因表达谱之间的相似性:贝叶斯方法。

Measuring similarity between gene expression profiles: a Bayesian approach.

机构信息

Computer Laboratory, University of Cambridge, Cambridge, CB3 0FD, UK.

出版信息

BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2164-10-S3-S14.

DOI:10.1186/1471-2164-10-S3-S14
PMID:19958477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2788366/
Abstract

BACKGROUND

Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predict functional modules, from which important inference or further investigation decision could be made. While the univocal determination of similarity metric is important, current practices are normally involved with Euclidean distance and Pearson correlation, of which assumptions are not likely the case for high-throughput microarray data.

RESULTS

We advocate the use of a novel metric - BayesGen - to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing genome-wide co-expression network, and clustering cancer human tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates as prior knowledge the global characteristics of the whole dataset. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates the possibility of meta-analysis.

CONCLUSION

BayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated through the results of the test on synthetic data.

摘要

背景

基于基因表达谱之间的相似性将基因分组到聚类中,一直是预测功能模块的主要方法,从中可以做出重要的推断或进一步的调查决策。虽然相似性度量的明确确定很重要,但目前的实践通常涉及欧几里得距离和皮尔逊相关系数,而这些假设不太可能适用于高通量微阵列数据。

结果

我们主张使用一种新的度量标准 - BayesGen - 来测量基因表达谱之间的相似性,并在两个重要应用中展示其性能:构建全基因组共表达网络,以及将人类癌症组织聚类成亚型。BayesGen 被表述为关于给定基因对生成机制的两个替代假设之间的证据比,并将整个数据集的全局特征作为先验知识纳入其中。通过对预期强度水平和噪声方差的联合建模,它解决了不同微阵列值范围内噪声水平的固有非线性和相关性。全贝叶斯公式也为荟萃分析提供了可能性。

结论

BayesGen 允许从微阵列表达数据中更有效地提取基因之间的相似性信息,这对各种推断任务都有显著影响。它还为其他对象特征数据提供了稳健的选择,如通过对合成数据的测试结果所示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/15917fe55b39/1471-2164-10-S3-S14-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/e266c4d08af3/1471-2164-10-S3-S14-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/2a9612839b12/1471-2164-10-S3-S14-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/15917fe55b39/1471-2164-10-S3-S14-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/e266c4d08af3/1471-2164-10-S3-S14-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/2a9612839b12/1471-2164-10-S3-S14-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84ee/2788366/15917fe55b39/1471-2164-10-S3-S14-3.jpg

相似文献

1
Measuring similarity between gene expression profiles: a Bayesian approach.测量基因表达谱之间的相似性:贝叶斯方法。
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S14. doi: 10.1186/1471-2164-10-S3-S14.
2
Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications.置信差异准则:一种新的贝叶斯差异表达基因选择算法及其应用
BMC Bioinformatics. 2015 Aug 7;16:245. doi: 10.1186/s12859-015-0664-3.
3
Metric for measuring the effectiveness of clustering of DNA microarray expression.用于测量 DNA 微阵列表达聚类有效性的度量。
BMC Bioinformatics. 2006 Sep 6;7 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2105-7-S2-S5.
4
Relevant and significant supervised gene clusters for microarray cancer classification.与微阵列癌症分类相关且重要的有监督基因簇。
IEEE Trans Nanobioscience. 2012 Jun;11(2):161-8. doi: 10.1109/TNB.2012.2193590. Epub 2012 Apr 27.
5
A semi-parametric Bayesian model for unsupervised differential co-expression analysis.一种用于无监督差异共表达分析的半参数贝叶斯模型。
BMC Bioinformatics. 2010 May 7;11:234. doi: 10.1186/1471-2105-11-234.
6
Bayesian infinite mixture model based clustering of gene expression profiles.基于贝叶斯无限混合模型的基因表达谱聚类
Bioinformatics. 2002 Sep;18(9):1194-206. doi: 10.1093/bioinformatics/18.9.1194.
7
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.使用收缩相关系数对重复微阵列进行全基因组规模的聚类分析。
BMC Bioinformatics. 2008 Jun 18;9:288. doi: 10.1186/1471-2105-9-288.
8
Bayesian robust inference for differential gene expression in microarrays with multiple samples.用于多个样本微阵列中差异基因表达的贝叶斯稳健推断。
Biometrics. 2006 Mar;62(1):10-8. doi: 10.1111/j.1541-0420.2005.00397.x.
9
Tissue-specific gene expression of head and neck squamous cell carcinoma in vivo by complementary DNA microarray analysis.通过互补DNA微阵列分析对头颈部鳞状细胞癌在体内的组织特异性基因表达进行研究。
Arch Otolaryngol Head Neck Surg. 2003 Jul;129(7):760-70. doi: 10.1001/archotol.129.7.760.
10
MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.MICRAT:一种使用时间序列基因表达数据推断基因调控网络的新算法。
BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1.

引用本文的文献

1
Detection of an anti-angina therapeutic module in the effective population treated by a multi-target drug Danhong injection: a randomized trial.丹红注射液治疗有效人群中抗心绞痛治疗模块的检测:一项随机试验。
Signal Transduct Target Ther. 2021 Sep 1;6(1):329. doi: 10.1038/s41392-021-00741-x.
2
Single cell analysis of adult mouse skeletal muscle stem cells in homeostatic and regenerative conditions.成体鼠骨骼肌干细胞在稳态和再生条件下的单细胞分析。
Development. 2019 Apr 11;146(12):dev174177. doi: 10.1242/dev.174177.
3
Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

本文引用的文献

1
Clustering of gene expression data based on shape similarity.基于形状相似性的基因表达数据聚类
EURASIP J Bioinform Syst Biol. 2009;2009(1):195712. doi: 10.1155/2009/195712. Epub 2009 Apr 23.
2
Bayes factors for genome-wide association studies: comparison with P-values.全基因组关联研究的贝叶斯因子:与P值的比较。
Genet Epidemiol. 2009 Jan;33(1):79-86. doi: 10.1002/gepi.20359.
3
Finding function: evaluation methods for functional genomic data.寻找功能:功能基因组数据的评估方法
利用模式生物对新测序物种的调控序列进行注释的统计方法。
PLoS One. 2012;7(9):e42489. doi: 10.1371/journal.pone.0042489. Epub 2012 Sep 11.
4
Comprehensive literature review and statistical considerations for microarray meta-analysis.综合文献回顾和微阵列荟萃分析的统计考虑。
Nucleic Acids Res. 2012 May;40(9):3785-99. doi: 10.1093/nar/gkr1265. Epub 2012 Jan 19.
5
Extending Asia Pacific bioinformatics into new realms in the "-omics" era.将亚太生物信息学拓展到“组学”时代的新领域。
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S1. doi: 10.1186/1471-2164-10-S3-S1.
BMC Genomics. 2006 Jul 25;7:187. doi: 10.1186/1471-2164-7-187.
4
Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks.系统调查揭示了基因共表达网络中“关联有罪”的普遍适用性。
BMC Bioinformatics. 2005 Sep 14;6:227. doi: 10.1186/1471-2105-6-227.
5
Exploratory Bayesian model selection for serial genetics data.用于序列遗传学数据的探索性贝叶斯模型选择
Biometrics. 2005 Jun;61(2):591-9. doi: 10.1111/j.1541-0420.2005.040417.x.
6
Bayesian model selection for genome-wide epistatic quantitative trait loci analysis.用于全基因组上位性数量性状基因座分析的贝叶斯模型选择
Genetics. 2005 Jul;170(3):1333-44. doi: 10.1534/genetics.104.040386. Epub 2005 May 23.
7
Molecular profiling of diffuse large B-cell lymphoma identifies robust subtypes including one characterized by host inflammatory response.弥漫性大B细胞淋巴瘤的分子谱分析确定了多种明确的亚型,其中一种以宿主炎症反应为特征。
Blood. 2005 Mar 1;105(5):1851-61. doi: 10.1182/blood-2004-07-2947. Epub 2004 Nov 18.
8
A gene-coexpression network for global discovery of conserved genetic modules.用于全面发现保守遗传模块的基因共表达网络。
Science. 2003 Oct 10;302(5643):249-55. doi: 10.1126/science.1087447. Epub 2003 Aug 21.
9
Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae.酿酒酵母mRNA翻译谱的全基因组分析。
Proc Natl Acad Sci U S A. 2003 Apr 1;100(7):3889-94. doi: 10.1073/pnas.0635171100. Epub 2003 Mar 26.
10
Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.通过基因表达谱分析对儿童急性淋巴细胞白血病进行分类、亚型发现及预后预测。
Cancer Cell. 2002 Mar;1(2):133-43. doi: 10.1016/s1535-6108(02)00032-6.