Bayesian assignment of gene ontology terms to gene expression experiments.

Affiliation

Department of Biotechnology, BOKU University, Muthgasse 18, 1190 Vienna.

Publication information

Bioinformatics. 2012 Sep 15;28(18):i603-i610. doi: 10.1093/bioinformatics/bts405.

DOI:10.1093/bioinformatics/bts405

PMID:22962488

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3436832/

Abstract

MOTIVATION

Gene expression assays allow for genome-scale analyses of molecular biological mechanisms. State-of-the-art data analysis provides lists of involved genes, either by calculating significance levels of mRNA abundance or by Bayesian assessments of gene activity. A common problem with such approaches is the difficulty of interpreting the biological implications of the resulting gene lists. This has led to increased interest in methods for inferring high-level biological information. A common way to represent high-level information is to infer gene ontology (GO) terms that may be attributed to the expression experiment.

RESULTS

This article proposes a probabilistic model for GO term inference. The model assumes that gene annotations to GO terms are available and that gene involvement in an experiment is represented by posterior probabilities over gene-specific indicator variables. Such probability measures result from many Bayesian approaches to expression data analysis. The proposed model combines these indicator probabilities in a probabilistic fashion and provides a probabilistic GO term assignment as a result. Experiments on synthetic and microarray data suggest that the advantages of the proposed probabilistic GO term inference over statistical test-based approaches are particularly evident for sparsely annotated GO terms and in situations of large uncertainty about gene activity. Provided that appropriate annotations exist, the proposed approach is easily applied to inferring other high-level assignments such as pathways.
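To make the idea of combining per-gene indicator probabilities concrete, the following Python sketch shows one simple way such a combination could look. It is not the model from the paper, whose details the abstract does not give. It assumes, purely for illustration, that gene indicators are independent given the data and scores each term by the probability that at least `min_active` of its annotated genes are active (a Poisson-binomial tail). The function names, the term labels `GO:A` and `GO:B`, and the toy probabilities are all hypothetical.

```python
from typing import Dict, List, Set


def poisson_binomial_pmf(probs: List[float]) -> List[float]:
    """Return P(exactly j of the independent indicators equal 1), j = 0..n."""
    pmf = [1.0]  # with zero genes, "0 active" has probability 1
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for j, mass in enumerate(pmf):
            nxt[j] += mass * (1.0 - p)  # this gene is inactive
            nxt[j + 1] += mass * p      # this gene is active
        pmf = nxt
    return pmf


def term_activity(
    annotations: Dict[str, Set[str]],
    gene_post: Dict[str, float],
    min_active: int = 1,
) -> Dict[str, float]:
    """Per term: P(at least `min_active` annotated genes are active).

    Assumption (illustrative only): indicators are independent.
    Genes without a posterior probability are ignored.
    """
    scores = {}
    for term, genes in annotations.items():
        probs = [gene_post[g] for g in sorted(genes) if g in gene_post]
        pmf = poisson_binomial_pmf(probs)
        scores[term] = sum(pmf[min_active:])
    return scores


# Hypothetical toy input: posterior activity probabilities for four genes
# and two made-up terms with two annotated genes each.
gene_post = {"g1": 0.9, "g2": 0.5, "g3": 0.1, "g4": 0.2}
annotations = {"GO:A": {"g1", "g2"}, "GO:B": {"g3", "g4"}}
scores = term_activity(annotations, gene_post)
```

Because the score is computed from the full posterior probabilities rather than from a thresholded gene list, a term with only two annotated genes still receives a graded value. This is one intuition for why probabilistic scoring may behave differently from count-based tests on sparsely annotated terms.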

AVAILABILITY

Source code under GPL license is available from the author.

CONTACT

peter.sykacek@boku.ac.at.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8af0/3436832/0e1f762d3957/bts405f1.jpg
