BayGO: Bayesian analysis of ontology term enrichment in microarray data.

Author information

Vêncio Ricardo Z N, Koide Tie, Gomes Suely L, Pereira Carlos A de B

Affiliation

BIOINFO, Universidade de São Paulo, 05508-090 São Paulo, Brazil.

Publication information

BMC Bioinformatics. 2006 Feb 23;7:86. doi: 10.1186/1471-2105-7-86.

DOI:10.1186/1471-2105-7-86

PMID:16504085

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1440873/

Abstract

BACKGROUND

The search for enriched (aka over-represented or enhanced) ontology terms in a list of genes obtained from microarray experiments is becoming a standard procedure for system-level analysis. This procedure summarizes the information by focusing on classification schemes such as Gene Ontology and KEGG pathways rather than on individual genes. Although it is well known in statistics that association and significance are distinct concepts, only the latter (significance) has been used to address the ontology term enrichment problem.

RESULTS

BayGO implements a Bayesian approach to search for enriched terms in microarray data. The R source code is freely available at http://blasto.iq.usp.br/~tkoide/BayGO in three versions: a Linux version, which can be easily incorporated into existing pipelines; a Windows version, to be controlled interactively; and a web tool. The software was validated using a bacterial heat shock response dataset, since this stress triggers known system-level responses.

CONCLUSION

The Bayesian model accounts for the fact that not all the genes from a given category may be observable in microarray data, because of low-intensity signals, quality filters, genes that were not spotted, and so on. Moreover, BayGO allows one to measure the statistical association between generic ontology terms and differential expression, instead of working only with the common significance analysis.

Similar articles

SpotWhatR: a user-friendly microarray data analysis system.

Genet Mol Res. 2006 Mar 31;5(1):93-107.

Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments.

BMC Bioinformatics. 2006 Dec 19;7:538. doi: 10.1186/1471-2105-7-538.

STARNET 2: a web-based tool for accelerating discovery of gene regulatory networks using microarray co-expression data.

BMC Bioinformatics. 2009 Oct 14;10:332. doi: 10.1186/1471-2105-10-332.

OntologyWidget - a reusable, embeddable widget for easily locating ontology terms.

BMC Bioinformatics. 2007 Sep 13;8:338. doi: 10.1186/1471-2105-8-338.

The Neural/Immune Gene Ontology: clipping the Gene Ontology for neurological and immunological systems.

BMC Bioinformatics. 2010 Sep 12;11:458. doi: 10.1186/1471-2105-11-458.

Bayesian assignment of gene ontology terms to gene expression experiments.

Bioinformatics. 2012 Sep 15;28(18):i603-i610. doi: 10.1093/bioinformatics/bts405.

GO::TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes.

Bioinformatics. 2004 Dec 12;20(18):3710-5. doi: 10.1093/bioinformatics/bth456. Epub 2004 Aug 5.

A Bayesian extension of the hypergeometric test for functional enrichment analysis.

Biometrics. 2014 Mar;70(1):84-94. doi: 10.1111/biom.12122. Epub 2013 Dec 9.

VAMPIRE microarray suite: a web-based platform for the interpretation of gene expression data.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W627-32. doi: 10.1093/nar/gki443.

Cited by

The Antidepressant Sertraline Affects Cell Signaling and Metabolism in .

J Fungi (Basel). 2023 Feb 20;9(2):275. doi: 10.3390/jof9020275.

Integrative Analysis of Next-Generation Sequencing for Next-Generation Cancer Research toward Artificial Intelligence.

Cancers (Basel). 2021 Jun 24;13(13):3148. doi: 10.3390/cancers13133148.

StuA-Regulated Processes in the Dermatophyte : Transcription Profile, Cell-Cell Adhesion, and Immunomodulation.

Front Cell Infect Microbiol. 2021 Jun 8;11:643659. doi: 10.3389/fcimb.2021.643659. eCollection 2021.

The Transcriptional Profile of Co-Cultured with Human Keratinocytes Shows New Insights about Gene Modulation by Terbinafine.

Pathogens. 2019 Nov 29;8(4):274. doi: 10.3390/pathogens8040274.

Global Analysis of Cell Wall Genes Revealed Putative Virulence Factors in the Dermatophyte .

Front Microbiol. 2019 Sep 19;10:2168. doi: 10.3389/fmicb.2019.02168. eCollection 2019.

The pH Signaling Transcription Factor PAC-3 Regulates Metabolic and Developmental Processes in Pathogenic Fungi.

Front Microbiol. 2019 Sep 4;10:2076. doi: 10.3389/fmicb.2019.02076. eCollection 2019.

Trans-chalcone activity against Trichophyton rubrum relies on an interplay between signaling pathways related to cell wall integrity and fatty acid metabolism.

BMC Genomics. 2019 May 22;20(1):411. doi: 10.1186/s12864-019-5792-0.

Dual RNA-Seq Analysis of and HaCat Keratinocyte Co-Culture Highlights Important Genes for Fungal-Host Interaction.

Genes (Basel). 2018 Jul 19;9(7):362. doi: 10.3390/genes9070362.

mus-52 disruption and metabolic regulation in Neurospora crassa: Transcriptional responses to extracellular phosphate availability.

PLoS One. 2018 Apr 18;13(4):e0195871. doi: 10.1371/journal.pone.0195871. eCollection 2018.

Transcriptome-wide survey of gene expression changes and alternative splicing in Trichophyton rubrum in response to undecanoic acid.

Sci Rep. 2018 Feb 6;8(1):2520. doi: 10.1038/s41598-018-20738-x.
