A graphical model method for integrating multiple sources of genome-scale data.

Author information

Dvorkin Daniel, Biehs Brian, Kechris Katerina

Affiliation

Computational Bioscience Program, University of Colorado School of Medicine, 12801 E. 17th Ave., Aurora, CO 80045–0511, USA.

Publication information

Stat Appl Genet Mol Biol. 2013 Aug;12(4):469-87. doi: 10.1515/sagmb-2012-0051.

Abstract

Making effective use of multiple data sources is a major challenge in modern bioinformatics. Genome-wide data, such as measures of transcription factor binding, gene expression, and sequence conservation, are used to identify binding regions and genes that are important to major biological processes such as development and disease. These heterogeneous data types can be difficult to use together because they differ in biological meaning and statistical distribution, yet each provides valuable information for understanding the processes under study. Here we present methods for integrating multiple data sources to gain a more complete picture of gene regulation and expression. Our goal is to identify genes and cis-regulatory regions that play specific biological roles. We describe a graphical mixture model approach for data integration, examine the effect of using different model topologies, and discuss methods for evaluating the effectiveness of the models. Model fitting is computationally efficient and produces results that have clear biological and statistical interpretations. The Hedgehog and Dorsal signaling pathways in Drosophila, which are critical in embryonic development, are used as examples.
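As a rough illustration of the kind of mixture-model integration the abstract describes, the sketch below fits a two-state model by expectation-maximization. It is not the authors' implementation, and the abstract does not specify their model details. The assumptions here are ours: each gene has one hidden binary state (target or background), each data source contributes one Gaussian-distributed score, and the sources are conditionally independent given the hidden state, which is only the simplest of the possible topologies. All function and variable names are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_target(data, prior, means, sds):
    """E-step: P(hidden state = 1 | all scores) for every gene."""
    post = []
    for row in data:
        like = [1.0 - prior, prior]
        for j, x in enumerate(row):
            for c in (0, 1):
                # Assumed topology: sources are independent given the state,
                # so per-source likelihoods simply multiply.
                like[c] *= normal_pdf(x, means[j][c], sds[j][c])
        total = like[0] + like[1]
        post.append(like[1] / total if total > 0.0 else prior)
    return post


def fit_integrative_mixture(data, n_iter=100):
    """Fit the two-state mixture by EM.

    data: one row per gene, one score per data source.
    Returns (posteriors, prior, means, sds); means[j][c] is the mean of
    source j in state c, with state 1 initialised as the high-scoring state.
    """
    n, k = len(data), len(data[0])
    prior, means, sds = 0.5, [], []
    for j in range(k):
        col = sorted(row[j] for row in data)
        mu = sum(col) / n
        sd = math.sqrt(sum((x - mu) ** 2 for x in col) / n) or 1.0
        means.append([col[n // 4], col[(3 * n) // 4]])  # quartile start values
        sds.append([sd, sd])
    for _ in range(n_iter):
        post = posterior_target(data, prior, means, sds)
        # M-step: weighted updates of the mixing proportion and Gaussians.
        prior = sum(post) / n
        for c in (0, 1):
            weights = [p if c == 1 else 1.0 - p for p in post]
            wsum = sum(weights)
            if wsum < 1e-9:
                continue  # component is empty; keep its old parameters
            for j in range(k):
                mu = sum(w * row[j] for w, row in zip(weights, data)) / wsum
                var = sum(w * (row[j] - mu) ** 2
                          for w, row in zip(weights, data)) / wsum
                means[j][c] = mu
                sds[j][c] = max(math.sqrt(var), 1e-3)  # avoid zero variance
    return posterior_target(data, prior, means, sds), prior, means, sds


def simulate_scores(n_genes=300, n_targets=60, n_sources=2, shift=3.0):
    """Synthetic data only: targets score higher in every source."""
    truth = [i < n_targets for i in range(n_genes)]
    data = [[random.gauss(shift if t else 0.0, 1.0) for _ in range(n_sources)]
            for t in truth]
    return data, truth


if __name__ == "__main__":
    random.seed(0)
    data, truth = simulate_scores()
    post, prior, means, sds = fit_integrative_mixture(data)
    print("estimated target fraction:", round(prior, 3))
```

The posterior probability for each gene combines evidence from all sources into a single, interpretable score. Changing the model topology would amount to replacing the independence assumption in `posterior_target` with a different factorization of the joint likelihood.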
