• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估图形高斯模型的有效性域,以便推断复杂生物系统各组成部分之间的关系。

Assessing the validity domains of graphical Gaussian models in order to infer relationships among components of complex biological systems.

作者信息

Villers Fanny, Schaeffer Brigitte, Bertin Caroline, Huet Sylvie

机构信息

INRA Jouy-En-Josas.

出版信息

Stat Appl Genet Mol Biol. 2008;7(1):Article 14. doi: 10.2202/1544-6115.1371. Epub 2008 Sep 11.

DOI:10.2202/1544-6115.1371
PMID:18976229
Abstract

The study of the interactions of cellular components is an essential base step to understand the structure and dynamics of biological networks. Various methods were recently developed for this purpose. While most of them combine different types of data and a priori knowledge, methods based on graphical Gaussian models are capable of learning the network directly from raw data. They consider the full-order partial correlations which are partial correlations between two variables given the remaining ones, for modeling direct links between variables. Statistical methods were developed for estimating these links when the number of observations is larger than the number of variables. However, the rapid advance of new technologies that allow the simultaneous measure of genome expression, led to large-scale datasets where the number of variables is far larger than the number of observations. To get around this dimensionality problem, different strategies and new statistical methods were proposed. In this study we focused on statistical methods recently published. All are based on the fact that the number of direct relationships between two variables is very small in regards to the number of possible relationships, p(p-1)/2. In the biological context, this assumption is not always satisfied over the whole graph. It is essential to precisely know the behavior of the methods in regards to the characteristics of the studied object before applying them. For this purpose, we evaluated the validity domain of each method from wide-ranging simulated datasets. We then illustrated our results using recently published biological data.

摘要

细胞成分相互作用的研究是理解生物网络结构和动态的重要基础步骤。最近为此目的开发了各种方法。虽然其中大多数方法结合了不同类型的数据和先验知识,但基于图形高斯模型的方法能够直接从原始数据中学习网络。它们考虑全阶偏相关性,即给定其余变量时两个变量之间的偏相关性,用于对变量之间的直接联系进行建模。当观测值的数量大于变量的数量时,开发了统计方法来估计这些联系。然而,允许同时测量基因组表达的新技术的迅速发展,导致了大规模数据集,其中变量的数量远远大于观测值的数量。为了解决这个维度问题,人们提出了不同的策略和新的统计方法。在本研究中,我们重点关注最近发表的统计方法。所有这些方法都基于这样一个事实,即两个变量之间的直接关系数量相对于可能关系的数量p(p - 1)/2来说非常少。在生物学背景下,在整个图上这个假设并不总是成立。在应用这些方法之前,准确了解它们相对于所研究对象特征的行为至关重要。为此,我们从广泛的模拟数据集中评估了每种方法的有效域。然后,我们使用最近发表的生物学数据说明了我们的结果。

相似文献

1
Assessing the validity domains of graphical Gaussian models in order to infer relationships among components of complex biological systems.评估图形高斯模型的有效性域,以便推断复杂生物系统各组成部分之间的关系。
Stat Appl Genet Mol Biol. 2008;7(1):Article 14. doi: 10.2202/1544-6115.1371. Epub 2008 Sep 11.
2
Weighted lasso in graphical Gaussian modeling for large gene network estimation based on microarray data.基于微阵列数据的大型基因网络估计的图形高斯建模中的加权套索法
Genome Inform. 2007;19:142-53.
3
Low-order conditional independence graphs for inferring genetic networks.用于推断遗传网络的低阶条件独立图。
Stat Appl Genet Mol Biol. 2006;5:Article1. doi: 10.2202/1544-6115.1170. Epub 2006 Jan 4.
4
Causal inference in biomolecular pathways using a Bayesian network approach and an Implicit method.使用贝叶斯网络方法和隐式方法进行生物分子途径中的因果推断。
J Theor Biol. 2008 Aug 21;253(4):717-24. doi: 10.1016/j.jtbi.2008.04.030. Epub 2008 May 4.
5
SysBioMed report: advancing systems biology for medical applications.系统生物医学报告:推动系统生物学在医学应用中的发展。
IET Syst Biol. 2009 May;3(3):131-6. doi: 10.1049/iet-syb.2009.0005.
6
Multiscale modeling of biological pattern formation.生物模式形成的多尺度建模
Curr Top Dev Biol. 2008;81:435-60. doi: 10.1016/S0070-2153(07)81015-5.
7
Extracting protein regulatory networks with graphical models.使用图形模型提取蛋白质调控网络。
Proteomics. 2007 Sep;7 Suppl 1:51-9. doi: 10.1002/pmic.200700466.
8
Motif distribution, dynamical properties, and computational performance of two data-based cortical microcircuit templates.两种基于数据的皮质微电路模板的基序分布、动力学特性及计算性能
J Physiol Paris. 2009 Jan-Mar;103(1-2):73-87. doi: 10.1016/j.jphysparis.2009.05.006. Epub 2009 Jun 11.
9
List-decoding methods for inferring polynomials in finite dynamical gene network models.有限动态基因网络模型中用于推断多项式的列表译码方法。
Bioinformatics. 2009 Jul 1;25(13):1686-93. doi: 10.1093/bioinformatics/btp281. Epub 2009 Apr 28.
10
Application of Gaussian processes for black-box modelling of biosystems.高斯过程在生物系统黑箱建模中的应用。
ISA Trans. 2007 Oct;46(4):443-57. doi: 10.1016/j.isatra.2007.04.001. Epub 2007 Jun 4.

引用本文的文献

1
Dietary networks identified by Gaussian graphical model and general and abdominal obesity in adults.基于高斯图形模型识别的膳食网络与成年人的总体肥胖和腹部肥胖
Nutr J. 2021 Oct 27;20(1):86. doi: 10.1186/s12937-021-00746-w.
2
Advances in dietary pattern analysis in nutritional epidemiology.膳食模式分析在营养流行病学中的进展。
Eur J Nutr. 2021 Dec;60(8):4115-4130. doi: 10.1007/s00394-021-02545-9. Epub 2021 Apr 25.
3
Integrated mRNA and miRNA expression profiling in blood reveals candidate biomarkers associated with endurance exercise in the horse.
血液中整合的mRNA和miRNA表达谱揭示了与马耐力运动相关的候选生物标志物。
Sci Rep. 2016 Mar 10;6:22932. doi: 10.1038/srep22932.
4
The structure of a gene co-expression network reveals biological functions underlying eQTLs.基因共表达网络的结构揭示了 eQTL 背后的生物学功能。
PLoS One. 2013;8(4):e60045. doi: 10.1371/journal.pone.0060045. Epub 2013 Apr 5.
5
Conceptual aspects of large meta-analyses with publicly available microarray data: a case study in oncology.利用公开可用的微阵列数据进行大型荟萃分析的概念性问题:肿瘤学案例研究
Bioinform Biol Insights. 2011 Jan 23;5:13-39. doi: 10.4137/BBI.S5537.