识别分子网络图形模型中的显著边缘。

Identifying significant edges in graphical models of molecular networks.

机构信息

Genetics Institute, University College London, Darwin Building, Gower Street, WC1E 6BT London, UK.

出版信息

Artif Intell Med. 2013 Mar;57(3):207-17. doi: 10.1016/j.artmed.2012.12.006. Epub 2013 Feb 8.

DOI:10.1016/j.artmed.2012.12.006

PMID:23395009

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4070079/

Abstract

OBJECTIVE

Modelling the associations from high-throughput experimental molecular data has provided unprecedented insights into biological pathways and signalling mechanisms. Graphical models and networks have especially proven to be useful abstractions in this regard. Ad hoc thresholds are often used in conjunction with structure learning algorithms to determine significant associations. The present study overcomes this limitation by proposing a statistically motivated approach for identifying significant associations in a network.

METHODS AND MATERIALS

A new method that identifies significant associations in graphical models by estimating the threshold minimising the L1 norm between the cumulative distribution function (CDF) of the observed edge confidences and those of its asymptotic counterpart is proposed. The effectiveness of the proposed method is demonstrated on popular synthetic data sets as well as publicly available experimental molecular data corresponding to gene and protein expression profiles.

RESULTS

The improved performance of the proposed approach is demonstrated across the synthetic data sets using sensitivity, specificity and accuracy as performance metrics. The results are also demonstrated across varying sample sizes and three different structure learning algorithms with widely varying assumptions. In all cases, the proposed approach has specificity and accuracy close to 1, while sensitivity increases linearly in the logarithm of the sample size. The estimated threshold systematically outperforms common ad hoc ones in terms of sensitivity while maintaining comparable levels of specificity and accuracy. Networks from experimental data sets are reconstructed accurately with respect to the results from the original papers.

CONCLUSION

Current studies use structure learning algorithms in conjunction with ad hoc thresholds for identifying significant associations in graphical abstractions of biological pathways and signalling mechanisms. Such an ad hoc choice can have pronounced effect on attributing biological significance to the associations in the resulting network and possible downstream analysis. The statistically motivated approach presented in this study has been shown to outperform ad hoc thresholds and is expected to alleviate spurious conclusions of significant associations in such graphical abstractions.

摘要

目的

通过对高通量实验分子数据进行建模，可以深入了解生物途径和信号机制。图形模型和网络在这方面尤其被证明是有用的抽象。在结构学习算法中经常结合使用特定的阈值来确定显著的关联。本研究通过提出一种在网络中识别显著关联的统计驱动方法克服了这一局限性。

方法和材料

提出了一种新的方法，通过估计最小化观察到的边缘置信度累积分布函数（CDF）与渐近对应物之间的 L1 范数的阈值来识别图形模型中的显著关联。在所提出的方法的有效性在流行的合成数据集以及与基因和蛋白质表达谱相对应的公开可用的实验分子数据上得到了证明。

结果

在所提出的方法的有效性在流行的合成数据集以及与基因和蛋白质表达谱相对应的公开可用的实验分子数据上得到了证明。在所提出的方法的有效性在流行的合成数据集以及与基因和蛋白质表达谱相对应的公开可用的实验分子数据上得到了证明。在所提出的方法的有效性在流行的合成数据集以及与基因和蛋白质表达谱相对应的公开可用的实验分子数据上得到了证明。使用灵敏度、特异性和准确性作为性能指标，在合成数据集上证明了所提出的方法的改进性能。还使用广泛不同的假设的三种不同的结构学习算法，在不同的样本大小下证明了结果。在所有情况下，所提出的方法的特异性和准确性接近 1，而灵敏度随着样本大小的对数线性增加。在所提出的方法中，所估计的阈值在保持类似的特异性和准确性水平的同时，在灵敏度方面系统地优于常见的特定阈值。实验数据集的网络相对于原始论文的结果准确地重建。

结论

当前的研究使用结构学习算法和特定的阈值来识别生物途径和信号机制的图形抽象中的显著关联。这种特定的选择可能会对归因于网络中关联的生物学意义以及可能的下游分析产生显著影响。本研究中提出的统计驱动方法已被证明优于特定的阈值，并有望减轻此类图形抽象中显著关联的虚假结论。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/97a0/4070079/de0628ee4f21/gr1.jpg

相似文献

Identifying significant edges in graphical models of molecular networks.

Artif Intell Med. 2013 Mar;57(3):207-17. doi: 10.1016/j.artmed.2012.12.006. Epub 2013 Feb 8.

Impact of noise on molecular network inference.

PLoS One. 2013 Dec 5;8(12):e80735. doi: 10.1371/journal.pone.0080735. eCollection 2013.

Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical gaussian models and bayesian networks.

Bioinformatics. 2006 Oct 15;22(20):2523-31. doi: 10.1093/bioinformatics/btl391. Epub 2006 Jul 14.

A linear programming approach for estimating the structure of a sparse linear genetic network from transcript profiling data.

Algorithms Mol Biol. 2009 Feb 24;4:5. doi: 10.1186/1748-7188-4-5.

Structural identifiability of cyclic graphical models of biological networks with latent variables.

BMC Syst Biol. 2016 Jun 13;10(1):41. doi: 10.1186/s12918-016-0287-y.

An empirical Bayes approach to inferring large-scale gene association networks.

Bioinformatics. 2005 Mar;21(6):754-64. doi: 10.1093/bioinformatics/bti062. Epub 2004 Oct 12.

GraphAlignment: Bayesian pairwise alignment of biological networks.

BMC Syst Biol. 2012 Nov 21;6:144. doi: 10.1186/1752-0509-6-144.

Development and use of a Cytoscape app for GRNCOP2.

Comput Methods Programs Biomed. 2019 Aug;177:211-218. doi: 10.1016/j.cmpb.2019.05.030. Epub 2019 Jun 4.

Condition-adaptive fused graphical lasso (CFGL): An adaptive procedure for inferring condition-specific gene co-expression network.

PLoS Comput Biol. 2018 Sep 21;14(9):e1006436. doi: 10.1371/journal.pcbi.1006436. eCollection 2018 Sep.

Bayesian inference of hub nodes across multiple networks.

Biometrics. 2019 Mar;75(1):172-182. doi: 10.1111/biom.12958. Epub 2018 Aug 23.

引用本文的文献

Gender differences in the interactions and inferred causal relationships between risk factors and gaming disorder: a network and structural equation modeling approach.

Eur Arch Psychiatry Clin Neurosci. 2025 Aug 14. doi: 10.1007/s00406-025-02075-z.

Network relationships among depressive symptoms, sleep quality, and frailty in Chinese older adults: an undirected and bayesian network analysis.

BMC Geriatr. 2025 Aug 13;25(1):619. doi: 10.1186/s12877-025-06273-1.

Depression and anxiety symptoms associated with internet addiction and non-suicidal self-injury in Chinese adolescent students - a network analysis.

BMC Psychiatry. 2025 Jul 29;25(1):731. doi: 10.1186/s12888-025-07131-5.

Bayesian network imputation methods applied to multi-omics data identify putative causal relationships in a type 2 diabetes dataset containing incomplete data: An IMI DIRECT Study.

PLoS Genet. 2025 Jul 15;21(7):e1011776. doi: 10.1371/journal.pgen.1011776. eCollection 2025 Jul.

The network and interactive pattern of social adjustment and psychological symptoms in patients with spinal cord injury: a network analysis.

BMC Psychol. 2025 Jul 11;13(1):774. doi: 10.1186/s40359-025-03105-0.

Large scale causal modeling to identify adults at risk for combined and common variable immunodeficiencies.

NPJ Digit Med. 2025 Jun 14;8(1):361. doi: 10.1038/s41746-025-01761-5.

Mapping Connection and Direction Among Symptoms of Sleep Disturbance and Perceived Stress in Firefighters: Embracing the Network Analysis Perspective.

Nat Sci Sleep. 2025 Jun 4;17:1143-1162. doi: 10.2147/NSS.S517178. eCollection 2025.

Risk Factors and Consequences of Parental Burnout: Role of Early Maladaptive Schemas and Emotion-Focused Coping.

Trends Psychol. 2023 Apr 7:1-18. doi: 10.1007/s43076-023-00288-6.

The associations between PTSD symptoms, intolerance of uncertainty and personal growth initiative in trauma-exposed children and adolescents: undirected and Bayesian network analyses.

Eur Child Adolesc Psychiatry. 2025 Jun 2. doi: 10.1007/s00787-025-02761-2.

Unraveling the impact of cyberporn motivations on mental health: insights from Chinese college students.

BMC Psychol. 2025 May 26;13(1):562. doi: 10.1186/s40359-025-02901-y.

本文引用的文献

Efficient Markov Network Structure Discovery Using Independence Tests.

J Artif Intell Res. 2009 May;35(1):449-484.

Functional relationships between genes associated with differentiation potential of aged myogenic progenitors.

Front Physiol. 2010 Sep 9;1:21. doi: 10.3389/fphys.2010.00021. eCollection 2010.

Causal protein-signaling networks derived from multiparameter single-cell data.

Science. 2005 Apr 22;308(5721):523-9. doi: 10.1126/science.1105809.

Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks.

Bioinformatics. 2003 Nov 22;19(17):2271-82. doi: 10.1093/bioinformatics/btg313.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

识别分子网络图形模型中的显著边缘。

Identifying significant edges in graphical models of molecular networks.

机构信息

Genetics Institute, University College London, Darwin Building, Gower Street, WC1E 6BT London, UK.