通过因果推断测试理清分子关系。

Disentangling molecular relationships with a causal inference test.

作者信息

Millstein Joshua, Zhang Bin, Zhu Jun, Schadt Eric E

机构信息

Genetics Department, Rosetta Inpharmatics, LLC, Seattle, Washington 98109, USA.

出版信息

BMC Genet. 2009 May 27;10:23. doi: 10.1186/1471-2156-10-23.

DOI:10.1186/1471-2156-10-23

PMID:19473544

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3224661/

Abstract

BACKGROUND

There has been intense effort over the past couple of decades to identify loci underlying quantitative traits as a key step in the process of elucidating the etiology of complex diseases. Recently there has been some effort to coalesce non-biased high-throughput data, e.g. high density genotyping and genome wide RNA expression, to drive understanding of the molecular basis of disease. However, a stumbling block has been the difficult question of how to leverage this information to identify molecular mechanisms that explain quantitative trait loci (QTL). We have developed a formal statistical hypothesis test, resulting in a p-value, to quantify uncertainty in a causal inference pertaining to a measured factor, e.g. a molecular species, which potentially mediates a known causal association between a locus and a quantitative trait.

RESULTS

We treat the causal inference as a 'chain' of mathematical conditions that must be satisfied to conclude that the potential mediator is causal for the trait, where the inference is only as good as the weakest link in the chain. P-values are computed for the component conditions, which include tests of linkage and conditional independence. The Intersection-Union Test, in which a series of statistical tests are combined to form an omnibus test, is then employed to generate the overall test result. Using computer simulated mouse crosses, we show that type I error is low under a variety of conditions that include hidden variables and reactive pathways. We show that power under a simple causal model is comparable to other model selection techniques as well as Bayesian network reconstruction methods. We further show empirically that this method compares favorably to Bayesian network reconstruction methods for reconstructing transcriptional regulatory networks in yeast, recovering 7 out of 8 experimentally validated regulators.

CONCLUSION

Here we propose a novel statistical framework in which existing notions of causal mediation are formalized into a hypothesis test, thus providing a standard quantitative measure of uncertainty in the form of a p-value. The method is theoretically and computationally accessible and with the provided software may prove a useful tool in disentangling molecular relationships.

摘要

背景

在过去几十年里，人们付出了巨大努力来确定复杂性状背后的基因座，这是阐明复杂疾病病因过程中的关键一步。最近，人们致力于整合无偏的高通量数据，如高密度基因分型和全基因组RNA表达数据，以推动对疾病分子基础的理解。然而，一个绊脚石是如何利用这些信息来识别解释数量性状基因座（QTL）的分子机制这一难题。我们开发了一种正式的统计假设检验，得出一个p值，以量化与一个测量因素（如一种分子物质）相关的因果推断中的不确定性，该因素可能介导一个基因座与一个数量性状之间已知的因果关联。

结果

我们将因果推断视为一系列必须满足的数学条件的“链条”，以便得出潜在的介导因素对该性状具有因果关系的结论，其中推断的可靠性取决于链条中最薄弱的环节。为组成条件计算p值，这些条件包括连锁和条件独立性检验。然后采用联合检验，将一系列统计检验组合起来形成一个综合检验，以生成总体检验结果。使用计算机模拟的小鼠杂交实验，我们表明在包括隐藏变量和反应途径在内的各种条件下，I型错误率较低。我们表明，在一个简单的因果模型下，该方法的功效与其他模型选择技术以及贝叶斯网络重建方法相当。我们进一步通过实验表明，在重建酵母转录调控网络方面，该方法优于贝叶斯网络重建方法，能够从8个经实验验证的调控因子中恢复7个。

结论

在此，我们提出了一个新颖的统计框架，其中现有的因果中介概念被形式化为一个假设检验，从而以p值的形式提供了一个标准的不确定性定量度量。该方法在理论和计算上都是可行的，并且借助所提供的软件可能会成为解开分子关系的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd23/3224661/76cc2e760c83/1471-2156-10-23-1.jpg

相似文献

Disentangling molecular relationships with a causal inference test.

BMC Genet. 2009 May 27;10:23. doi: 10.1186/1471-2156-10-23.

Uncovering the genetic landscape for multiple sleep-wake traits.

PLoS One. 2009;4(4):e5161. doi: 10.1371/journal.pone.0005161. Epub 2009 Apr 10.

Using stochastic causal trees to augment Bayesian networks for modeling eQTL datasets.

BMC Bioinformatics. 2011 Jan 6;12:7. doi: 10.1186/1471-2105-12-7.

A Bayesian framework for inference of the genotype-phenotype map for segregating populations.

Genetics. 2011 Apr;187(4):1163-70. doi: 10.1534/genetics.110.123273. Epub 2011 Jan 17.

Causal inference of regulator-target pairs by gene mapping of expression phenotypes.

BMC Genomics. 2006 May 24;7:125. doi: 10.1186/1471-2164-7-125.

Causal inference in biology networks with integrated belief propagation.

Pac Symp Biocomput. 2015:359-70.

Comparison between instrumental variable and mediation-based methods for reconstructing causal gene networks in yeast.

Mol Omics. 2021 Apr 1;17(2):241-251. doi: 10.1039/d0mo00140f. Epub 2021 Jan 13.

High-confidence discovery of genetic network regulators in expression quantitative trait loci data.

Genetics. 2011 Mar;187(3):955-64. doi: 10.1534/genetics.110.124685. Epub 2011 Jan 6.

Bayesian mapping of quantitative trait loci for complex binary traits.

Genetics. 2000 Jul;155(3):1391-403. doi: 10.1093/genetics/155.3.1391.

Identifying QTL for multiple complex traits in experimental crosses.

Methods Mol Biol. 2012;871:205-25. doi: 10.1007/978-1-61779-785-9_11.

引用本文的文献

Causal network inference of cis- and trans-gene regulation of expression quantitative trait loci across human tissues.

Genetics. 2025 Jun 4;230(2). doi: 10.1093/genetics/iyaf064.

Caregiver-child interaction and early childhood development among preschool children in rural China: the possible role of blood epigenome-wide DNA methylation.

BMC Genomics. 2025 Apr 1;26(1):329. doi: 10.1186/s12864-025-11406-2.

Predicting the genetic component of gene expression using gene regulatory networks.

Bioinform Adv. 2024 Nov 23;4(1):vbae180. doi: 10.1093/bioadv/vbae180. eCollection 2024.

Exposome-Wide Ranking to Uncover Environmental Chemicals Associated with Dyslipidemia: A Panel Study in Healthy Older Chinese Adults from the BAPE Study.

Environ Health Perspect. 2024 Sep;132(9):97005. doi: 10.1289/EHP13864. Epub 2024 Sep 6.

Single cell transcriptomes and multiscale networks from persons with and without Alzheimer's disease.

Nat Commun. 2024 Jul 10;15(1):5815. doi: 10.1038/s41467-024-49790-0.

Prediagnosis recognition of acute ischemic stroke by artificial intelligence from facial images.

Aging Cell. 2024 Aug;23(8):e14196. doi: 10.1111/acel.14196. Epub 2024 Jun 6.

Mediation role of DNA methylation in association between handgrip strength and cognitive function in monozygotic twins.

J Hum Genet. 2024 Aug;69(8):357-363. doi: 10.1038/s10038-024-01247-4. Epub 2024 Apr 23.

Omics-based construction of regulatory variants can be applied to help decipher pig liver-related traits.

Commun Biol. 2024 Mar 29;7(1):381. doi: 10.1038/s42003-024-06050-7.

Population-scale skeletal muscle single-nucleus multi-omic profiling reveals extensive context specific genetic regulation.

bioRxiv. 2024 Dec 17:2023.12.15.571696. doi: 10.1101/2023.12.15.571696.

Systems genetics approaches for understanding complex traits with relevance for human disease.

Elife. 2023 Nov 14;12:e91004. doi: 10.7554/eLife.91004.

本文引用的文献

A general framework for multiple testing dependence.

Proc Natl Acad Sci U S A. 2008 Dec 2;105(48):18718-23. doi: 10.1073/pnas.0808709105. Epub 2008 Nov 24.

Accurate discovery of expression quantitative trait loci under confounding from spurious and genuine regulatory hotspots.

Genetics. 2008 Dec;180(4):1909-25. doi: 10.1534/genetics.108.094201. Epub 2008 Sep 14.

Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks.

Nat Genet. 2008 Jul;40(7):854-61. doi: 10.1038/ng.167. Epub 2008 Jun 15.

Variations in DNA elucidate molecular networks that cause disease.

Nature. 2008 Mar 27;452(7186):429-35. doi: 10.1038/nature06757. Epub 2008 Mar 16.

Genetics of gene expression and its effect on disease.

Nature. 2008 Mar 27;452(7186):423-8. doi: 10.1038/nature06758. Epub 2008 Mar 16.

Harnessing naturally randomized transcription to infer regulatory relationships among genes.

Genome Biol. 2007;8(10):R219. doi: 10.1186/gb-2007-8-10-r219.

Mendelian randomization: using genes as instruments for making causal inferences in epidemiology.

Stat Med. 2008 Apr 15;27(8):1133-63. doi: 10.1002/sim.3034.

Mendelian randomization as an instrumental variable approach to causal inference.

Stat Methods Med Res. 2007 Aug;16(4):309-30. doi: 10.1177/0962280206077743.

Prediction of transcription factor binding sites using genetical genomics methods.

J Bioinform Comput Biol. 2007 Jun;5(3):773-93. doi: 10.1142/s0219720007002680.

Genetical genomics: use all data.

BMC Genomics. 2007 Mar 12;8:69. doi: 10.1186/1471-2164-8-69.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过因果推断测试理清分子关系。

Disentangling molecular relationships with a causal inference test.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献