Causal inference in genetic trio studies.

Affiliations

Department of Statistics, Stanford University, Stanford, CA 94305;

Department of Data Sciences and Operations, Marshall School of Business, University of Southern California, Los Angeles, CA 90089.

Publication information

Proc Natl Acad Sci U S A. 2020 Sep 29;117(39):24117-24126. doi: 10.1073/pnas.2007743117. Epub 2020 Sep 18.

DOI: 10.1073/pnas.2007743117

PMID: 32948695

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7533659/

Abstract

We introduce a method to draw causal inferences, that is, inferences immune to all possible confounding, from genetic data that include parents and offspring. Causal conclusions are possible with these data because the natural randomness in meiosis can be viewed as a high-dimensional randomized experiment. We make this observation actionable by developing a conditional independence test that identifies regions of the genome containing distinct causal variants. The proposed digital twin test compares an observed offspring to carefully constructed synthetic offspring from the same parents to determine statistical significance, and it can leverage any black-box multivariate model and additional nontrio genetic data to increase power. Crucially, our inferences are based only on a well-established mathematical model of recombination and make no assumptions about the relationship between the genotypes and phenotypes. We compare our method to the widely used transmission disequilibrium test and demonstrate enhanced power and localization.
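The comparison of observed offspring with synthetic offspring from the same parents can be illustrated with a small randomization test. The sketch below is not the authors' implementation; it is a simplified illustration under several assumptions that do not come from the abstract: parental haplotypes are phased 0/1 arrays, recombination is modeled with a single uniform switch probability between adjacent sites (`recomb_prob`), the whole chromosome is resampled rather than only a region conditional on the rest of the genome, and the test statistic (`abs_cov_stat`) is a simple stand-in for the "black-box multivariate model". All function and parameter names are hypothetical.

```python
import numpy as np

def simulate_gamete(hap_a, hap_b, recomb_prob, rng):
    """Draw one gamete from a parent's two haplotypes (0/1 arrays of equal length).

    Assumed toy meiosis model: start on a random haplotype and switch to the
    other one with probability recomb_prob between each pair of adjacent sites.
    """
    n_sites = hap_a.shape[0]
    switches = rng.random(n_sites - 1) < recomb_prob
    start = rng.integers(0, 2)
    # source[j] says which parental haplotype is copied at site j.
    source = (start + np.concatenate(([0], np.cumsum(switches)))) % 2
    return np.where(source == 0, hap_a, hap_b)

def synthetic_offspring(mother, father, recomb_prob, rng):
    """Combine one gamete per parent; each parent is an array of shape (2, n_sites)."""
    return (simulate_gamete(mother[0], mother[1], recomb_prob, rng)
            + simulate_gamete(father[0], father[1], recomb_prob, rng))

def abs_cov_stat(genotypes, phenotype):
    """Placeholder statistic: sum over sites of |centered genotype . centered phenotype|."""
    g = genotypes - genotypes.mean(axis=0)
    y = phenotype - phenotype.mean()
    return float(np.abs(g.T @ y).sum())

def twin_test_pvalue(offspring, mothers, fathers, phenotype, region, stat,
                     n_twins=200, recomb_prob=0.01, seed=0):
    """Randomization p-value comparing observed offspring with synthetic ones.

    offspring: (n_trios, n_sites) genotypes in {0, 1, 2}
    mothers, fathers: (n_trios, 2, n_sites) phased haplotypes
    region: slice selecting the sites scored by the statistic
    """
    rng = np.random.default_rng(seed)
    t_obs = stat(offspring[:, region], phenotype)
    exceed = 0
    for _ in range(n_twins):
        # Regenerate every offspring from its own parents.
        twins = np.stack([synthetic_offspring(m, f, recomb_prob, rng)
                          for m, f in zip(mothers, fathers)])
        if stat(twins[:, region], phenotype) >= t_obs:
            exceed += 1
    # The +1 terms count the observed data as one of the draws.
    return (1 + exceed) / (1 + n_twins)
```

Because the synthetic offspring are drawn from the parents alone, the reference distribution depends only on the assumed recombination model and not on how genotypes relate to phenotypes, which is the property the abstract emphasizes. Resampling the whole chromosome, as done here, only tests whether the phenotype depends on the offspring genotype at all given the parents; localizing a signal to a region would additionally require conditioning on the offspring's genome outside that region.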

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1513/7533659/d581c7dddff3/pnas.2007743117fig01.jpg
