Efficient estimation of SNP heritability using Gaussian predictive process in large scale cohort studies.

Affiliations

Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, Colorado, United States of America.

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America.

Publication information

PLoS Genet. 2022 Apr 20;18(4):e1010151. doi: 10.1371/journal.pgen.1010151. eCollection 2022 Apr.

DOI:10.1371/journal.pgen.1010151

PMID:35442943

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9060362/

Abstract

With the advent of high-throughput genetic data, there have been attempts to estimate heritability from genome-wide SNP data on a cohort of distantly related individuals using a linear mixed model (LMM). Fitting such an LMM in a large-scale cohort study, however, is tremendously challenging because of the high-dimensional linear algebraic operations it requires. In this paper, we propose a new method, PredLMM, that approximates this LMM and is motivated by the concepts of genetic coalescence and the Gaussian predictive process. PredLMM has substantially better computational complexity than most existing LMM-based methods and thus provides a fast alternative for estimating heritability in large-scale cohort studies. Theoretically, we show that under a model of genetic coalescence, the limiting form of our approximation is the celebrated predictive process approximation of large Gaussian process likelihoods, which has well-established accuracy standards. We illustrate our approach with extensive simulation studies and use it to estimate the heritability of multiple quantitative traits from the UK Biobank cohort.
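The abstract does not spell out the model, so the following is only an illustrative sketch under stated assumptions, not the authors' PredLMM implementation. It assumes the standard SNP-heritability LMM, y = Xβ + g + ε with g ~ N(0, σ_g² A) and ε ~ N(0, σ_e² I), where A is the genetic relationship matrix (GRM) and h² = σ_g² / (σ_g² + σ_e²). It then shows the generic predictive process idea: replace the n × n matrix A with the low-rank surrogate A[:, S] A[S, S]⁻¹ A[S, :] built from a subsample S of r "knot" individuals. The simulated genotypes, the random choice of knots, and the names `grm` and `predictive_process` are all hypothetical choices made for this example.

```python
import numpy as np


def grm(genotypes):
    """GRM A = Z Z^T / m from column-standardized genotypes (n individuals x m SNPs)."""
    genotypes = genotypes[:, genotypes.std(axis=0) > 0]  # drop monomorphic SNPs
    z = (genotypes - genotypes.mean(axis=0)) / genotypes.std(axis=0)
    return z @ z.T / z.shape[1]


def predictive_process(a, knots):
    """Low-rank surrogate A[:, S] A[S, S]^{-1} A[S, :] for the knot index set S."""
    a_ns = a[:, knots]                 # n x r cross-covariance with the knots
    a_ss = a[np.ix_(knots, knots)]     # r x r covariance among the knots
    return a_ns @ np.linalg.solve(a_ss, a_ns.T)


# Toy data: sizes and allele frequencies are illustrative assumptions only.
rng = np.random.default_rng(0)
n, m, r = 200, 1000, 50                # individuals, SNPs, knots
freqs = rng.uniform(0.1, 0.9, size=m)
genotypes = rng.binomial(2, freqs, size=(n, m)).astype(float)

a = grm(genotypes)
knots = rng.choice(n, size=r, replace=False)  # random subsample as knots (assumption)
a_low = predictive_process(a, knots)

# The surrogate reproduces A exactly on the knot block and only needs an r x r solve.
rel_err = np.linalg.norm(a - a_low) / np.linalg.norm(a)
print(f"relative Frobenius error of the rank-{r} surrogate: {rel_err:.3f}")
```

The point of the construction is that only an r × r system is solved instead of an n × n one. In Gaussian likelihoods with a low-rank-plus-diagonal covariance, this kind of structure typically lets the Woodbury identity reduce the cost from roughly O(n³) to O(n r²). How PredLMM selects the subsample and corrects the residual variance is described in the paper itself and is not reproduced here.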

Similar articles

Fast heritability estimation based on MINQUE and batch training.

Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac115.

Estimating SNP heritability in presence of population substructure in biobank-scale datasets.

Genetics. 2022 Apr 4;220(4). doi: 10.1093/genetics/iyac015.

A scalable estimator of SNP heritability for biobank-scale data.

Bioinformatics. 2018 Jul 1;34(13):i187-i194. doi: 10.1093/bioinformatics/bty253.

Warped linear mixed models for the genetic analysis of transformed phenotypes.

Nat Commun. 2014 Sep 19;5:4890. doi: 10.1038/ncomms5890.

Genome-wide barebones regression scan for mixed-model association analysis.

Theor Appl Genet. 2020 Jan;133(1):51-58. doi: 10.1007/s00122-019-03439-5. Epub 2019 Sep 24.

Leveraging LD eigenvalue regression to improve the estimation of SNP heritability and confounding inflation.

Am J Hum Genet. 2022 May 5;109(5):802-811. doi: 10.1016/j.ajhg.2022.03.013. Epub 2022 Apr 13.

Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture.

Nat Genet. 2019 Aug;51(8):1244-1251. doi: 10.1038/s41588-019-0465-0. Epub 2019 Jul 29.

Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.

PLoS One. 2015 Jun 30;10(6):e0131106. doi: 10.1371/journal.pone.0131106. eCollection 2015.

SumHer better estimates the SNP heritability of complex traits from summary statistics.

Nat Genet. 2019 Feb;51(2):277-284. doi: 10.1038/s41588-018-0279-5. Epub 2018 Dec 3.

Cited by

SMASH: Scalable Method for Analyzing Spatial Heterogeneity of genes in spatial transcriptomics data.

PLoS Genet. 2023 Oct 20;19(10):e1010983. doi: 10.1371/journal.pgen.1010983. eCollection 2023 Oct.

Strong Genetic Overlaps Between Dimensional and Categorical Models of Bipolar Disorders in a Family Sample.

medRxiv. 2024 Mar 26:2023.06.24.23291169. doi: 10.1101/2023.06.24.23291169.

DenVar: density-based variation analysis of multiplex imaging data.

Bioinform Adv. 2022 May 23;2(1):vbac039. doi: 10.1093/bioadv/vbac039. eCollection 2022.

Comparing heritability estimators under alternative structures of linkage disequilibrium.

G3 (Bethesda). 2022 Jul 29;12(8). doi: 10.1093/g3journal/jkac134.

References

Estimating SNP heritability in presence of population substructure in biobank-scale datasets.

Genetics. 2022 Apr 4;220(4). doi: 10.1093/genetics/iyac015.

Improved genetic prediction of complex traits from individual-level data or summary statistics.

Nat Commun. 2021 Jul 7;12(1):4192. doi: 10.1038/s41467-021-24485-y.

Modeling the Dependence Structure in Genome Wide Association Studies of Binary Phenotypes in Family Data.

Behav Genet. 2020 Nov;50(6):423-439. doi: 10.1007/s10519-020-10010-2. Epub 2020 Aug 17.

A Case Study Competition Among Methods for Analyzing Large Spatial Data.

J Agric Biol Environ Stat. 2019;24(3):398-425. doi: 10.1007/s13253-018-00348-w. Epub 2018 Dec 14.

Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture.

Nat Genet. 2019 Aug;51(8):1244-1251. doi: 10.1038/s41588-019-0465-0. Epub 2019 Jul 29.

Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry.

Hum Mol Genet. 2018 Oct 15;27(20):3641-3649. doi: 10.1093/hmg/ddy271.

Reevaluation of SNP heritability in complex human traits.

Nat Genet. 2017 Jul;49(7):986-992. doi: 10.1038/ng.3865. Epub 2017 May 22.

Phenome-wide heritability analysis of the UK Biobank.

PLoS Genet. 2017 Apr 7;13(4):e1006711. doi: 10.1371/journal.pgen.1006711. eCollection 2017 Apr.

Multidimensional heritability analysis of neuroanatomical shape.

Nat Commun. 2016 Nov 15;7:13291. doi: 10.1038/ncomms13291.

Population Structure of UK Biobank and Ancient Eurasians Reveals Adaptation at Genes Influencing Blood Pressure.

Am J Hum Genet. 2016 Nov 3;99(5):1130-1139. doi: 10.1016/j.ajhg.2016.09.014. Epub 2016 Oct 20.
