Suppr超能文献

超越遗传缺失:复杂性状的预测。

Beyond missing heritability: prediction of complex traits.

机构信息

Department of Biostatistics, University of Alabama at Birmingham, Alabama, United States of America.

出版信息

PLoS Genet. 2011 Apr;7(4):e1002051. doi: 10.1371/journal.pgen.1002051. Epub 2011 Apr 28.

Abstract

Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the "missing heritability" for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h(2) up to 0.83, R(2) up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R(2) values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (∼ 0.80), substantial room for improvement remains.

摘要

尽管基因组技术发展迅速,但我们利用遗传信息解释表型变异的能力对于许多特征仍然有限。这导致遗传数据在预防和个性化医学方面的应用受到限制,而这正是全基因组关联研究的主要推动力之一。最近,通过同时对数千个单核苷酸多态性进行建模,很大一部分人类身高的“遗传缺失”可以从统计学上得到解释。然而,目前尚不清楚遗传方差的增加将如何转化为对尚未观察到的表型的预测。我们使用弗雷明汉心脏研究的数据,在训练和验证样本中探索人类身高的基因组预测,同时改变所使用的统计方法、纳入模型的 SNP 数量、验证方案以及用于训练模型的样本数量。在我们的训练数据集中,我们能够解释身高变化的很大一部分(h(2)高达 0.83,R(2)高达 0.96)。然而,验证样本中解释的方差比例要小得多(根据训练数据集使用的家族信息程度,范围从 0.15 到 0.36)。虽然这些 R(2)值远远超过了以前使用较少预选标记(<0.10)所报告的值,但考虑到该特征的遗传力(~0.80),仍有很大的改进空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91e4/3084207/2cb38adfe8a7/pgen.1002051.g001.jpg

相似文献

1
Beyond missing heritability: prediction of complex traits.
PLoS Genet. 2011 Apr;7(4):e1002051. doi: 10.1371/journal.pgen.1002051. Epub 2011 Apr 28.
2
Accurate Genomic Prediction of Human Height.
Genetics. 2018 Oct;210(2):477-497. doi: 10.1534/genetics.118.301267. Epub 2018 Aug 27.
3
Will Big Data Close the Missing Heritability Gap?
Genetics. 2017 Nov;207(3):1135-1145. doi: 10.1534/genetics.117.300271. Epub 2017 Sep 11.
4
Ubiquitous polygenicity of human complex traits: genome-wide analysis of 49 traits in Koreans.
PLoS Genet. 2013;9(3):e1003355. doi: 10.1371/journal.pgen.1003355. Epub 2013 Mar 7.
6
Estimation and partition of heritability in human populations using whole-genome analysis methods.
Annu Rev Genet. 2013;47:75-95. doi: 10.1146/annurev-genet-111212-133258. Epub 2013 Aug 22.
7
Genome-wide compound heterozygote analysis highlights alleles associated with adult height in Europeans.
Hum Genet. 2017 Nov;136(11-12):1407-1417. doi: 10.1007/s00439-017-1842-3. Epub 2017 Sep 18.
8
Missing heritability: is the gap closing? An analysis of 32 complex traits in the Lifelines Cohort Study.
Eur J Hum Genet. 2017 Jun;25(7):877-885. doi: 10.1038/ejhg.2017.50. Epub 2017 Apr 12.
9
Common SNPs explain a large proportion of the heritability for human height.
Nat Genet. 2010 Jul;42(7):565-9. doi: 10.1038/ng.608. Epub 2010 Jun 20.

引用本文的文献

1
Harnessing big data for enhanced genome-wide prediction in winter wheat breeding.
Theor Appl Genet. 2025 Aug 22;138(9):224. doi: 10.1007/s00122-025-05007-6.
2
Population structure limits the use of genomic data for predicting phenotypes and managing genetic resources in forest trees.
Proc Natl Acad Sci U S A. 2025 Jul;122(26):e2425691122. doi: 10.1073/pnas.2425691122. Epub 2025 Jun 25.
4
Effect of breed composition in genomic prediction using crossbred pig reference population.
J Anim Sci Technol. 2025 Jan;67(1):56-68. doi: 10.5187/jast.2025.e2. Epub 2025 Jan 31.
5
Bayesian hierarchical hypothesis testing in large-scale genome-wide association analysis.
Genetics. 2024 Nov 19;228(4). doi: 10.1093/genetics/iyae164.
6
Advancements and limitations in polygenic risk score methods for genomic prediction: a scoping review.
Hum Genet. 2024 Dec;143(12):1401-1431. doi: 10.1007/s00439-024-02716-8. Epub 2024 Nov 14.
7
GWAS advancements to investigate disease associations and biological mechanisms.
Clin Transl Discov. 2024 Jul;4(3). doi: 10.1002/ctd2.296. Epub 2024 May 1.
8
Putting Psychology to the Test: Rethinking Model Evaluation Through Benchmarking and Prediction.
Adv Methods Pract Psychol Sci. 2021 Jul-Sep;4(3). doi: 10.1177/25152459211026864. Epub 2021 Sep 23.
10
Biobank-scale methods and projections for sparse polygenic prediction from machine learning.
Sci Rep. 2023 Jul 19;13(1):11662. doi: 10.1038/s41598-023-37580-5.

本文引用的文献

1
Genome-enabled prediction using the BLR (Bayesian Linear Regression) R-package.
Methods Mol Biol. 2013;1019:299-320. doi: 10.1007/978-1-62703-447-0_12.
4
Predicting genetic predisposition in humans: the promise of whole-genome markers.
Nat Rev Genet. 2010 Dec;11(12):880-6. doi: 10.1038/nrg2898. Epub 2010 Nov 3.
5
Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods.
Genet Res (Camb). 2010 Aug;92(4):295-308. doi: 10.1017/S0016672310000285.
6
Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index.
Nat Genet. 2010 Nov;42(11):937-48. doi: 10.1038/ng.686. Epub 2010 Oct 10.
7
Hundreds of variants clustered in genomic loci and biological pathways affect human height.
Nature. 2010 Oct 14;467(7317):832-8. doi: 10.1038/nature09410. Epub 2010 Sep 29.
8
Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers.
Genetics. 2010 Oct;186(2):713-24. doi: 10.1534/genetics.110.118521. Epub 2010 Sep 2.
9
Biological, clinical and population relevance of 95 loci for blood lipids.
Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.
10
Missing heritability: paternal age effect mutations and selfish spermatogonia.
Nat Rev Genet. 2010 Aug;11(8):589. doi: 10.1038/nrg2809-c1.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验