超越遗传缺失：复杂性状的预测。

Beyond missing heritability: prediction of complex traits.

机构信息

Department of Biostatistics, University of Alabama at Birmingham, Alabama, United States of America.

出版信息

PLoS Genet. 2011 Apr;7(4):e1002051. doi: 10.1371/journal.pgen.1002051. Epub 2011 Apr 28.

DOI:10.1371/journal.pgen.1002051

PMID:21552331

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3084207/

Abstract

Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the "missing heritability" for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h(2) up to 0.83, R(2) up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R(2) values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (∼ 0.80), substantial room for improvement remains.

摘要

尽管基因组技术发展迅速，但我们利用遗传信息解释表型变异的能力对于许多特征仍然有限。这导致遗传数据在预防和个性化医学方面的应用受到限制，而这正是全基因组关联研究的主要推动力之一。最近，通过同时对数千个单核苷酸多态性进行建模，很大一部分人类身高的“遗传缺失”可以从统计学上得到解释。然而，目前尚不清楚遗传方差的增加将如何转化为对尚未观察到的表型的预测。我们使用弗雷明汉心脏研究的数据，在训练和验证样本中探索人类身高的基因组预测，同时改变所使用的统计方法、纳入模型的 SNP 数量、验证方案以及用于训练模型的样本数量。在我们的训练数据集中，我们能够解释身高变化的很大一部分（h(2)高达 0.83，R(2)高达 0.96）。然而，验证样本中解释的方差比例要小得多（根据训练数据集使用的家族信息程度，范围从 0.15 到 0.36）。虽然这些 R(2)值远远超过了以前使用较少预选标记（<0.10）所报告的值，但考虑到该特征的遗传力（~0.80），仍有很大的改进空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91e4/3084207/2cb38adfe8a7/pgen.1002051.g001.jpg

相似文献

Beyond missing heritability: prediction of complex traits.超越遗传缺失：复杂性状的预测。

PLoS Genet. 2011 Apr;7(4):e1002051. doi: 10.1371/journal.pgen.1002051. Epub 2011 Apr 28.

Accurate Genomic Prediction of Human Height.人类身高的精确基因组预测。

Genetics. 2018 Oct;210(2):477-497. doi: 10.1534/genetics.118.301267. Epub 2018 Aug 27.

Will Big Data Close the Missing Heritability Gap?大数据能否弥合遗传缺失的鸿沟？

Genetics. 2017 Nov;207(3):1135-1145. doi: 10.1534/genetics.117.300271. Epub 2017 Sep 11.

Ubiquitous polygenicity of human complex traits: genome-wide analysis of 49 traits in Koreans.人类复杂特征的普遍多基因性：对韩国 49 项特征的全基因组分析。

PLoS Genet. 2013;9(3):e1003355. doi: 10.1371/journal.pgen.1003355. Epub 2013 Mar 7.

Methodological Considerations in Estimation of Phenotype Heritability Using Genome-Wide SNP Data, Illustrated by an Analysis of the Heritability of Height in a Large Sample of African Ancestry Adults.利用全基因组SNP数据估计表型遗传力的方法学考量，以对大量非洲裔成年人身高遗传力的分析为例

PLoS One. 2015 Jun 30;10(6):e0131106. doi: 10.1371/journal.pone.0131106. eCollection 2015.

Estimation and partition of heritability in human populations using whole-genome analysis methods.利用全基因组分析方法在人类群体中估计和划分遗传率。

Annu Rev Genet. 2013;47:75-95. doi: 10.1146/annurev-genet-111212-133258. Epub 2013 Aug 22.

Genome-wide compound heterozygote analysis highlights alleles associated with adult height in Europeans.全基因组复合杂合子分析揭示了与欧洲成年人身高相关的等位基因。

Hum Genet. 2017 Nov;136(11-12):1407-1417. doi: 10.1007/s00439-017-1842-3. Epub 2017 Sep 18.

Missing heritability: is the gap closing? An analysis of 32 complex traits in the Lifelines Cohort Study.缺失的遗传力：差距正在缩小吗？生命线队列研究中32种复杂性状的分析。

Eur J Hum Genet. 2017 Jun;25(7):877-885. doi: 10.1038/ejhg.2017.50. Epub 2017 Apr 12.

Common SNPs explain a large proportion of the heritability for human height.常见的单核苷酸多态性解释了人类身高遗传的很大一部分。

Nat Genet. 2010 Jul;42(7):565-9. doi: 10.1038/ng.608. Epub 2010 Jun 20.

Effects of number of training generations on genomic prediction for various traits in a layer chicken population.训练世代数对蛋鸡群体中各种性状基因组预测的影响。

Genet Sel Evol. 2016 Mar 19;48:22. doi: 10.1186/s12711-016-0198-9.

引用本文的文献

Harnessing big data for enhanced genome-wide prediction in winter wheat breeding.利用大数据增强冬小麦育种中的全基因组预测

Theor Appl Genet. 2025 Aug 22;138(9):224. doi: 10.1007/s00122-025-05007-6.

Population structure limits the use of genomic data for predicting phenotypes and managing genetic resources in forest trees.群体结构限制了基因组数据在预测林木表型和管理遗传资源方面的应用。

Proc Natl Acad Sci U S A. 2025 Jul;122(26):e2425691122. doi: 10.1073/pnas.2425691122. Epub 2025 Jun 25.

Genomic prediction in Persian walnut: Optimization levers according to genetic architecture of complex traits.波斯核桃的基因组预测：根据复杂性状的遗传结构确定优化手段。

Plant Genome. 2025 Jun;18(2):e70047. doi: 10.1002/tpg2.70047.

Effect of breed composition in genomic prediction using crossbred pig reference population.在使用杂交猪参考群体进行基因组预测中品种组成的影响。

J Anim Sci Technol. 2025 Jan;67(1):56-68. doi: 10.5187/jast.2025.e2. Epub 2025 Jan 31.

Bayesian hierarchical hypothesis testing in large-scale genome-wide association analysis.大规模全基因组关联分析中的贝叶斯分层假设检验

Genetics. 2024 Nov 19;228(4). doi: 10.1093/genetics/iyae164.

Advancements and limitations in polygenic risk score methods for genomic prediction: a scoping review.多基因风险评分方法在基因组预测中的进展和局限性：范围综述。

Hum Genet. 2024 Dec;143(12):1401-1431. doi: 10.1007/s00439-024-02716-8. Epub 2024 Nov 14.

GWAS advancements to investigate disease associations and biological mechanisms.全基因组关联研究（GWAS）在探究疾病关联和生物学机制方面的进展。

Clin Transl Discov. 2024 Jul;4(3). doi: 10.1002/ctd2.296. Epub 2024 May 1.

Putting Psychology to the Test: Rethinking Model Evaluation Through Benchmarking and Prediction.对心理学进行测试：通过基准测试和预测重新思考模型评估

Adv Methods Pract Psychol Sci. 2021 Jul-Sep;4(3). doi: 10.1177/25152459211026864. Epub 2021 Sep 23.

Can polygenic risk scores help explain disease prevalence differences around the world? A worldwide investigation.多基因风险评分能否帮助解释世界各地疾病流行率的差异？一项全球性研究。

BMC Genom Data. 2023 Nov 20;24(1):70. doi: 10.1186/s12863-023-01168-9.

Biobank-scale methods and projections for sparse polygenic prediction from machine learning.基于机器学习的稀疏多基因预测的生物银行规模方法和预测。

Sci Rep. 2023 Jul 19;13(1):11662. doi: 10.1038/s41598-023-37580-5.

本文引用的文献

Genome-enabled prediction using the BLR (Bayesian Linear Regression) R-package.使用BLR（贝叶斯线性回归）R包进行基于基因组的预测。

Methods Mol Biol. 2013;1019:299-320. doi: 10.1007/978-1-62703-447-0_12.

A commentary on 'common SNPs explain a large proportion of the heritability for human height' by Yang et al. (2010).对杨等人（2010年）所著《常见单核苷酸多态性解释了人类身高遗传力的很大一部分》的一篇评论。

Twin Res Hum Genet. 2010 Dec;13(6):517-24. doi: 10.1375/twin.13.6.517.

Predictive ability of subsets of single nucleotide polymorphisms with and without parent average in US Holsteins.美国荷斯坦牛中单核苷酸多态性亚组及其亲本平均值的预测能力。

J Dairy Sci. 2010 Dec;93(12):5942-9. doi: 10.3168/jds.2010-3335.

Predicting genetic predisposition in humans: the promise of whole-genome markers.预测人类的遗传易感性：全基因组标记的前景。

Nat Rev Genet. 2010 Dec;11(12):880-6. doi: 10.1038/nrg2898. Epub 2010 Nov 3.

Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods.使用再生核希尔伯特空间方法对遗传值进行半参数基因组预测。

Genet Res (Camb). 2010 Aug;92(4):295-308. doi: 10.1017/S0016672310000285.

Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index.对 249796 人的关联分析揭示了 18 个与体重指数相关的新位点。

Nat Genet. 2010 Nov;42(11):937-48. doi: 10.1038/ng.686. Epub 2010 Oct 10.

Hundreds of variants clustered in genomic loci and biological pathways affect human height.数以百计的变异体聚集在基因组位置和生物途径中，影响人类身高。

Nature. 2010 Oct 14;467(7317):832-8. doi: 10.1038/nature09410. Epub 2010 Sep 29.

Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers.利用系谱和分子标记预测植物育种中数量性状的遗传值。

Genetics. 2010 Oct;186(2):713-24. doi: 10.1534/genetics.110.118521. Epub 2010 Sep 2.

Biological, clinical and population relevance of 95 loci for blood lipids.95 个与血脂相关的生物学、临床和人群相关性位点。

Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.

Missing heritability: paternal age effect mutations and selfish spermatogonia.缺失的遗传力：父系年龄效应突变与自私精原细胞

Nat Rev Genet. 2010 Aug;11(8):589. doi: 10.1038/nrg2809-c1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

超越遗传缺失：复杂性状的预测。

Beyond missing heritability: prediction of complex traits.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献