使用全基因组方法预测疾病遗传风险的准确性。

Accuracy of predicting the genetic risk of disease using a genome-wide approach.

作者信息

Daetwyler Hans D, Villanueva Beatriz, Woolliams John A

机构信息

Genetics and Genomics, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Roslin, Midlothian, United Kingdom.

出版信息

PLoS One. 2008;3(10):e3395. doi: 10.1371/journal.pone.0003395. Epub 2008 Oct 14.

DOI:10.1371/journal.pone.0003395

PMID:18852893

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2561058/

Abstract

BACKGROUND

The prediction of the genetic disease risk of an individual is a powerful public health tool. While predicting risk has been successful in diseases which follow simple Mendelian inheritance, it has proven challenging in complex diseases for which a large number of loci contribute to the genetic variance. The large numbers of single nucleotide polymorphisms now available provide new opportunities for predicting genetic risk of complex diseases with high accuracy.

METHODOLOGY/PRINCIPAL FINDINGS: We have derived simple deterministic formulae to predict the accuracy of predicted genetic risk from population or case control studies using a genome-wide approach and assuming a dichotomous disease phenotype with an underlying continuous liability. We show that the prediction equations are special cases of the more general problem of predicting the accuracy of estimates of genetic values of a continuous phenotype. Our predictive equations are responsive to all parameters that affect accuracy and they are independent of allele frequency and effect distributions. Deterministic prediction errors when tested by simulation were generally small. The common link among the expressions for accuracy is that they are best summarized as the product of the ratio of number of phenotypic records per number of risk loci and the observed heritability.

CONCLUSIONS/SIGNIFICANCE: This study advances the understanding of the relative power of case control and population studies of disease. The predictions represent an upper bound of accuracy which may be achievable with improved effect estimation methods. The formulae derived will help researchers determine an appropriate sample size to attain a certain accuracy when predicting genetic risk.

摘要

背景

预测个体的遗传疾病风险是一项强有力的公共卫生工具。虽然在遵循简单孟德尔遗传的疾病中预测风险已取得成功，但在大量基因座对遗传变异有贡献的复杂疾病中，这已被证明具有挑战性。现在可用的大量单核苷酸多态性为高精度预测复杂疾病的遗传风险提供了新机会。

方法/主要发现：我们推导了简单的确定性公式，以使用全基因组方法并假设具有潜在连续易感性的二分疾病表型，从人群或病例对照研究中预测预测遗传风险的准确性。我们表明，预测方程是预测连续表型遗传值估计准确性这一更一般问题的特殊情况。我们的预测方程对影响准确性的所有参数都有响应，并且它们与等位基因频率和效应分布无关。通过模拟测试时，确定性预测误差通常较小。准确性表达式之间的共同联系是，它们最好总结为每个风险基因座的表型记录数与观察到的遗传力之比的乘积。

结论/意义：本研究推进了对疾病病例对照和人群研究相对效力的理解。这些预测代表了通过改进效应估计方法可能实现的准确性上限。推导的公式将帮助研究人员在预测遗传风险时确定适当的样本量以达到一定的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/57a0/2561058/4fe348fde372/pone.0003395.g001.jpg

相似文献

Accuracy of predicting the genetic risk of disease using a genome-wide approach.使用全基因组方法预测疾病遗传风险的准确性。

PLoS One. 2008;3(10):e3395. doi: 10.1371/journal.pone.0003395. Epub 2008 Oct 14.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Prediction of individual genetic risk to disease from genome-wide association studies.基于全基因组关联研究预测个体疾病遗传风险

Genome Res. 2007 Oct;17(10):1520-8. doi: 10.1101/gr.6665407. Epub 2007 Sep 4.

Empirical and deterministic accuracies of across-population genomic prediction.跨群体基因组预测的经验性和确定性准确性。

Genet Sel Evol. 2015 Feb 6;47(1):5. doi: 10.1186/s12711-014-0086-0.

The impact of genetic architecture on genome-wide evaluation methods.遗传结构对全基因组评估方法的影响。

Genetics. 2010 Jul;185(3):1021-31. doi: 10.1534/genetics.110.116855. Epub 2010 Apr 20.

Marker-based estimation of heritability in immortal populations.基于标记的永生群体遗传力估计

Genetics. 2015 Feb;199(2):379-98. doi: 10.1534/genetics.114.167916. Epub 2014 Dec 19.

An Equation to Predict the Accuracy of Genomic Values by Combining Data from Multiple Traits, Populations, or Environments.一种通过整合多个性状、群体或环境的数据来预测基因组值准确性的方程。

Genetics. 2016 Feb;202(2):799-823. doi: 10.1534/genetics.115.183269. Epub 2015 Dec 4.

Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误：利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。

J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.

Novel genetic analysis for case-control genome-wide association studies: quantification of power and genomic prediction accuracy.全基因组关联研究中病例对照的新型遗传分析：效能和基因组预测准确性的量化。

PLoS One. 2013 Aug 19;8(8):e71494. doi: 10.1371/journal.pone.0071494. eCollection 2013.

Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data.植物育种中遗传力和预测准确性的稳健估计：使用模拟和经验数据进行评估。

BMC Genomics. 2020 Jan 14;21(1):43. doi: 10.1186/s12864-019-6429-z.

引用本文的文献

A Self-Supervised Pre-Trained Transformer Model for Accurate Genomic Prediction of Swine Phenotypes.一种用于猪表型准确基因组预测的自监督预训练Transformer模型。

Animals (Basel). 2025 Aug 24;15(17):2485. doi: 10.3390/ani15172485.

Shared SNP effects across breeds increase the genomic prediction accuracy for numerically small breeds.跨品种共享的单核苷酸多态性（SNP）效应提高了小数量品种的基因组预测准确性。

Sci Rep. 2025 Aug 26;15(1):31421. doi: 10.1038/s41598-025-15733-y.

Harnessing big data for enhanced genome-wide prediction in winter wheat breeding.利用大数据增强冬小麦育种中的全基因组预测

Theor Appl Genet. 2025 Aug 22;138(9):224. doi: 10.1007/s00122-025-05007-6.

Improving genomic prediction accuracy for methane emission and feed efficiency in sheep: integrating rumen microbial PCA with host genomic variation using neural network GBLUP (NN-GBLUP).提高绵羊甲烷排放和饲料效率的基因组预测准确性：使用神经网络GBLUP（NN-GBLUP）将瘤胃微生物主成分分析与宿主基因组变异相结合。

Genet Sel Evol. 2025 Jul 17;57(1):41. doi: 10.1186/s12711-025-00987-x.

Estimation of genetic parameters for mature cow size in North American and Australian Angus cattle.北美和澳大利亚安格斯牛成年母牛体型遗传参数的估计。

J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf212.

An Updated Polygenic Index Repository: Expanded Phenotypes, New Cohorts, and Improved Causal Inference.一个更新的多基因指数库：扩展的表型、新的队列和改进的因果推断。

bioRxiv. 2025 May 18:2025.05.14.653986. doi: 10.1101/2025.05.14.653986.

Enhancing prediction accuracy of key biomass partitioning traits in wheat using multi-kernel genomic prediction models integrating secondary traits and environmental covariates.利用整合次要性状和环境协变量的多核基因组预测模型提高小麦关键生物量分配性状的预测准确性。

Plant Genome. 2025 Jun;18(2):e70052. doi: 10.1002/tpg2.70052.

Genomic Prediction in a Self-Fertilized Progenies of spp.XX属自交后代中的基因组预测（注：原文中“spp.”指代不明，这里保留原样翻译）

Plants (Basel). 2025 May 9;14(10):1422. doi: 10.3390/plants14101422.

Enhanced Reliability of the Evaluation of Fertility Traits in Pura Raza Española Horses Using Single-Step Genomic Best Linear Unbiased Prediction.使用单步基因组最佳线性无偏预测提高西班牙纯种马繁殖性状评估的可靠性

Genes (Basel). 2025 May 9;16(5):562. doi: 10.3390/genes16050562.

Low density marker-based effectiveness and efficiency of early-generation genomic selection relative to phenotype-based selection in dolichos bean (Lablab purpureus L. Sweet).基于低密度标记的菜豆（Lablab purpureus L. Sweet）早期基因组选择相对于基于表型选择的有效性和效率

Plant Genome. 2025 Jun;18(2):e70039. doi: 10.1002/tpg2.70039.

本文引用的文献

Inbreeding in artificial selection programmes.人工选择计划中的近亲繁殖。

Genet Res. 2007 Dec;89(5-6):275-80. doi: 10.1017/S0016672308009452.

Bayesian LASSO for quantitative trait loci mapping.用于数量性状基因座定位的贝叶斯套索法

Genetics. 2008 Jun;179(2):1045-55. doi: 10.1534/genetics.107.085589. Epub 2008 May 27.

Extent of linkage disequilibrium in Holstein cattle in North America.北美荷斯坦奶牛的连锁不平衡程度。

J Dairy Sci. 2008 May;91(5):2106-17. doi: 10.3168/jds.2007-0553.

Genomic selection using different marker types and densities.使用不同标记类型和密度的基因组选择。

J Anim Sci. 2008 Oct;86(10):2447-54. doi: 10.2527/jas.2007-0010. Epub 2008 Apr 11.

Genome-wide association analysis identifies 20 loci that influence adult height.全基因组关联分析确定了20个影响成人身高的基因座。

Nat Genet. 2008 May;40(5):575-83. doi: 10.1038/ng.121. Epub 2008 Apr 6.

Commonality of functional annotation: a method for prioritization of candidate genes from genome-wide linkage studies.功能注释的共性：一种从全基因组连锁研究中对候选基因进行优先级排序的方法。

Nucleic Acids Res. 2008 Mar;36(4):e26. doi: 10.1093/nar/gkn007. Epub 2008 Feb 7.

The impact of genetic relationship information on genome-assisted breeding values.遗传关系信息对基因组辅助育种值的影响。

Genetics. 2007 Dec;177(4):2389-97. doi: 10.1534/genetics.107.081190.

Prediction of individual genetic risk to disease from genome-wide association studies.基于全基因组关联研究预测个体疾病遗传风险

Genome Res. 2007 Oct;17(10):1520-8. doi: 10.1101/gr.6665407. Epub 2007 Sep 4.

The number of loci that affect milk production traits in dairy cattle.影响奶牛产奶性状的基因座数量。

Genetics. 2007 Oct;177(2):1117-23. doi: 10.1534/genetics.107.077784. Epub 2007 Aug 24.

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.对14000例七种常见疾病患者及3000例共享对照进行全基因组关联研究。

Nature. 2007 Jun 7;447(7145):661-78. doi: 10.1038/nature05911.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用全基因组方法预测疾病遗传风险的准确性。

Accuracy of predicting the genetic risk of disease using a genome-wide approach.

作者信息

机构信息

出版信息

BACKGROUND

背景

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献