Analytical and simulation methods for estimating the potential predictive ability of genetic profiling: a comparison of methods and results.

Affiliation

Department of Epidemiology, Erasmus University Medical Center, Rotterdam, The Netherlands.

Publication information

Eur J Hum Genet. 2012 Dec;20(12):1270-4. doi: 10.1038/ejhg.2012.89. Epub 2012 May 30.

DOI:10.1038/ejhg.2012.89

PMID:22643180

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3499740/

Abstract

Various modeling methods have been proposed to estimate the potential predictive ability of polygenic risk variants that predispose to various common diseases. However, it is unknown whether differences between them affect their conclusions on predictive ability. We reviewed input parameters, assumptions and output of the five most common methods and compared their estimates of the area under the receiver operating characteristic (ROC) curve (AUC) using hypothetical data representing effect sizes and frequencies of genetic variants, population disease risk and number of variants. To assess the accuracy of the estimated AUCs, we aimed to reproduce the AUCs of published empirical studies. All methods assumed that the combined effect of genetic variants on disease risk followed a multiplicative risk model of independent genetic effects, but they assumed either per-allele, per-genotype or dominant/recessive effects for the genetic variants. Modeling strategy and input parameters differed. Methods used simulation analysis or analytical formulas with effect sizes quantified by odds ratios (ORs) or relative risks. Estimated AUC values were similar for lower ORs (<1.2). When AUCs were larger (>0.7) due to variants with strong effects, differences in estimated AUCs between methods increased. The simulation methods accurately reproduced the AUC values of empirical studies, but the analytical methods did not. We conclude that despite differences in input parameters, the modeling methods estimate similar AUCs for realistic values of the ORs. When one or more variants have stronger effects and AUC values are higher, the simulation methods tend to be more accurate.
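To make the contrast between simulation and analytical estimation concrete, the following is a minimal Python sketch. It is not a reimplementation of any of the five methods reviewed in the paper. It assumes per-allele ORs, independent variants in Hardy-Weinberg equilibrium, and a multiplicative model on the odds scale (a logistic model). The analytical function uses a normal approximation to the log-odds score, AUC ≈ Φ(√(V/2)) with V = Σ 2p(1−p)(ln OR)², which is an assumption of this sketch. The function names `simulate_auc`, `analytical_auc` and `auc_from_scores`, and the input values in the usage lines, are hypothetical.

```python
import math
import numpy as np


def auc_from_scores(scores, status):
    """Tie-aware AUC from the Mann-Whitney U statistic (status: True = case)."""
    scores = np.asarray(scores, dtype=float)
    status = np.asarray(status, dtype=bool)
    _, inverse, counts = np.unique(scores, return_inverse=True, return_counts=True)
    upper = np.cumsum(counts)                # highest rank held by each distinct score
    avg_rank = upper - (counts - 1) / 2.0    # average rank shared by tied scores
    ranks = avg_rank[inverse]
    n_case = int(status.sum())
    n_ctrl = status.size - n_case
    u = ranks[status].sum() - n_case * (n_case + 1) / 2.0
    return u / (n_case * n_ctrl)


def simulate_auc(freqs, odds_ratios, disease_risk, n_people=200_000, seed=0):
    """Simulation estimate: draw genotypes, assign disease status, compute AUC."""
    rng = np.random.default_rng(seed)
    freqs = np.asarray(freqs, dtype=float)
    log_or = np.log(np.asarray(odds_ratios, dtype=float))
    # Risk-allele counts (0, 1 or 2) per person and variant, assuming HWE.
    genotypes = rng.binomial(2, freqs, size=(n_people, freqs.size))
    score = genotypes @ log_or               # log-odds contribution of the profile
    # Bisection on the intercept so the mean risk matches the population risk.
    lo, hi = -30.0, 30.0
    for _ in range(60):
        mid = (lo + hi) / 2.0
        mean_risk = np.mean(1.0 / (1.0 + np.exp(-(mid + score))))
        if mean_risk < disease_risk:
            lo = mid
        else:
            hi = mid
    risk = 1.0 / (1.0 + np.exp(-((lo + hi) / 2.0 + score)))
    status = rng.random(n_people) < risk
    # Rounding merges scores that differ only by floating-point noise.
    return auc_from_scores(np.round(score, 10), status)


def analytical_auc(freqs, odds_ratios):
    """Normal-approximation estimate: AUC ~ Phi(sqrt(V / 2))."""
    v = sum(2.0 * p * (1.0 - p) * math.log(o) ** 2
            for p, o in zip(freqs, odds_ratios))
    # Phi(x) = 0.5 * (1 + erf(x / sqrt(2))) evaluated at x = sqrt(V / 2).
    return 0.5 * (1.0 + math.erf(math.sqrt(v) / 2.0))


if __name__ == "__main__":
    freqs, ors = [0.3] * 10, [1.1] * 10      # hypothetical inputs
    print("simulation:", simulate_auc(freqs, ors, disease_risk=0.1))
    print("analytical:", analytical_auc(freqs, ors))
```

With small ORs such as these, the two estimates should agree closely, in line with the abstract's finding for ORs below 1.2. Replacing one OR with a much larger value is a simple way to explore how a normal approximation can drift from the simulated value when a single variant dominates the score.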
