Genetic Epidemiology and Queensland Statistical Genetics, Queensland Institute of Medical Research, Brisbane, Australia.
PLoS Genet. 2010 Feb 26;6(2):e1000864. doi: 10.1371/journal.pgen.1000864.
Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determining the efficacy of tests in correctly classifying diseased and non-diseased individuals. We use quantitative genetics theory to provide insight into the genetic interpretation of the area under the ROC curve (AUC) when the test classifier is a predictor of genetic risk. Even when the proportion of genetic variance explained by the test is 100%, there is a maximum value for AUC that depends on the genetic epidemiology of the disease, i.e. either the sibling recurrence risk or heritability and disease prevalence. We derive an equation relating maximum AUC to heritability and disease prevalence. The expression can be reversed to calculate the proportion of genetic variance explained given AUC, disease prevalence, and heritability. We use published estimates of disease prevalence and sibling recurrence risk for 17 complex genetic diseases to calculate the proportion of genetic variance that a test must explain to achieve AUC = 0.75; this varied from 0.10 to 0.74. We provide a genetic interpretation of AUC for use with predictors of genetic risk based on genomic profiles. We provide a strategy to estimate proportion of genetic variance explained on the liability scale from estimates of AUC, disease prevalence, and heritability (or sibling recurrence risk) available as an online calculator.
全基因组关联研究在人类群体中促进了基因组谱的创建,这些基因组谱结合了许多相关遗传变异的影响,以预测疾病风险。接收者操作特征(ROC)曲线下的面积是一种用于确定测试正确分类患病和非患病个体的功效的既定衡量标准。我们使用定量遗传学理论,深入了解当测试分类器是遗传风险预测因子时,ROC 曲线下面积(AUC)的遗传解释。即使测试解释的遗传方差比例为 100%,AUC 也有一个最大值,该最大值取决于疾病的遗传流行病学,即兄弟姐妹复发风险或遗传性和疾病流行率。我们得出了一个与遗传性和疾病流行率相关的最大 AUC 的方程。该表达式可以反过来计算给定 AUC、疾病流行率和遗传性的情况下,遗传方差的比例。我们使用已发表的 17 种复杂遗传疾病的疾病流行率和兄弟姐妹复发风险估计值来计算测试必须解释的遗传方差比例,以达到 AUC = 0.75;这从 0.10 到 0.74 不等。我们提供了一种基于基因组谱的遗传风险预测因子的 AUC 遗传解释。我们提供了一种从 AUC、疾病流行率和遗传性(或兄弟姐妹复发风险)的估计值估计易感性尺度上遗传方差比例的策略,可作为在线计算器使用。