• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ROC 曲线下面积的基因组分析中的遗传解释。

The genetic interpretation of area under the ROC curve in genomic profiling.

机构信息

Genetic Epidemiology and Queensland Statistical Genetics, Queensland Institute of Medical Research, Brisbane, Australia.

出版信息

PLoS Genet. 2010 Feb 26;6(2):e1000864. doi: 10.1371/journal.pgen.1000864.

DOI:10.1371/journal.pgen.1000864
PMID:20195508
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2829056/
Abstract

Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determining the efficacy of tests in correctly classifying diseased and non-diseased individuals. We use quantitative genetics theory to provide insight into the genetic interpretation of the area under the ROC curve (AUC) when the test classifier is a predictor of genetic risk. Even when the proportion of genetic variance explained by the test is 100%, there is a maximum value for AUC that depends on the genetic epidemiology of the disease, i.e. either the sibling recurrence risk or heritability and disease prevalence. We derive an equation relating maximum AUC to heritability and disease prevalence. The expression can be reversed to calculate the proportion of genetic variance explained given AUC, disease prevalence, and heritability. We use published estimates of disease prevalence and sibling recurrence risk for 17 complex genetic diseases to calculate the proportion of genetic variance that a test must explain to achieve AUC = 0.75; this varied from 0.10 to 0.74. We provide a genetic interpretation of AUC for use with predictors of genetic risk based on genomic profiles. We provide a strategy to estimate proportion of genetic variance explained on the liability scale from estimates of AUC, disease prevalence, and heritability (or sibling recurrence risk) available as an online calculator.

摘要

全基因组关联研究在人类群体中促进了基因组谱的创建,这些基因组谱结合了许多相关遗传变异的影响,以预测疾病风险。接收者操作特征(ROC)曲线下的面积是一种用于确定测试正确分类患病和非患病个体的功效的既定衡量标准。我们使用定量遗传学理论,深入了解当测试分类器是遗传风险预测因子时,ROC 曲线下面积(AUC)的遗传解释。即使测试解释的遗传方差比例为 100%,AUC 也有一个最大值,该最大值取决于疾病的遗传流行病学,即兄弟姐妹复发风险或遗传性和疾病流行率。我们得出了一个与遗传性和疾病流行率相关的最大 AUC 的方程。该表达式可以反过来计算给定 AUC、疾病流行率和遗传性的情况下,遗传方差的比例。我们使用已发表的 17 种复杂遗传疾病的疾病流行率和兄弟姐妹复发风险估计值来计算测试必须解释的遗传方差比例,以达到 AUC = 0.75;这从 0.10 到 0.74 不等。我们提供了一种基于基因组谱的遗传风险预测因子的 AUC 遗传解释。我们提供了一种从 AUC、疾病流行率和遗传性(或兄弟姐妹复发风险)的估计值估计易感性尺度上遗传方差比例的策略,可作为在线计算器使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/047ca96ebdaf/pgen.1000864.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/912ca762c305/pgen.1000864.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/6c083aae3a47/pgen.1000864.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/047ca96ebdaf/pgen.1000864.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/912ca762c305/pgen.1000864.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/6c083aae3a47/pgen.1000864.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0181/2829056/047ca96ebdaf/pgen.1000864.g003.jpg

相似文献

1
The genetic interpretation of area under the ROC curve in genomic profiling.ROC 曲线下面积的基因组分析中的遗传解释。
PLoS Genet. 2010 Feb 26;6(2):e1000864. doi: 10.1371/journal.pgen.1000864.
2
Sibling method increases risk assessment estimates for type 1 diabetes.同胞法提高了1型糖尿病的风险评估估计值。
PLoS One. 2017 May 16;12(5):e0176341. doi: 10.1371/journal.pone.0176341. eCollection 2017.
3
A unifying framework for evaluating the predictive power of genetic variants based on the level of heritability explained.基于遗传力解释水平评估遗传变异预测能力的统一框架。
PLoS Genet. 2010 Dec 2;6(12):e1001230. doi: 10.1371/journal.pgen.1001230.
4
Polygenic score prediction captures nearly all common genetic risk for Alzheimer's disease.多基因评分预测几乎涵盖了阿尔茨海默病所有常见的遗传风险。
Neurobiol Aging. 2017 Jan;49:214.e7-214.e11. doi: 10.1016/j.neurobiolaging.2016.07.018. Epub 2016 Aug 5.
5
A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms.不平衡数据中机器学习算法评估的新一致性部分 AUC 和部分 c 统计量。
BMC Med Inform Decis Mak. 2020 Jan 6;20(1):4. doi: 10.1186/s12911-019-1014-6.
6
AUC-RF: a new strategy for genomic profiling with random forest.AUC-RF:一种使用随机森林进行基因组分析的新策略。
Hum Hered. 2011;72(2):121-32. doi: 10.1159/000330778. Epub 2011 Oct 11.
7
The partial area under the summary ROC curve.汇总ROC曲线下的部分面积。
Stat Med. 2005 Jul 15;24(13):2025-40. doi: 10.1002/sim.2103.
8
Receiver operating characteristic analysis under tree orderings of disease classes.疾病类别树形排序下的受试者工作特征分析。
Stat Med. 2016 May 20;35(11):1907-26. doi: 10.1002/sim.6843. Epub 2015 Dec 17.
9
The precision--recall curve overcame the optimism of the receiver operating characteristic curve in rare diseases.精准度-召回率曲线克服了罕见病中接收器工作特征曲线的过度乐观。
J Clin Epidemiol. 2015 Aug;68(8):855-9. doi: 10.1016/j.jclinepi.2015.02.010. Epub 2015 Feb 28.
10
Predictive testing for complex diseases using multiple genes: fact or fiction?使用多个基因对复杂疾病进行预测性检测:事实还是虚构?
Genet Med. 2006 Jul;8(7):395-400. doi: 10.1097/01.gim.0000229689.18263.f4.

引用本文的文献

1
Evaluating the Impact of Family History and Polygenic Risk Scores on Cardiometabolic Disease Risk.评估家族史和多基因风险评分对心脏代谢疾病风险的影响。
Res Sq. 2025 Aug 1:rs.3.rs-7142452. doi: 10.21203/rs.3.rs-7142452/v1.
2
Enhanced genetic fine mapping accuracy with Bayesian Linear Regression models in diverse genetic architectures.在不同遗传结构中使用贝叶斯线性回归模型提高遗传精细定位的准确性。
PLoS Genet. 2025 Jul 30;21(7):e1011783. doi: 10.1371/journal.pgen.1011783. eCollection 2025 Jul.
3
ENaC Biomarker Detection in Platelets Using a Lateral Flow Immunoassay: A Clinical Validation Study.

本文引用的文献

1
Multi-locus models of genetic risk of disease.疾病遗传风险的多基因座模型。
Genome Med. 2010 Feb 2;2(2):10. doi: 10.1186/gm131.
2
Using relative utility curves to evaluate risk prediction.使用相对效用曲线评估风险预测。
J R Stat Soc Ser A Stat Soc. 2009 Oct 1;172(4):729-748. doi: 10.1111/j.1467-985X.2009.00592.x.
3
Sporadic cases are the norm for complex disease.散发性病例是复杂疾病的常态。
使用侧向流动免疫分析法检测血小板中的ENaC生物标志物:一项临床验证研究。
Biosensors (Basel). 2025 Jun 20;15(7):399. doi: 10.3390/bios15070399.
4
Machine learning in Alzheimer's disease genetics.阿尔茨海默病遗传学中的机器学习
Nat Commun. 2025 Jul 22;16(1):6726. doi: 10.1038/s41467-025-61650-z.
5
Polygenic risk score prediction accuracy convergence.多基因风险评分预测准确性的收敛性。
HGG Adv. 2025 May 14;6(3):100457. doi: 10.1016/j.xhgg.2025.100457.
6
Genomics of schizophrenia, bipolar disorder and major depressive disorder.精神分裂症、双相情感障碍和重度抑郁症的基因组学
Nat Rev Genet. 2025 May 12. doi: 10.1038/s41576-025-00843-0.
7
Exon 1 methylation status of CDH13 is associated with decreased overall survival and distant metastasis in patients with postoperative colorectal cancer.CDH13基因外显子1的甲基化状态与结直肠癌术后患者的总生存期缩短及远处转移相关。
Discov Oncol. 2024 Nov 29;15(1):725. doi: 10.1007/s12672-024-01604-x.
8
MicroRNA binding site variants-new potential markers of primary osteoporosis in men and women.微小RNA结合位点变异体——男性和女性原发性骨质疏松症的新潜在标志物
Front Genet. 2024 Oct 1;15:1470310. doi: 10.3389/fgene.2024.1470310. eCollection 2024.
9
R2ROC: an efficient method of comparing two or more correlated AUC from out-of-sample prediction using polygenic scores.R2ROC:一种利用多基因评分比较样本外预测中两个或多个相关 AUC 的有效方法。
Hum Genet. 2024 Oct;143(9-10):1193-1205. doi: 10.1007/s00439-024-02682-1. Epub 2024 Jun 20.
10
Genome Sequencing in an Individual Presenting with 22q11.2 Deletion Syndrome and Juvenile Idiopathic Arthritis.一名患有22q11.2缺失综合征和幼年特发性关节炎患者的基因组测序
Genes (Basel). 2024 Apr 19;15(4):513. doi: 10.3390/genes15040513.
Eur J Hum Genet. 2010 Sep;18(9):1039-43. doi: 10.1038/ejhg.2009.177. Epub 2009 Oct 14.
4
Prediction and interaction in complex disease genetics: experience in type 1 diabetes.复杂疾病遗传学中的预测与相互作用:1型糖尿病的经验
PLoS Genet. 2009 Jul;5(7):e1000540. doi: 10.1371/journal.pgen.1000540. Epub 2009 Jul 3.
5
Beyond odds ratios--communicating disease risk based on genetic profiles.超越比值比——基于基因图谱传达疾病风险
Nat Rev Genet. 2009 Apr;10(4):264-9. doi: 10.1038/nrg2516.
6
Assessing susceptibility to age-related macular degeneration with proteomic and genomic biomarkers.利用蛋白质组学和基因组学生物标志物评估年龄相关性黄斑变性的易感性。
Mol Cell Proteomics. 2009 Jun;8(6):1338-49. doi: 10.1074/mcp.M800453-MCP200. Epub 2009 Feb 6.
7
Interpretation of genetic association studies: markers with replicated highly significant odds ratios may be poor classifiers.基因关联研究的解读:具有重复出现的高度显著优势比的标记物可能是较差的分类指标。
PLoS Genet. 2009 Feb;5(2):e1000337. doi: 10.1371/journal.pgen.1000337. Epub 2009 Feb 6.
8
Usefulness of genetic polymorphisms and conventional risk factors to predict coronary heart disease in patients with familial hypercholesterolemia.基因多态性和传统危险因素在预测家族性高胆固醇血症患者冠心病方面的实用性。
Am J Cardiol. 2009 Feb 1;103(3):375-80. doi: 10.1016/j.amjcard.2008.09.093. Epub 2008 Nov 21.
9
Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study.瑞典家庭中精神分裂症和双相情感障碍的常见遗传决定因素:一项基于人群的研究。
Lancet. 2009 Jan 17;373(9659):234-9. doi: 10.1016/S0140-6736(09)60072-6.
10
Personal genomes: The case of the missing heritability.个人基因组:“缺失的遗传力”问题
Nature. 2008 Nov 6;456(7218):18-21. doi: 10.1038/456018a.