• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估诊断试验:ROC 曲线下面积和误差平衡。

Evaluating diagnostic tests: The area under the ROC curve and the balance of errors.

机构信息

Imperial College, London.

出版信息

Stat Med. 2010 Jun 30;29(14):1502-10. doi: 10.1002/sim.3859.

DOI:10.1002/sim.3859
PMID:20087877
Abstract

Because accurate diagnosis lies at the heart of medicine, it is important to be able to evaluate the effectiveness of diagnostic tests. A variety of accuracy measures are used. One particularly widely used measure is the AUC, the area under the receiver operating characteristic (ROC) curve. This measure has a well-understood weakness when comparing ROC curves which cross. However, it also has the more fundamental weakness of failing to balance different kinds of misdiagnoses effectively. This is not merely an aspect of the inevitable arbitrariness in choosing a performance measure, but is a core property of the way the AUC is defined. This property is explored, and an alternative, the H measure, is described.

摘要

由于准确的诊断是医学的核心,因此能够评估诊断测试的有效性非常重要。有多种准确性度量标准可供使用。一个特别广泛使用的度量标准是 AUC,即接收器操作特征 (ROC) 曲线下的面积。当比较交叉的 ROC 曲线时,该度量标准具有一个众所周知的弱点。然而,它也具有更根本的弱点,即无法有效地平衡不同类型的误诊。这不仅仅是选择性能度量标准时不可避免的任意性的一个方面,而是 AUC 定义方式的核心属性。本文探讨了这一特性,并描述了一种替代方法,即 H 度量。

相似文献

1
Evaluating diagnostic tests: The area under the ROC curve and the balance of errors.评估诊断试验:ROC 曲线下面积和误差平衡。
Stat Med. 2010 Jun 30;29(14):1502-10. doi: 10.1002/sim.3859.
2
The partial area under the summary ROC curve.汇总ROC曲线下的部分面积。
Stat Med. 2005 Jul 15;24(13):2025-40. doi: 10.1002/sim.2103.
3
A new parametric method based on S-distributions for computing receiver operating characteristic curves for continuous diagnostic tests.一种基于S分布的用于连续诊断测试计算受试者工作特征曲线的新参数方法。
Stat Med. 2002 May 15;21(9):1213-35. doi: 10.1002/sim.1086.
4
An ROC-type measure of diagnostic accuracy when the gold standard is continuous-scale.当金标准为连续尺度时诊断准确性的一种ROC类测量方法。
Stat Med. 2006 Feb 15;25(3):481-93. doi: 10.1002/sim.2228.
5
Reliability and diagnostic accuracy of 5 physical examination tests and combination of tests for subacromial impingement.5种体格检查测试及测试组合对肩峰下撞击症的可靠性和诊断准确性
Arch Phys Med Rehabil. 2009 Nov;90(11):1898-903. doi: 10.1016/j.apmr.2009.05.015.
6
A non-inferiority test for diagnostic accuracy based on the paired partial areas under ROC curves.基于ROC曲线下配对局部面积的诊断准确性非劣效性检验。
Stat Med. 2008 May 10;27(10):1762-76. doi: 10.1002/sim.3121.
7
ROC methodology within a monitoring framework.监测框架内的ROC方法
Stat Med. 2003 Nov 30;22(22):3473-88. doi: 10.1002/sim.1580.
8
Adjusting the generalized ROC curve for covariates.针对协变量调整广义ROC曲线。
Stat Med. 2004 Nov 15;23(21):3319-31. doi: 10.1002/sim.1908.
9
The agreement chart as an alternative to the receiver-operating characteristic curve for diagnostic tests.作为诊断试验的接收者操作特征曲线替代方法的一致性图表。
J Clin Epidemiol. 2008 Sep;61(9):866-74. doi: 10.1016/j.jclinepi.2008.04.002.
10
Hierarchical models for ROC curve summary measures: design and analysis of multi-reader, multi-modality studies of medical tests.用于ROC曲线汇总指标的分层模型:医学检验多阅片者、多模态研究的设计与分析
Stat Med. 2008 Jan 30;27(2):243-56. doi: 10.1002/sim.2828.

引用本文的文献

1
Adapted generative latent diffusion models for accurate pathological analysis in chest X-ray images.适应生成式潜在扩散模型以实现胸部 X 光图像的精确病理分析。
Med Biol Eng Comput. 2024 Jul;62(7):2189-2212. doi: 10.1007/s11517-024-03056-5. Epub 2024 Mar 19.
2
Prediction of malignant lymph nodes in NSCLC by machine-learning classifiers using EBUS-TBNA and PET/CT.基于 EBUS-TBNA 和 PET/CT 的机器学习分类器预测 NSCLC 中的恶性淋巴结。
Sci Rep. 2022 Oct 20;12(1):17511. doi: 10.1038/s41598-022-21637-y.
3
Cardiovascular Risk Assessment Using Artificial Intelligence-Enabled Event Adjudication and Hematologic Predictors.
利用人工智能实现的事件裁决和血液学预测指标进行心血管风险评估。
Circ Cardiovasc Qual Outcomes. 2022 Jun;15(6):e008007. doi: 10.1161/CIRCOUTCOMES.121.008007. Epub 2022 Apr 28.
4
The subcortical basis of outcome and cognitive impairment in TBI: A longitudinal cohort study.脑外伤结局和认知障碍的皮质下基础:一项纵向队列研究。
Neurology. 2020 Oct 27;95(17):e2398-e2408. doi: 10.1212/WNL.0000000000010825. Epub 2020 Sep 9.
5
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.马修斯相关系数(MCC)在二分类评估中优于 F1 得分和准确率的优势。
BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.
6
Characterization of the vaginal microbiome during cytolytic vaginosis using high-throughput sequencing.使用高通量测序对细胞溶解性阴道病期间阴道微生物群进行特征分析。
J Clin Lab Anal. 2019 Jan;33(1):e22653. doi: 10.1002/jcla.22653. Epub 2018 Sep 10.
7
Accuracy of non-invasive prenatal testing using cell-free DNA for detection of Down, Edwards and Patau syndromes: a systematic review and meta-analysis.使用游离DNA进行无创产前检测以检测唐氏、爱德华兹和帕陶氏综合征的准确性:一项系统评价和荟萃分析。
BMJ Open. 2016 Jan 18;6(1):e010002. doi: 10.1136/bmjopen-2015-010002.
8
Comparison of HE4, CA125, and ROMA Diagnostic Accuracy: A Prospective and Multicenter Study for Chinese Women With Epithelial Ovarian Cancer.人附睾蛋白4(HE4)、癌抗原125(CA125)与风险卵巢恶性肿瘤算法(ROMA)诊断准确性的比较:一项针对中国上皮性卵巢癌女性患者的前瞻性多中心研究。
Medicine (Baltimore). 2015 Dec;94(52):e2402. doi: 10.1097/MD.0000000000002402.
9
Inference for correlated effect sizes using multiple univariate meta-analyses.使用多个单变量荟萃分析对相关效应量进行推断。
Stat Med. 2016 Apr 30;35(9):1405-22. doi: 10.1002/sim.6789. Epub 2015 Nov 4.
10
Enrichment of Minor Alleles of Common SNPs and Improved Risk Prediction for Parkinson's Disease.常见单核苷酸多态性次要等位基因的富集与帕金森病风险预测的改善
PLoS One. 2015 Jul 24;10(7):e0133421. doi: 10.1371/journal.pone.0133421. eCollection 2015.