A novel estimator for the two-way partial AUC.

Author information

Sage Bionetworks, 2901 Third Avenue, 98121, Seattle, USA.

Publication information

BMC Med Inform Decis Mak. 2024 Feb 20;24(1):57. doi: 10.1186/s12911-023-02382-2.

DOI:10.1186/s12911-023-02382-2

PMID:38378636

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10877829/

Abstract

BACKGROUND

The two-way partial AUC was recently proposed as a way to directly quantify the partial area under the ROC curve with simultaneous restrictions on the sensitivity and specificity ranges of diagnostic tests or classifiers. The metric, as originally implemented in the tpAUC R package, is estimated with a nonparametric estimator based on a trimmed Mann-Whitney U-statistic, which becomes computationally expensive for large samples. (Its computational complexity is of order [Formula: see text], where [Formula: see text] and [Formula: see text] represent the number of positive and negative cases, respectively.) This is problematic because the statistical methodology for comparing estimates from alternative diagnostic tests or classifiers relies on bootstrap resampling and requires recomputing the estimator on a large number of bootstrap samples.
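
To illustrate why a pairwise estimator scales with the product of the two class sizes, here is a minimal Python sketch of a trimmed Mann-Whitney style estimate. It is not the tpAUC package's code. The function name `tpauc_pairwise`, the convention that higher scores indicate the positive class, and the ceiling-based trimming rule are all assumptions made for illustration. The sketch uses the probabilistic reading of the two-way partial AUC for the region with sensitivity at least `alpha` and false positive rate at most `beta`: count concordant pairs, but only among the lowest-scoring positives and the highest-scoring negatives.

```python
import math


def tpauc_pairwise(pos, neg, alpha, beta):
    """Trimmed pairwise estimate of the two-way partial AUC (illustrative sketch).

    Region of interest: sensitivity (TPR) >= alpha and false positive rate <= beta.
    Assumes higher scores indicate the positive class.
    """
    n_pos, n_neg = len(pos), len(neg)
    # Assumed trimming rule: keep the lowest (1 - alpha) share of positives and
    # the highest beta share of negatives; the tolerance guards float noise.
    k_pos = math.ceil((1 - alpha) * n_pos - 1e-12)
    k_neg = math.ceil(beta * n_neg - 1e-12)
    low_pos = sorted(pos)[:k_pos]
    high_neg = sorted(neg, reverse=True)[:k_neg]

    concordant = 0.0
    # The nested loop is the expensive part: its cost grows with the
    # product of the two (trimmed) class sizes.
    for x in low_pos:
        for y in high_neg:
            if x > y:
                concordant += 1.0
            elif x == y:
                concordant += 0.5  # ties count as half, as in the usual U-statistic
    return concordant / (n_pos * n_neg)
```

With `alpha = 0` and `beta = 1` no trimming occurs and the function reduces to the ordinary Mann-Whitney estimate of the full AUC.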

METHODS

By leveraging the graphical and probabilistic representations of the AUC, partial AUCs, and two-way partial AUC, we derive a novel estimator for the two-way partial AUC that can be computed directly from the output of any software able to compute the AUC and partial AUCs. We implemented our estimator with the computationally efficient pROC R package, which uses a nonparametric approach based on the trapezoidal rule to compute AUC and partial AUC scores. (Its computational complexity is of order [Formula: see text], where [Formula: see text].) We compare the empirical bias and computation time of the proposed estimator against the original estimator provided in the tpAUC package in a series of simulation studies and on two real datasets.
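
The graphical argument can be sketched in a few lines of Python. The identity used below comes from a simple inclusion-exclusion on areas in the ROC unit square and may not match the paper's expression term for term: when the ROC curve at false positive rate `beta` reaches at least sensitivity `alpha`, the two-way partial area equals pAUC(FPR <= beta) + pAUC(TPR >= alpha) - AUC + alpha * (1 - beta), and it is zero otherwise. All function names are hypothetical, pROC is not used, and higher scores are assumed to indicate the positive class.

```python
def roc_points(pos, neg):
    """Empirical ROC vertices (FPR, TPR); the sort dominates the cost."""
    scored = sorted([(s, 1) for s in pos] + [(s, 0) for s in neg],
                    key=lambda t: -t[0])
    pts, tp, fp, i = [(0.0, 0.0)], 0, 0, 0
    while i < len(scored):
        s = scored[i][0]
        while i < len(scored) and scored[i][0] == s:  # tied scores move together
            tp += scored[i][1]
            fp += 1 - scored[i][1]
            i += 1
        pts.append((fp / len(neg), tp / len(pos)))
    return pts


def area_up_to(pts, x_max):
    """Trapezoidal area under a piecewise-linear curve for x in [0, x_max]."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x0 >= x_max:
            break
        if x1 == x0:  # vertical segment adds no area
            continue
        xr = min(x1, x_max)
        yr = y0 + (y1 - y0) * (xr - x0) / (x1 - x0)
        area += 0.5 * (y0 + yr) * (xr - x0)
    return area


def height_at(pts, x):
    """Height of the curve at x (top of any vertical step located at x)."""
    h = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        if x1 <= x:
            h = y1
        elif x0 <= x:
            h = y0 + (y1 - y0) * (x - x0) / (x1 - x0)
            break
        else:
            break
    return h


def tpauc_from_areas(pos, neg, alpha, beta):
    """Two-way partial AUC assembled from AUC and the two one-way partial AUCs."""
    pts = roc_points(pos, neg)
    if height_at(pts, beta) < alpha:
        return 0.0  # the curve never enters the region of interest
    auc = area_up_to(pts, 1.0)
    pauc_fpr = area_up_to(pts, beta)  # vertical strip FPR <= beta
    # Reflecting the curve swaps the axes, so TPR >= alpha becomes a
    # vertical strip of width 1 - alpha on the reflected curve.
    flipped = [(1 - t, 1 - f) for f, t in reversed(pts)]
    pauc_tpr = area_up_to(flipped, 1 - alpha)
    return pauc_fpr + pauc_tpr - auc + alpha * (1 - beta)
```

Apart from sorting, every step is a single pass over the ROC vertices, which is why this route avoids the pairwise comparison of positive and negative cases.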

RESULTS

Our estimator tended to be less biased than the original estimator based on the trimmed Mann-Whitney U-statistic across all experiments, and showed considerably less bias in the experiments based on small sample sizes. Most importantly, because the computational complexity of the proposed estimator is of order [Formula: see text], rather than [Formula: see text], it is much faster to compute when sample sizes are large.

CONCLUSIONS

The proposed estimator improves the computation of the two-way partial AUC and allows the comparison of diagnostic tests or machine learning classifiers in large datasets, where repeated computation of the original estimator on bootstrap samples becomes too expensive.
