Suppr超能文献

用于交叉验证的ROC曲线估计下面积的计算高效的置信区间。

Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates.

作者信息

LeDell Erin, Petersen Maya, van der Laan Mark

机构信息

Division of Biostatistics, University of California, Berkeley, Berkeley, CA 94720, USA.

出版信息

Electron J Stat. 2015;9(1):1583-1607. doi: 10.1214/15-EJS1035.

Abstract

In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC.

摘要

在二元分类问题中,ROC曲线下面积(AUC)通常用于评估预测模型的性能。通常,它会与交叉验证相结合,以评估结果如何推广到独立数据集。为了评估交叉验证AUC估计值的质量,我们获得其方差的估计值。对于海量数据集,生成单个性能估计值的过程在计算上可能成本很高。此外,当使用复杂的预测方法时,即使在相对较小的数据集上对预测模型进行交叉验证的过程仍可能需要大量计算时间。因此,在许多实际情况下,自助法是一种计算上难以处理的方差估计方法。作为自助法的替代方法,我们展示了一种基于影响曲线的计算高效方法,用于获得交叉验证AUC的方差估计值。

相似文献

2
An efficient variance estimator of AUC and its applications to binary classification.
Stat Med. 2020 Dec 10;39(28):4281-4300. doi: 10.1002/sim.8725. Epub 2020 Sep 10.
4
Model selection based on FDR-thresholding optimizing the area under the ROC-curve.
Stat Appl Genet Mol Biol. 2009;8:Article31. doi: 10.2202/1544-6115.1462. Epub 2009 Jun 25.
5
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
6
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
7
A new approach for interpretability and reliability in clinical risk prediction: Acute coronary syndrome scenario.
Artif Intell Med. 2021 Jul;117:102113. doi: 10.1016/j.artmed.2021.102113. Epub 2021 May 13.
8
On the analysis of glycomics mass spectrometry data via the regularized area under the ROC curve.
BMC Bioinformatics. 2007 Dec 12;8:477. doi: 10.1186/1471-2105-8-477.
9
Comparing multi-class classifier performance by multi-class ROC analysis: A nonparametric approach.
Neurocomputing (Amst). 2024 May 28;583. doi: 10.1016/j.neucom.2024.127520. Epub 2024 Mar 6.
10
On the use of min-max combination of biomarkers to maximize the partial area under the ROC curve.
J Probab Stat. 2019;2019. doi: 10.1155/2019/8953530. Epub 2019 Feb 3.

引用本文的文献

1
Next-generation AI framework for comprehensive oral leukoplakia evaluation and management.
NPJ Digit Med. 2025 Aug 10;8(1):513. doi: 10.1038/s41746-025-01885-8.
3
Circulating metabolite signatures indicate differential gut-liver crosstalk in lean and obese MASLD.
JCI Insight. 2025 Mar 18;10(8). doi: 10.1172/jci.insight.180943. eCollection 2025 Apr 22.
4
Deep learning on CT scans to predict checkpoint inhibitor treatment outcomes in advanced melanoma.
Sci Rep. 2024 Dec 30;14(1):31668. doi: 10.1038/s41598-024-81188-2.
5
GeM-LR: Discovering predictive biomarkers for small datasets in vaccine studies.
PLoS Comput Biol. 2024 Nov 14;20(11):e1012581. doi: 10.1371/journal.pcbi.1012581. eCollection 2024 Nov.
6
Cross-validation: what does it estimate and how well does it do it?
J Am Stat Assoc. 2024;119(546):1434-1445. doi: 10.1080/01621459.2023.2197686. Epub 2023 May 15.
7
Predicting neutralization susceptibility to combination HIV-1 monoclonal broadly neutralizing antibody regimens.
PLoS One. 2024 Sep 6;19(9):e0310042. doi: 10.1371/journal.pone.0310042. eCollection 2024.
8
Risk factors for infectious complications after gastrectomy in older patients.
Exp Ther Med. 2024 Jun 17;28(2):319. doi: 10.3892/etm.2024.12608. eCollection 2024 Aug.
10
Demographic patterns of walleye () reproductive success in a Wisconsin population.
Evol Appl. 2024 Mar 10;17(3):e13665. doi: 10.1111/eva.13665. eCollection 2024 Mar.

本文引用的文献

1
Targeted maximum likelihood estimation of natural direct effects.
Int J Biostat. 2012 Jan 6;8(1):/j/ijb.2012.8.issue-1/1557-4679.1361/1557-4679.1361.xml. doi: 10.2202/1557-4679.1361.
3
ROCR: visualizing classifier performance in R.
Bioinformatics. 2005 Oct 15;21(20):3940-1. doi: 10.1093/bioinformatics/bti623. Epub 2005 Aug 11.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验