通过最大化线性风险评分的 ROC 曲线下部分面积来选择标志物。

Marker selection via maximizing the partial area under the ROC curve of linear risk scores.

机构信息

Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.

出版信息

Biostatistics. 2011 Apr;12(2):369-85. doi: 10.1093/biostatistics/kxq052. Epub 2010 Aug 20.

PMID:20729218

Abstract

Rather than viewing receiver operating characteristic (ROC) curves directly to compare the performances of diagnostic methods, the whole and the partial areas under the ROC curve (area under the ROC curve [AUC] and partial area under the ROC curve [pAUC]) are 2 of the most popularly used summaries of the curve. Moreover, when high specificity is a prerequisite, as in some medical diagnostics, pAUC is preferable. In this paper, we propose a wrapper-type algorithm to select the best linear combination of markers that has high sensitivity within a confined specificity range. The markers selected by the proposed algorithm are different from those selected by AUC-based algorithms and therefore provide different information for further studies. Most notably, for example, within the given range of specificity, the markers selected by the proposed algorithm always have higher individual sensitivities than those selected by other AUC-based methods. This characteristic makes the proposed method a good addition to existing methods. Without assuming the underlying distributions of markers, we prove that the pAUC obtained with the proposed algorithm is a strongly consistent estimate of the true pAUC and then illustrate its performance with numerical studies using synthesized data and 2 real examples. The results are compared with those obtained by its AUC-based counterpart. We found that the classification performance of the final classifier based on the selected markers is very competitive.

摘要

与其直接查看接收器操作特性 (ROC) 曲线来比较诊断方法的性能，ROC 曲线下的整体和部分面积（ROC 曲线下的面积 [AUC] 和 ROC 曲线下的部分面积 [pAUC]）是最常用的曲线摘要的 2 个。此外，当高特异性是先决条件时，如在某些医学诊断中，pAUC 是首选的。在本文中，我们提出了一种包装类型的算法来选择具有高灵敏度的标记的最佳线性组合，在受限的特异性范围内。由所提出的算法选择的标记与基于 AUC 的算法选择的标记不同，因此为进一步的研究提供了不同的信息。最值得注意的是，例如，在所给的特异性范围内，由所提出的算法选择的标记的个体灵敏度总是高于基于其他 AUC 的方法选择的标记。该特性使该方法成为现有方法的很好补充。在不假设标记的基础分布的情况下，我们证明了所提出的算法获得的 pAUC 是真实 pAUC 的强一致估计，然后使用合成数据和 2 个实际示例的数值研究来说明其性能。结果与基于 AUC 的对应物的结果进行了比较。我们发现，基于所选标记的最终分类器的分类性能非常有竞争力。

相似文献

Marker selection via maximizing the partial area under the ROC curve of linear risk scores.

Biostatistics. 2011 Apr;12(2):369-85. doi: 10.1093/biostatistics/kxq052. Epub 2010 Aug 20.

An extension of the receiver operating characteristic curve and AUC-optimal classification.

Neural Comput. 2012 Oct;24(10):2789-824. doi: 10.1162/NECO_a_00336. Epub 2012 Jun 26.

The partial area under the summary ROC curve.

Stat Med. 2005 Jul 15;24(13):2025-40. doi: 10.1002/sim.2103.

A PAUC-based estimation technique for disease classification and biomarker selection.

Stat Appl Genet Mol Biol. 2012 Oct 1;11(5):/j/sagmb.2012.11.issue-5/1544-6115.1792/1544-6115.1792.xml. doi: 10.1515/1544-6115.1792.

Adjusting the generalized ROC curve for covariates.

Stat Med. 2004 Nov 15;23(21):3319-31. doi: 10.1002/sim.1908.

A new parametric method based on S-distributions for computing receiver operating characteristic curves for continuous diagnostic tests.

Stat Med. 2002 May 15;21(9):1213-35. doi: 10.1002/sim.1086.

On linear combinations of dichotomizers for maximizing the area under the ROC curve.

IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):610-20. doi: 10.1109/TSMCB.2010.2060325. Epub 2010 Aug 30.

Proteomic classification of pancreatic adenocarcinoma tissue using protein chip technology.

Gastroenterology. 2006 May;130(6):1670-8. doi: 10.1053/j.gastro.2006.02.036. Epub 2006 Mar 6.

Quantitative assessment of tumour extraction from dermoscopy images and evaluation of computer-based extraction methods for an automatic melanoma diagnostic system.

Melanoma Res. 2006 Apr;16(2):183-90. doi: 10.1097/01.cmr.0000215041.76553.58.

Peak selection from MALDI-TOF mass spectra using ant colony optimization.

Bioinformatics. 2007 Mar 1;23(5):619-26. doi: 10.1093/bioinformatics/btl678. Epub 2007 Jan 19.

引用本文的文献

Estimation and inference on the partial volume under the receiver operating characteristic surface.

Stat Methods Med Res. 2024 Sep;33(9):1577-1594. doi: 10.1177/09622802241267356. Epub 2024 Aug 8.

Combining multiple biomarkers linearly to minimize the Euclidean distance of the closest point on the receiver operating characteristic surface to the perfection corner in trichotomous settings.

Stat Methods Med Res. 2024 Apr;33(4):647-668. doi: 10.1177/09622802241233768. Epub 2024 Mar 6.

Conditional concordance-assisted learning under matched case-control design for combining biomarkers for population screening.

Stat Med. 2023 Apr 30;42(9):1398-1411. doi: 10.1002/sim.9677. Epub 2023 Feb 2.

Construction of joint confidence spaces for the optimal true class fraction triplet in the ROC space using alternative biomarker cutoffs.

Biom J. 2022 Aug;64(6):1023-1039. doi: 10.1002/bimj.202100132. Epub 2022 May 13.

Confidence Interval Estimation of the Youden index and corresponding cut-point for a combination of biomarkers under normality.

Commun Stat Theory Methods. 2022;51(2):501-518. doi: 10.1080/03610926.2020.1751852. Epub 2020 Apr 27.

Combining biomarkers by maximizing the true positive rate for a fixed false positive rate.

Biom J. 2021 Aug;63(6):1223-1240. doi: 10.1002/bimj.202000210. Epub 2021 Apr 19.

Identifying optimal biomarker combinations for treatment selection through randomized controlled trials.

Clin Trials. 2015 Aug;12(4):348-56. doi: 10.1177/1740774515580126. Epub 2015 May 6.

Threshold-free measures for assessing the performance of medical screening tests.

Front Public Health. 2015 Apr 20;3:57. doi: 10.3389/fpubh.2015.00057. eCollection 2015.

AucPR: an AUC-based approach using penalized regression for disease prediction with high-dimensional omics data.

BMC Genomics. 2014;15 Suppl 10(Suppl 10):S1. doi: 10.1186/1471-2164-15-S10-S1. Epub 2014 Dec 12.

Biomarker selection for medical diagnosis using the partial area under the ROC curve.

BMC Res Notes. 2014 Jan 10;7:25. doi: 10.1186/1756-0500-7-25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过最大化线性风险评分的 ROC 曲线下部分面积来选择标志物。

Marker selection via maximizing the partial area under the ROC curve of linear risk scores.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献