Institute of Statistical Science, Academia Sinica, Taipei, Taiwan.
Biostatistics. 2011 Apr;12(2):369-85. doi: 10.1093/biostatistics/kxq052. Epub 2010 Aug 20.
Rather than viewing receiver operating characteristic (ROC) curves directly to compare the performances of diagnostic methods, the whole and the partial areas under the ROC curve (area under the ROC curve [AUC] and partial area under the ROC curve [pAUC]) are 2 of the most popularly used summaries of the curve. Moreover, when high specificity is a prerequisite, as in some medical diagnostics, pAUC is preferable. In this paper, we propose a wrapper-type algorithm to select the best linear combination of markers that has high sensitivity within a confined specificity range. The markers selected by the proposed algorithm are different from those selected by AUC-based algorithms and therefore provide different information for further studies. Most notably, for example, within the given range of specificity, the markers selected by the proposed algorithm always have higher individual sensitivities than those selected by other AUC-based methods. This characteristic makes the proposed method a good addition to existing methods. Without assuming the underlying distributions of markers, we prove that the pAUC obtained with the proposed algorithm is a strongly consistent estimate of the true pAUC and then illustrate its performance with numerical studies using synthesized data and 2 real examples. The results are compared with those obtained by its AUC-based counterpart. We found that the classification performance of the final classifier based on the selected markers is very competitive.
与其直接查看接收器操作特性 (ROC) 曲线来比较诊断方法的性能,ROC 曲线下的整体和部分面积(ROC 曲线下的面积 [AUC] 和 ROC 曲线下的部分面积 [pAUC])是最常用的曲线摘要的 2 个。此外,当高特异性是先决条件时,如在某些医学诊断中,pAUC 是首选的。在本文中,我们提出了一种包装类型的算法来选择具有高灵敏度的标记的最佳线性组合,在受限的特异性范围内。由所提出的算法选择的标记与基于 AUC 的算法选择的标记不同,因此为进一步的研究提供了不同的信息。最值得注意的是,例如,在所给的特异性范围内,由所提出的算法选择的标记的个体灵敏度总是高于基于其他 AUC 的方法选择的标记。该特性使该方法成为现有方法的很好补充。在不假设标记的基础分布的情况下,我们证明了所提出的算法获得的 pAUC 是真实 pAUC 的强一致估计,然后使用合成数据和 2 个实际示例的数值研究来说明其性能。结果与基于 AUC 的对应物的结果进行了比较。我们发现,基于所选标记的最终分类器的分类性能非常有竞争力。