Scientific Directorate, IRCCS Istituto Giannina Gaslini, Genoa, Italy.
Rulex Innovation Labs, Genoa, Italy.
Epidemiol Health. 2022;44:e2022088. doi: 10.4178/epih.e2022088. Epub 2022 Oct 17.
The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves.
We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs.
Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients.
The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve.
接收者操作特征(ROC)曲线下面积(AUC)是一种常用的衡量纯诊断准确性的指标,它与分析样本中患病者的比例无关。然而,它在临床环境中的实际用途受到了质疑,因为它似乎与诊断标志物在实际临床环境中识别患病和非患病个体的实际性能没有直接关系。本研究评估了 AUC 与相应 ROC 曲线形状相关的正确分类比例(总体诊断准确性,GDA)之间的关系。
我们证明,对于平衡组的最佳准确性截止值,AUC 代表了 GDA 的向上偏倚度量。偏倚的大小取决于 ROC 图的形状和患病者与非患病者的比例。在适当的曲线中,偏倚与患病/非患病比例无关,可以轻松估计和消除。此外,两个部分 AUC 之间的比较可以用更强大的对应整个 AUC 的检验来代替。
对 3 个真实数据集进行了应用:儿童激素缺乏的标志物、2 种恶性间皮瘤的肿瘤标志物以及卵巢癌患者的 2 个基因表达谱。
AUC 是一种用于评估疾病标志物的具有潜在临床相关性的准确性衡量标准。ROC 参数的临床意义应始终通过分析相应 ROC 曲线的形状来评估。