Hanley J A, McNeil B J
Radiology. 1983 Sep;148(3):839-43. doi: 10.1148/radiology.148.3.6878708.
Receiver operating characteristic (ROC) curves are used to describe and compare the performance of diagnostic technology and diagnostic algorithms. This paper refines the statistical comparison of the areas under two ROC curves derived from the same set of patients by taking into account the correlation between the areas that is induced by the paired nature of the data. The correspondence between the area under an ROC curve and the Wilcoxon statistic is used and underlying Gaussian distributions (binormal) are assumed to provide a table that converts the observed correlations in paired ratings of images into a correlation between the two ROC areas. This between-area correlation can be used to reduce the standard error (uncertainty) about the observed difference in areas. This correction for pairing, analogous to that used in the paired t-test, can produce a considerable increase in the statistical sensitivity (power) of the comparison. For studies involving multiple readers, this method provides a measure of a component of the sampling variation that is otherwise difficult to obtain.
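As a rough illustration of the pairing correction described above, the sketch below computes an ROC area as the Wilcoxon (Mann-Whitney) statistic and then forms a critical ratio z for the difference between two correlated areas, using a standard error of sqrt(SE1^2 + SE2^2 - 2*r*SE1*SE2). The function names `auc_wilcoxon` and `paired_auc_z` are hypothetical and chosen only for this example. The standard errors and the between-area correlation `r` are treated as inputs: `r` is assumed to have been read from the paper's conversion table, which is not reproduced here.

```python
import math


def auc_wilcoxon(abnormal, normal):
    """Area under the ROC curve computed as the Wilcoxon statistic.

    Estimates P(abnormal rating > normal rating), counting ties as 1/2.
    """
    wins = 0.0
    for x in abnormal:
        for y in normal:
            if x > y:
                wins += 1.0
            elif x == y:
                wins += 0.5
    return wins / (len(abnormal) * len(normal))


def paired_auc_z(a1, se1, a2, se2, r):
    """Critical ratio for the difference of two correlated ROC areas.

    a1, a2   : the two observed areas
    se1, se2 : their standard errors (assumed to be supplied by the caller)
    r        : correlation between the two areas (assumed to come from the
               paper's table; r = 0 gives the unpaired comparison)
    """
    var_diff = se1 ** 2 + se2 ** 2 - 2.0 * r * se1 * se2
    return (a1 - a2) / math.sqrt(var_diff)


if __name__ == "__main__":
    # Illustrative numbers only, not data from the paper.
    print(auc_wilcoxon([2, 3], [1, 2]))
    print(paired_auc_z(0.90, 0.03, 0.80, 0.04, r=0.0))
    print(paired_auc_z(0.90, 0.03, 0.80, 0.04, r=0.5))
```

With the same areas and standard errors, a positive `r` shrinks the standard error of the difference and so yields a larger z than the unpaired case, which is the gain in power the abstract refers to.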