Faraggi David, Reiser Benjamin
Department of Statistics, University of Haifa, Israel.
Stat Med. 2002 Oct 30;21(20):3093-106. doi: 10.1002/sim.1228.
The area under the receiver operating characteristic curve is frequently used as a measure for the effectiveness of diagnostic markers. In this paper we discuss and compare estimation procedures for this area. These are based on (i) the Mann-Whitney statistic; (ii) kernel smoothing; (iii) normal assumptions; (iv) empirical transformations to normality. These are compared in terms of bias and root mean square error in a large variety of situations by means of an extensive simulation study. Overall we find that transforming to normality usually is to be preferred except for bimodal cases where kernel methods can be effective.
接受者操作特征曲线下的面积常被用作诊断标志物有效性的一种度量。在本文中,我们讨论并比较了该面积的估计方法。这些方法基于:(i)曼 - 惠特尼统计量;(ii)核平滑;(iii)正态假设;(iv)向正态性的经验变换。通过广泛的模拟研究,在各种情况下从偏差和均方根误差方面对这些方法进行了比较。总体而言,我们发现除了双峰情况(此时核方法可能有效)外,通常优先选择向正态性变换的方法。