Cho Hunyong, Matthews Gregory J, Harel Ofer
Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, USA.
Department of Mathematics and Statistics, Loyola University Chicago, 1032 W. Sheridan Road, Chicago, IL 60660, USA.
Int Stat Rev. 2019 Apr;87(1):152-177. doi: 10.1111/insr.12277. Epub 2018 Aug 9.
Receiver operating characteristic curves are widely used as a measure of accuracy of diagnostic tests and can be summarised using the area under the receiver operating characteristic curve (AUC). Often, it is useful to construct a confidence interval for the AUC; however, because there are a number of different proposed methods to measure variance of the AUC, there are thus many different resulting methods for constructing these intervals. In this article, we compare different methods of constructing Wald-type confidence interval in the presence of missing data where the missingness mechanism is ignorable. We find that constructing confidence intervals using multiple imputation based on logistic regression gives the most robust coverage probability and the choice of confidence interval method is less important. However, when missingness rate is less severe (e.g. less than 70%), we recommend using Newcombe's Wald method for constructing confidence intervals along with multiple imputation using predictive mean matching.
接收者操作特征曲线被广泛用作诊断测试准确性的一种度量,并且可以使用接收者操作特征曲线下的面积(AUC)进行总结。通常,为AUC构建一个置信区间是很有用的;然而,由于有许多不同的方法来测量AUC的方差,因此有许多不同的构建这些区间的方法。在本文中,我们比较了在缺失数据存在且缺失机制可忽略的情况下构建 Wald 型置信区间的不同方法。我们发现,基于逻辑回归使用多重填补来构建置信区间能给出最稳健的覆盖概率,并且置信区间方法的选择不太重要。然而,当缺失率不太严重(例如小于70%)时,我们建议使用纽科姆的 Wald 方法来构建置信区间,并结合使用预测均值匹配的多重填补。