Yao Wenliang, Li Zhaohai, Graubard Barry I
Department of Statistics, The George Washington University, Washington, 20052, DC, U.S.A.; Clinical Biostatistics, MedImmune, LLC, Gaithersburg, 20878, MD, U.S.A.
Stat Med. 2015 Apr 15;34(8):1293-303. doi: 10.1002/sim.6405. Epub 2014 Dec 29.
The receiver operating characteristic (ROC) curve can be utilized to evaluate the performance of diagnostic tests. The area under the ROC curve (AUC) is a widely used summary index for comparing multiple ROC curves. Both parametric and nonparametric methods have been developed to estimate and compare the AUCs. However, these methods are usually only applicable to data collected from simple random samples and not surveys and epidemiologic studies that use complex sample designs such as stratified and/or multistage cluster sampling with sample weighting. Such complex samples can inflate variances from intra-cluster correlation and alter the expectations of test statistics because of the use of sample weights that account for differential sampling rates. In this paper, we modify the nonparametric method to incorporate sampling weights to estimate the AUC and employ leaving-one-out jackknife methods along with the balanced repeated replication method to account for the effects of the complex sampling in the variance estimation of our proposed estimators of the AUC. The finite sample properties of our methods are evaluated using simulations, and our methods are illustrated by comparing the estimated AUC for predicting overweight/obesity using different measures of body weight and adiposity among sampled children and adults in the US Hispanic Health and Nutrition Examination Survey.
接收者操作特征(ROC)曲线可用于评估诊断测试的性能。ROC曲线下面积(AUC)是用于比较多条ROC曲线的广泛使用的汇总指标。已经开发了参数和非参数方法来估计和比较AUC。然而,这些方法通常仅适用于从简单随机样本收集的数据,而不适用于使用复杂样本设计(如分层和/或多阶段整群抽样并带有样本加权)的调查和流行病学研究。此类复杂样本会因群内相关性而使方差膨胀,并由于使用考虑不同抽样率的样本权重而改变检验统计量的期望。在本文中,我们修改了非参数方法以纳入抽样权重来估计AUC,并采用留一法交叉验证方法以及平衡重复复制方法来考虑复杂抽样在我们提出的AUC估计量方差估计中的影响。我们使用模拟评估了我们方法的有限样本性质,并通过比较在美国西班牙裔健康与营养检查调查中抽样的儿童和成年人中使用不同体重和肥胖度测量方法预测超重/肥胖的估计AUC来说明我们的方法。