Rodenberg C, Zhou X H
Procter and Gamble, Inc., Cincinnati, Ohio, USA.
Biometrics. 2000 Dec;56(4):1256-62. doi: 10.1111/j.0006-341x.2000.01256.x.
A receiver operating characteristic (ROC) curve is commonly used to measure the accuracy of a medical test. It is a plot of the true positive fraction (sensitivity) against the false positive fraction (1-specificity) for increasingly stringent positivity criterion. Bias can occur in estimation of an ROC curve if only some of the tested patients are selected for disease verification and if analysis is restricted only to the verified cases. This bias is known as verification bias. In this paper, we address the problem of correcting for verification bias in estimation of an ROC curve when the verification process and efficacy of the diagnostic test depend on covariates. Our method applies the EM algorithm to ordinal regression models to derive ML estimates for ROC curves as a function of covariates, adjusted for covariates affecting the likelihood of being verified. Asymptotic variance estimates are obtained using the observed information matrix of the observed data. These estimates are derived under the missing-at-random assumption, which means that selection for disease verification depends only on the observed data, i.e., the test result and the observed covariates. We also address the issues of model selection and model checking. Finally, we illustrate the proposed method on data from a two-phase study of dementia disorders, where selection for verification depends on the screening test result and age.
接收器操作特征(ROC)曲线通常用于衡量医学检验的准确性。它是针对越来越严格的阳性标准,将真阳性率(灵敏度)与假阳性率(1-特异度)绘制而成的曲线。如果仅选择部分受试患者进行疾病验证,并且分析仅限于已验证的病例,那么在ROC曲线估计中可能会出现偏差。这种偏差称为验证偏差。在本文中,我们解决了在验证过程和诊断检验的效能取决于协变量时,校正ROC曲线估计中的验证偏差问题。我们的方法将期望最大化(EM)算法应用于有序回归模型,以得出作为协变量函数的ROC曲线的极大似然估计,并针对影响验证可能性的协变量进行了调整。使用观测数据的观测信息矩阵获得渐近方差估计。这些估计是在随机缺失假设下得出的,这意味着疾病验证的选择仅取决于观测数据,即检验结果和观测到的协变量。我们还讨论了模型选择和模型检验的问题。最后,我们用痴呆症两阶段研究的数据说明了所提出的方法,其中验证的选择取决于筛查检验结果和年龄。