Liu Danping, Zhou Xiao-Hua
Department of Biostatistics, University of Washington, Seattle, Washington 98195, USA.
Biometrics. 2010 Dec;66(4):1119-28. doi: 10.1111/j.1541-0420.2010.01397.x.
In estimation of the ROC curve, when the true disease status is subject to nonignorable missingness, the observed likelihood involves the missing mechanism given by a selection model. In this article, we proposed a likelihood-based approach to estimate the ROC curve and the area under the ROC curve when the verification bias is nonignorable. We specified a parametric disease model in order to make the nonignorable selection model identifiable. With the estimated verification and disease probabilities, we constructed four types of empirical estimates of the ROC curve and its area based on imputation and reweighting methods. In practice, a reasonably large sample size is required to estimate the nonignorable selection model in our settings. Simulation studies showed that all four estimators of ROC area performed well, and imputation estimators were generally more efficient than the other estimators proposed. We applied the proposed method to a data set from research in Alzheimer's disease.
在估计ROC曲线时,当真实疾病状态存在不可忽略的缺失时,观察到的似然性涉及由选择模型给出的缺失机制。在本文中,我们提出了一种基于似然性的方法,用于在验证偏差不可忽略时估计ROC曲线及其下面积。我们指定了一个参数化疾病模型,以使不可忽略的选择模型可识别。利用估计的验证概率和疾病概率,我们基于插补和重加权方法构建了四种类型的ROC曲线及其面积的经验估计。在实际中,在我们的设定下需要相当大的样本量来估计不可忽略的选择模型。模拟研究表明,所有四种ROC面积估计量表现良好,并且插补估计量通常比其他提出的估计量更有效。我们将所提出的方法应用于来自阿尔茨海默病研究的一个数据集。