Perkins Neil J, Schisterman Enrique F, Vexler Albert
Eunice Kennedy Shriver National Institute of Child Health and Human Development, Rockville, MD 20852, USA.
Biom J. 2011 May;53(3):464-76. doi: 10.1002/bimj.201000083.
The receiver operating characteristic (ROC) curve is a tool commonly used to evaluate biomarker utility in clinical diagnosis of disease. Often, multiple biomarkers are developed to evaluate the discrimination for the same outcome. Levels of multiple biomarkers can be combined via best linear combination (BLC) such that their overall discriminatory ability is greater than any of them individually. Biomarker measurements frequently have undetectable levels below a detection limit sometimes denoted as limit of detection (LOD). Ignoring observations below the LOD or substituting some replacement value as a method of correction has been shown to lead to negatively biased estimates of the area under the ROC curve for some distributions of single biomarkers. In this paper, we develop asymptotically unbiased estimators, via the maximum likelihood technique, of the area under the ROC curve of BLC of two bivariate normally distributed biomarkers affected by LODs. We also propose confidence intervals for this area under curve. Point and confidence interval estimates are scrutinized by simulation study, recording bias and root mean square error and coverage probability, respectively. An example using polychlorinated biphenyl (PCB) levels to classify women with and without endometriosis illustrates the potential benefits of our methods.
受试者工作特征(ROC)曲线是一种常用于评估生物标志物在疾病临床诊断中效用的工具。通常,会开发多种生物标志物来评估对同一结果的区分能力。多种生物标志物的水平可以通过最佳线性组合(BLC)进行合并,以使它们的整体区分能力大于任何单个生物标志物。生物标志物测量结果常常存在低于检测限的不可检测水平,有时将其表示为检测限(LOD)。对于某些单生物标志物的分布,忽略低于LOD的观测值或用某些替代值进行校正已被证明会导致ROC曲线下面积的估计值出现负偏差。在本文中,我们通过最大似然技术,为受LOD影响的两个双变量正态分布生物标志物的BLC的ROC曲线下面积开发了渐近无偏估计量。我们还为该曲线下面积提出了置信区间。通过模拟研究对估计点和置信区间进行了仔细审查,分别记录了偏差、均方根误差和覆盖概率。一个使用多氯联苯(PCB)水平对患有和未患有子宫内膜异位症的女性进行分类的例子说明了我们方法的潜在益处。