Department of Statistics, North Carolina State University, Raleigh, North Carolina.
Stat Med. 2019 Aug 15;38(18):3361-3377. doi: 10.1002/sim.8181. Epub 2019 May 3.
The receiver operating characteristic (ROC) surface, as a generalization of the ROC curve, has been widely used to assess the accuracy of a diagnostic test for three categories. A common problem is verification bias, referring to the situation where not all subjects have their true classes verified. In this paper, we consider the problem of estimating the ROC surface under verification bias. We adopt a Bayesian nonparametric approach by directly modeling the underlying distributions of the three categories by Dirichlet process mixture priors. We propose a robust computing algorithm by only imposing a missing at random assumption for the verification process but no assumption on the distributions. The method can also accommodate covariates information in estimating the ROC surface, which can lead to a more comprehensive understanding of the diagnostic accuracy. It can be adapted and hugely simplified in the case where there is no verification bias, and very fast computation is possible through the Bayesian bootstrap process. The proposed method is compared with other commonly used methods by extensive simulations. We find that the proposed method generally outperforms other approaches. Applying the method to two real datasets, the key findings are as follows: (1) human epididymis protein 4 has a slightly better diagnosis ability compared to CA125 in discriminating healthy, early stage, and late stage patients of epithelial ovarian cancer. (2) Serum albumin has a prognostic ability in distinguishing different stages of hepatocellular carcinoma.
受试者工作特征(ROC)曲面是 ROC 曲线的推广,已广泛用于评估三类诊断测试的准确性。一个常见的问题是验证偏差,指的是并非所有受试者的真实类别都得到验证的情况。在本文中,我们考虑在存在验证偏差的情况下估计 ROC 曲面的问题。我们通过直接使用 Dirichlet 过程混合先验对三个类别的基础分布进行贝叶斯非参数建模来解决这个问题。我们提出了一种稳健的计算算法,仅对验证过程施加随机缺失假设,而不对分布做出任何假设。该方法还可以在估计 ROC 曲面时包含协变量信息,从而可以更全面地了解诊断准确性。在不存在验证偏差的情况下,可以对其进行调整和极大简化,并且通过贝叶斯自举过程可以实现非常快速的计算。通过广泛的模拟,我们将提出的方法与其他常用方法进行了比较。我们发现,该方法通常优于其他方法。将该方法应用于两个真实数据集,得出的主要发现如下:(1)与 CA125 相比,人附睾蛋白 4 在区分上皮性卵巢癌的健康、早期和晚期患者方面具有稍好的诊断能力。(2)血清白蛋白在区分肝细胞癌的不同阶段方面具有预后能力。