Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, MD, USA.
Epidemiology Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, MD, USA.
Stat Methods Med Res. 2022 Aug;31(8):1470-1483. doi: 10.1177/09622802221094940. Epub 2022 Apr 21.
In many diagnostic accuracy studies, orders may be available on multiple receiver operating characteristic curves. For example, being closer to delivery, fetal ultrasound measures in the third trimester should be no less accurate than those in the second trimester in predicting small-for-gestational-age births. Such an order should be incorporated in estimating receiver operating characteristic curves and associated summary accuracy statistics, as it can potentially improve statistical efficiency of these estimates. Early work in the literature has mainly taken an indirect approach to this task and has induced the desired order through modeling test score distributions. We instead propose a new strategy that incorporates the order directly through the modeling of receiver operating characteristic curves. We achieve this by exploiting the link between placement value (the relative position of a diseased test score in the healthy score distribution), the cumulative distribution function of placement value, and receiver operating characteristic curve, and by building stochastically ordered random variables through mixture distributions. We take a Bayesian semiparametric approach in using Dirichlet process mixture models so that the placement values can be flexibly modeled. We conduct extensive simulation studies to examine the performance of the proposed methodology and apply the new framework to data from obstetrics and women's health studies.
在许多诊断准确性研究中,可能会有多个接收器操作特征曲线的订单。例如,在预测小于胎龄儿出生时,接近分娩的胎儿超声测量值在预测小胎龄儿出生时应该不比第二孕期的测量值准确性差。这种订单应该包含在估计接收器操作特征曲线和相关的综合准确性统计中,因为它可以潜在地提高这些估计的统计效率。文献中的早期工作主要采用间接方法来完成这项任务,并通过对测试分数分布进行建模来得出所需的顺序。相反,我们提出了一种新的策略,通过对接收器操作特征曲线进行建模来直接包含订单。我们通过利用位置值(患病测试分数在健康分数分布中的相对位置)、位置值的累积分布函数和接收器操作特征曲线之间的联系,以及通过混合分布构建随机排序的随机变量来实现这一点。我们采用贝叶斯半参数方法使用狄利克雷过程混合模型,以便灵活地对位置值进行建模。我们进行了广泛的模拟研究,以检验所提出的方法的性能,并将新框架应用于妇产科研究的数据。