University of Kansas Medical Center, Kansas City, KS, USA.
EMB Statistical Solutions, LLC KS, USA.
Stat Methods Med Res. 2024 Apr;33(4):647-668. doi: 10.1177/09622802241233768. Epub 2024 Mar 6.
The performance of individual biomarkers in discriminating between two groups, typically the healthy and the diseased, may be limited. Thus, there is interest in developing statistical methodologies for biomarker combinations with the aim of improving upon the individual discriminatory performance. There is extensive literature referring to biomarker combinations under the two-class setting. However, the corresponding literature under a three-class setting is limited. In our study, we provide parametric and nonparametric methods that allow investigators to optimally combine biomarkers that seek to discriminate between three classes by minimizing the Euclidean distance from the receiver operating characteristic surface to the perfection corner. Using this Euclidean distance as the objective function allows for estimation of the optimal combination coefficients along with the optimal cutoff values for the combined score. An advantage of the proposed methods is that they can accommodate biomarker data from all three groups simultaneously, as opposed to a pairwise analysis such as the one implied by the three-class Youden index. We illustrate that the derived true classification rates exhibit narrower confidence intervals than those derived from the Youden-based approach under a parametric, flexible parametric, and nonparametric kernel-based framework. We evaluate our approaches through extensive simulations and apply them to real data sets that refer to liver cancer patients.
单个生物标志物在区分两组(通常是健康组和疾病组)方面的性能可能有限。因此,人们有兴趣开发用于生物标志物组合的统计方法,目的是提高个体的区分性能。有大量文献涉及两分类情况下的生物标志物组合。然而,三分类情况下的相应文献是有限的。在我们的研究中,我们提供了参数和非参数方法,允许研究人员通过最小化从接收器操作特性曲面到完美角的欧几里得距离来最优地组合旨在区分三类的生物标志物。使用该欧几里得距离作为目标函数可以估计最优组合系数以及组合得分的最优截止值。所提出方法的一个优点是,它们可以同时适应来自所有三组的生物标志物数据,而不是像三分类 Youden 指数所暗示的那样进行两两分析。我们表明,在所提出的参数、灵活参数和非参数核框架下,基于推导的真实分类率的置信区间比基于 Youden 的方法推导的置信区间更窄。我们通过广泛的模拟评估我们的方法,并将其应用于涉及肝癌患者的真实数据集。