Enserro Danielle M, Miller Austin
Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14263, USA.
Cancers (Basel). 2025 Apr 8;17(8):1259. doi: 10.3390/cancers17081259.
Proper confidence interval estimation of the area under the receiver operating characteristic curve (AUC), the net reclassification index (NRI), and the integrated discrimination improvement (IDI) is an area of ongoing research. The most common confidence interval estimation methods employ asymptotic theory. However, developments demonstrate that degeneration of the normal distribution assumption under the null hypothesis exists for measures such as the change in AUC (ΔAUC) and IDI, and confidence intervals estimated under the normal distribution assumption may be invalid. We aim to study the performance of confidence intervals derived assuming asymptotic theory and those derived with non-parametric bootstrapping methods. We examine the performance of ΔAUC, NRI, and IDI in both the logistic and survival regression context. We explore empirical distributions and compare coverage probabilities of asymptotic confidence intervals with those produced from bootstrapping methods through simulation. The primary finding in both the logistic framework and the survival analysis framework is that the percentile CIs performed well regarding coverage, without compromise to their width; this finding was robust in most scenarios. Our results suggest that the asymptotic intervals are only appropriate when a strong effect size of the added parameter exists, and that the percentile bootstrap interval exhibits at least a reasonable coverage while maintaining the shortest width in nearly all simulated scenarios, making this interval the most reliable choice. The intent is that these recommendations improve the accuracy in the estimation and the overall assessment of discrimination improvement.
对受试者工作特征曲线下面积(AUC)、净重新分类指数(NRI)和综合鉴别改善指数(IDI)进行恰当的置信区间估计是一个正在进行研究的领域。最常见的置信区间估计方法采用渐近理论。然而,研究进展表明,对于诸如AUC变化(ΔAUC)和IDI等指标,在原假设下正态分布假设存在退化情况,并且在正态分布假设下估计的置信区间可能无效。我们旨在研究基于渐近理论得出的置信区间以及通过非参数自助法得出的置信区间的性能。我们在逻辑回归和生存回归背景下检验ΔAUC、NRI和IDI的性能。我们通过模拟探索经验分布,并比较渐近置信区间与自助法产生的置信区间的覆盖概率。在逻辑框架和生存分析框架中的主要发现是,百分位数置信区间在覆盖方面表现良好,且宽度不受影响;这一发现在大多数情况下都很稳健。我们的结果表明,渐近区间仅在添加参数有强效应大小时才合适,并且百分位数自助区间在几乎所有模拟情况下都至少表现出合理的覆盖,同时保持最短的宽度,使其成为最可靠的选择。目的是这些建议能提高估计的准确性以及对鉴别改善的总体评估。