Department of Radiation Oncology, Division of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX.
Insitro, South San Francisco, CA.
JCO Clin Cancer Inform. 2024 Sep;8:e2400102. doi: 10.1200/CCI.24.00102.
A previous study demonstrated that power against the (unobserved) true effect for the primary end point (PEP) of most phase III oncology trials is low, suggesting an increased risk of false-negative findings in the field of late-phase oncology. Fitting models with prognostic covariates is a potential solution to improve power; however, the extent to which trials leverage this approach, and its impact on trial interpretation at scale, is unknown. To that end, we hypothesized that phase III trials using multivariable PEP analyses are more likely to demonstrate superiority versus trials with univariable analyses.
PEP analyses were reviewed from trials registered on ClinicalTrials.gov. Adjusted odds ratios (aORs) were calculated by logistic regressions.
Of the 535 trials enrolling 454,824 patients, 69% (n = 368) used a multivariable PEP analysis. Trials with multivariable PEP analyses were more likely to demonstrate PEP superiority (57% [209 of 368] 42% [70 of 167]; aOR, 1.78 [95% CI, 1.18 to 2.72]; = .007). Among trials with a multivariable PEP model, 16 conditioned on covariates and 352 stratified by covariates. However, 108 (35%) of 312 trials with stratified analyses lost power by categorizing a continuous variable, which was especially common among immunotherapy trials (aOR, 2.45 [95% CI, 1.23 to 4.92]; = .01).
Trials increasing power by fitting multivariable models were more likely to demonstrate PEP superiority than trials with unadjusted analysis. Underutilization of conditioning models and empirical power loss associated with covariate categorization required by stratification were identified as barriers to power gains. These findings underscore the opportunity to increase power in phase III trials with conventional methodology and improve patient access to effective novel therapies.
先前的研究表明,大多数 III 期肿瘤学试验主要终点(PEP)的实际效果的效力很低,这表明在晚期肿瘤学领域中假阴性结果的风险增加。使用预后协变量拟合模型是提高效力的一种潜在解决方案;然而,试验利用这种方法的程度及其对大规模试验解释的影响尚不清楚。为此,我们假设使用多变量 PEP 分析的 III 期试验比具有单变量分析的试验更有可能显示优越性。
对 ClinicalTrials.gov 上注册的试验进行了 PEP 分析审查。通过逻辑回归计算了调整后的优势比(aOR)。
在纳入 454824 名患者的 535 项试验中,69%(n=368)使用了多变量 PEP 分析。使用多变量 PEP 分析的试验更有可能显示 PEP 优越性(57%[209/368] 比 42%[70/167];aOR,1.78[95%CI,1.18 至 2.72]; =.007)。在具有多变量 PEP 模型的试验中,16 个模型与协变量相关,352 个模型按协变量分层。然而,在 312 项具有分层分析的试验中,有 108 项(35%)因将连续变量分类而失去了效力,这在免疫治疗试验中尤为常见(aOR,2.45[95%CI,1.23 至 4.92]; =.01)。
通过拟合多变量模型来提高效力的试验比未调整分析的试验更有可能显示 PEP 优越性。确定了使用分层需要的协变量分类的建模模型的利用率低和与效力损失相关的问题,这些问题是提高效力的障碍。这些发现强调了使用常规方法在 III 期试验中提高效力并改善患者获得有效新型疗法的机会。