Chaudhary M A, Edmondson-Jones M, Baio G, Mackay E, Penrod J R, Sharpe D J, Yates G, Rafiq S, Johannesen K, Siddiqui M K, Vanderpuye-Orgle J, Briggs A
Bristol-Myers Squibb, Princeton, NJ, USA.
Parexel International Corp, London, UK.
Med Decis Making. 2023 Jan;43(1):91-109. doi: 10.1177/0272989X221132257. Epub 2022 Oct 19.
Immuno-oncology (IO) therapies are often associated with delayed responses that are deep and durable, manifesting as long-term survival benefits in patients with metastatic cancer. Complex hazard functions arising from IO treatments may limit the accuracy of extrapolations from standard parametric models (SPMs). We evaluated the ability of flexible parametric models (FPMs) to improve survival extrapolations using data from 2 trials involving patients with non-small-cell lung cancer (NSCLC).
Our analyses used consecutive database locks (DBLs) at 2-, 3-, and 5-y minimum follow-up from trials evaluating nivolumab versus docetaxel in patients with pretreated metastatic squamous (CheckMate-017) and nonsquamous (CheckMate-057) NSCLC. For each DBL, SPMs, as well as 3 FPMs-landmark response models (LRMs), mixture cure models (MCMs), and Bayesian multiparameter evidence synthesis (B-MPES)-were estimated on nivolumab overall survival (OS). The performance of each parametric model was assessed by comparing milestone restricted mean survival times (RMSTs) and survival probabilities with results obtained from externally validated SPMs.
For the 2- and 3-y DBLs of both trials, all models tended to underestimate 5-y OS. Predictions from nonvalidated SPMs fitted to the 2-y DBLs were highly unreliable, whereas extrapolations from FPMs were much more consistent between models fitted to successive DBLs. For CheckMate-017, in which an apparent survival plateau emerges in the 3-y DBL, MCMs fitted to this DBL estimated 5-y OS most accurately (11.6% v. 12.3% observed), and long-term predictions were similar to those from the 5-y validated SPM (20-y RMST: 30.2 v. 30.5 mo). For CheckMate-057, where there is no clear evidence of a survival plateau in the early DBLs, only B-MPES was able to accurately predict 5-y OS (14.1% v. 14.0% observed [3-y DBL]).
We demonstrate that the use of FPMs for modeling OS in NSCLC patients from early follow-up data can yield accurate estimates for RMST observed with longer follow-up and provide similar long-term extrapolations to externally validated SPMs based on later data cuts. B-MPES generated reasonable predictions even when fitted to the 2-y DBLs of the studies, whereas MCMs were more reliant on longer-term data to estimate a plateau and therefore performed better from 3 y. Generally, LRM extrapolations were less reliable than those from alternative FPMs and validated SPMs but remained superior to nonvalidated SPMs. Our work demonstrates the potential benefits of using advanced parametric models that incorporate external data sources, such as B-MPES and MCMs, to allow for accurate evaluation of treatment clinical and cost-effectiveness from trial data with limited follow-up.
Flexible advanced parametric modeling methods can provide improved survival extrapolations for immuno-oncology cost-effectiveness in health technology assessments from early clinical trial data that better anticipate extended follow-up.Advantages include leveraging additional observable trial data, the systematic integration of external data, and more detailed modeling of underlying processes.Bayesian multiparameter evidence synthesis performed particularly well, with well-matched external data.Mixture cure models also performed well but may require relatively longer follow-up to identify an emergent plateau, depending on the specific setting.Landmark response models offered marginal benefits in this scenario and may require greater numbers in each response group and/or increased follow-up to support improved extrapolation within each subgroup.
免疫肿瘤学(IO)疗法通常与延迟但深刻且持久的反应相关,表现为转移性癌症患者的长期生存获益。IO治疗产生的复杂风险函数可能会限制标准参数模型(SPM)外推的准确性。我们使用来自两项涉及非小细胞肺癌(NSCLC)患者的试验数据,评估了灵活参数模型(FPM)改善生存外推的能力。
我们的分析使用了在评估纳武利尤单抗与多西他赛治疗既往治疗过的转移性鳞状(CheckMate-017)和非鳞状(CheckMate-057)NSCLC患者的试验中,在2年、3年和5年最小随访时的连续数据库锁定(DBL)。对于每个DBL,在纳武利尤单抗总生存期(OS)上估计了SPM以及3种FPM——标志性反应模型(LRM)、混合治愈模型(MCM)和贝叶斯多参数证据合成(B-MPES)。通过将里程碑受限平均生存时间(RMST)和生存概率与从外部验证的SPM获得的结果进行比较,评估每个参数模型的性能。
对于两项试验的2年和3年DBL,所有模型往往都低估了5年OS。拟合到2年DBL的未经验证的SPM的预测高度不可靠,而FPM的外推在拟合到连续DBL的模型之间更加一致。对于CheckMate-017,在3年DBL中出现了明显的生存平台期,拟合到该DBL的MCM最准确地估计了5年OS(11.6%对观察到的12.3%),并且长期预测与5年验证的SPM相似(20年RMST:30.2对30.5个月)。对于CheckMate-057,在早期DBL中没有明确的生存平台期证据,只有B-MPES能够准确预测5年OS(14.1%对观察到的14.0%[3年DBL])。
我们证明,使用FPM根据早期随访数据对NSCLC患者的OS进行建模,可以对更长随访期观察到的RMST得出准确估计,并提供与基于后期数据截点的外部验证SPM相似的长期外推。即使拟合到研究的2年DBL,B-MPES也能产生合理的预测,而MCM更依赖于长期数据来估计平台期,因此从3年起表现更好。一般来说,LRM的外推比其他FPM和验证的SPM更不可靠,但仍优于未经验证的SPM。我们的工作证明了使用纳入外部数据源的先进参数模型(如B-MPES和MCM)的潜在益处,以便从随访有限的试验数据中准确评估治疗的临床和成本效益。
灵活的先进参数建模方法可以从早期临床试验数据中为免疫肿瘤学成本效益提供更好的生存外推,从而更好地预测延长随访。优点包括利用额外的可观察试验数据、系统整合外部数据以及对潜在过程进行更详细的建模。贝叶斯多参数证据合成表现尤其出色,外部数据匹配良好。混合治愈模型也表现良好,但可能需要相对更长的随访来识别出现的平台期,具体取决于特定情况。在这种情况下,标志性反应模型提供的益处有限,可能需要每个反应组中有更多数量和/或增加随访以支持每个亚组内更好的外推。