Department of Hematology, Clinical Cancer Research Centre, Aalborg University Hospital, Aalborg, Denmark.
Department of Clinical Medicine, Aalborg University, Aalborg, Denmark.
JCO Clin Cancer Inform. 2024 Apr;8:e2300255. doi: 10.1200/CCI.23.00255.
Patients diagnosed with advanced-stage Hodgkin lymphoma (aHL) have historically been risk-stratified using the International Prognostic Score (IPS). This study investigated if a machine learning (ML) approach could outperform existing models when it comes to predicting overall survival (OS) and progression-free survival (PFS).
This study used patient data from the Danish National Lymphoma Register for model development (development cohort). The ML model was developed using stacking, which combines several predictive survival models (Cox proportional hazard, flexible parametric model, IPS, principal component, penalized regression) into a single model, and was compared with two versions of IPS (IPS-3 and IPS-7) and the newly developed aHL international prognostic index (A-HIPI). Internal model validation was performed using nested cross-validation, and external validation was performed using patient data from the Swedish Lymphoma Register and Cancer Registry of Norway (validation cohort).
In total, 707 and 760 patients with aHL were included in the development and validation cohorts, respectively. Examining model performance for OS in the development cohort, the concordance index (C-index) for the ML model, IPS-7, IPS-3, and A-HIPI was found to be 0.789, 0.608, 0.650, and 0.768, respectively. The corresponding estimates in the validation cohort were 0.749, 0.700, 0.663, and 0.741. For PFS, the ML model achieved the highest C-index in both cohorts (0.665 in the development cohort and 0.691 in the validation cohort). The time-varying AUCs for both the ML model and the A-HIPI were consistently higher in both cohorts compared with the IPS models within the first 5 years after diagnosis.
The new prognostic model for aHL on the basis of ML techniques demonstrated a substantial improvement compared with the IPS models, but yielded a limited improvement in predictive performance compared with the A-HIPI.
患有晚期霍奇金淋巴瘤(aHL)的患者过去一直使用国际预后评分(IPS)进行风险分层。本研究调查了机器学习(ML)方法在预测总生存(OS)和无进展生存(PFS)方面是否优于现有模型。
本研究使用丹麦国家淋巴瘤登记处的患者数据进行模型开发(开发队列)。使用堆叠方法开发 ML 模型,该方法将几个预测生存模型(Cox 比例风险、灵活参数模型、IPS、主成分、惩罚回归)组合成一个单一模型,并与两个版本的 IPS(IPS-3 和 IPS-7)和新开发的 aHL 国际预后指数(A-HIPI)进行比较。内部模型验证使用嵌套交叉验证进行,外部验证使用瑞典淋巴瘤登记处和挪威癌症登记处的患者数据进行(验证队列)。
共有 707 例和 760 例 aHL 患者分别纳入开发队列和验证队列。在开发队列中,评估 OS 的模型性能时,ML 模型、IPS-7、IPS-3 和 A-HIPI 的一致性指数(C-index)分别为 0.789、0.608、0.650 和 0.768。验证队列中的相应估计值分别为 0.749、0.700、0.663 和 0.741。对于 PFS,ML 模型在两个队列中均获得了最高的 C-index(开发队列中为 0.665,验证队列中为 0.691)。在诊断后 5 年内,ML 模型和 A-HIPI 的时间变化 AUC 始终高于 IPS 模型。
基于 ML 技术的新的 aHL 预后模型与 IPS 模型相比显示出显著改善,但与 A-HIPI 相比,预测性能的改善有限。