Qiu Xianxin, Gao Jing, Yang Jing, Hu Jiyi, Hu Weixu, Kong Lin, Lu Jiade J
Shanghai Engineering Research Center of Proton and Heavy Ion Radiation Therapy, Shanghai, China.
Department of Radiation Oncology, Shanghai Proton and Heavy Ion Center, Shanghai, China.
Front Oncol. 2020 Oct 30;10:551420. doi: 10.3389/fonc.2020.551420. eCollection 2020.
Machine learning (ML) algorithms are increasingly explored in glioma prognostication. Random survival forest (RSF) is a common ML approach in analyzing time-to-event survival data. However, it is controversial which method between RSF and traditional cornerstone method Cox proportional hazards (CPH) is better fitted. The purpose of this study was to compare RSF and CPH in predicting tumor progression of high-grade glioma (HGG) after particle beam radiotherapy (PBRT).
The study enrolled 82 consecutive HGG patients who were treated with PBRT at Shanghai Proton and Heavy Ion Center between 6/2015 and 11/2019. The entire cohort was split into the training and testing set in an 80/20 ratio. Ten variables from patient-related, tumor-related and treatment-related information were utilized for developing CPH and RSF for predicting progression-free survival (PFS). The model performance was compared in concordance index (C-index) for discrimination (accuracy), brier score (BS) for calibration (precision) and variable importance for interpretability.
The CPH model demonstrated a better performance in terms of integrated C-index (62.9%) and BS (0.159) compared to RSF model (C-index = 61.1%, BS = 0.174). In the context of variable importance, CPH model indicated that age (P = 0.024), WHO grade (P = 0.020), IDH gene (P = 0.019), and MGMT promoter status (P = 0.040) were significantly correlated with PFS in the univariate analysis; multivariate analysis showed that age (P = 0.041), surgical completeness (P = 0.084), IDH gene (P = 0.057), and MGMT promoter (P = 0.092) had a significant or trend toward the relation with PFS. RSF showed that merely IDH and age were of positive importance for predicting PFS. A final nomogram was developed to predict tumor progression at the individual level based on CPH model.
In a relatively small dataset with HGG patients treated with PBRT, CPH outperformed RSF for predicting tumor progression. A comprehensive criterion with accuracy, precision, and interpretability is recommended in evaluating ML prognostication approaches for clinical deployment.
机器学习(ML)算法在胶质瘤预后评估中的应用越来越广泛。随机生存森林(RSF)是分析事件发生时间生存数据的一种常见ML方法。然而,RSF与传统的基石方法Cox比例风险(CPH)哪种方法拟合效果更好仍存在争议。本研究的目的是比较RSF和CPH在预测粒子束放疗(PBRT)后高级别胶质瘤(HGG)肿瘤进展方面的效果。
本研究纳入了2015年6月至2019年11月期间在上海质子重离子中心接受PBRT治疗的82例连续HGG患者。整个队列按照80/20的比例分为训练集和测试集。利用患者相关、肿瘤相关和治疗相关信息中的10个变量来构建CPH和RSF模型,以预测无进展生存期(PFS)。通过一致性指数(C指数)评估判别能力(准确性)、Brier评分(BS)评估校准能力(精确性)以及变量重要性评估可解释性,对模型性能进行比较。
与RSF模型(C指数 = 61.1%,BS = 0.174)相比,CPH模型在综合C指数(62.9%)和BS(0.159)方面表现更好。在变量重要性方面,CPH模型在单因素分析中表明年龄(P = 0.024)、世界卫生组织(WHO)分级(P = 0.020)、异柠檬酸脱氢酶(IDH)基因(P = 0.019)和O6-甲基鸟嘌呤-DNA甲基转移酶(MGMT)启动子状态(P = 0.040)与PFS显著相关;多因素分析显示年龄(P = 0.041)、手术完整性(P = 0.084)、IDH基因(P = 0.057)和MGMT启动子(P = 0.092)与PFS有显著或有相关趋势。RSF显示仅IDH和年龄对预测PFS具有正向重要性。基于CPH模型开发了一个最终的列线图,用于在个体水平上预测肿瘤进展。
在一个相对较小的接受PBRT治疗的HGG患者数据集中,CPH在预测肿瘤进展方面优于RSF。在评估用于临床应用的ML预后评估方法时,建议采用准确性、精确性和可解释性的综合标准。