Institute of Neurosurgery, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Catholic University, Rome, Italy.
Neurosurgery Unit, Department of Neuroscience, Santa Maria della Misericordia, University Hospital, Udine, Italy.
Neurosurgery. 2021 Oct 13;89(5):873-883. doi: 10.1093/neuros/nyab320.
Ability to thrive and time-to-recurrence following treatment are important parameters to assess in patients with glioblastoma multiforme (GBM), given its dismal prognosis. Though there is an ongoing debate whether it can be considered an appropriate surrogate endpoint for overall survival in clinical trials, progression-free survival (PFS) is routinely used for clinical decision-making.
To investigate whether machine learning (ML)-based models can reliably stratify newly diagnosed GBM patients into prognostic subclasses on PFS basis, identifying those at higher risk for an early recurrence (≤6 mo).
Data were extracted from a multicentric database, according to the following eligibility criteria: histopathologically verified GBM and follow-up >12 mo: 474 patients met our inclusion criteria and were included in the analysis. Relevant demographic, clinical, molecular, and radiological variables were selected by a feature selection algorithm (Boruta) and used to build a ML-based model.
Random forest prediction model, evaluated on an 80:20 split ratio, achieved an AUC of 0.81 (95% CI: 0.77; 0.83) demonstrating high discriminative ability. Optimizing the predictive value derived from the linear and nonlinear combinations of the selected input features, our model outperformed across all performance metrics multivariable logistic regression.
A robust ML-based prediction model that identifies patients at high risk for early recurrence was successfully trained and internally validated. Considerable effort remains to integrate these predictions in a patient-centered care context.
鉴于胶质母细胞瘤(GBM)预后较差,评估患者的治疗后生存能力和复发时间是重要的参数。虽然目前仍在争论是否可以将其视为临床试验中总生存的合适替代终点,但无进展生存期(PFS)通常用于临床决策。
研究基于机器学习(ML)的模型是否可以可靠地根据 PFS 将新诊断的 GBM 患者分层为预后亚组,确定那些复发风险较高(≤6 个月)的患者。
根据以下纳入标准从多中心数据库中提取数据:组织病理学证实的 GBM 和随访时间>12 个月:474 名患者符合我们的纳入标准,并纳入分析。通过特征选择算法(Boruta)选择相关的人口统计学、临床、分子和影像学变量,并用于构建基于 ML 的模型。
随机森林预测模型在 80:20 的分割比上进行评估,AUC 为 0.81(95%CI:0.77;0.83),显示出较高的判别能力。通过优化从选定输入特征的线性和非线性组合中得出的预测值,我们的模型在所有性能指标上均优于多变量逻辑回归。
成功训练并内部验证了一种基于强大 ML 的预测模型,可识别出早期复发风险较高的患者。仍需要付出相当大的努力将这些预测整合到以患者为中心的护理环境中。