Yan Zhiqiang, Wang Jiang, Dong Qiufeng, Zhu Lian, Lin Wei, Jiang Xiaofan
Department of Neurosurgery, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
Ann Transl Med. 2022 Aug;10(16):860. doi: 10.21037/atm-22-3384.
Glioma is the most common primary intracranial tumor with poor prognosis. The prediction of glioma prognosis has not been well investigated. XGBoost algorithm has been widely used in and data analysis. The predictive value of XGBoost algorithm in glioma remains unclear. This current study used the XGBoost algorithm to construct a predictive model for postoperative outcomes of glioma patients.
Patients with glioma who underwent surgery from January 2006 to April 2017 were retrospectively included in this study. Clinical and follow-up data were collected. The XGBoost model and multivariate logistic regression analysis model were used to screen the factors related to postoperative outcomes, and the results of the two models were compared. The area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, specificity, and Youden index were calculated to evaluate the predictive value of the XGBoost model.
A total of 638 patients were included. In total, 336 (52.7%) cases died within 5 years after the operation. Multivariate logistic regression analysis showed that age, gender, World Health Organization (WHO) grade, extent of tumor resection, Karnofsy performance score (KPS), tumor diameter, and whether postoperative radiotherapy and chemotherapy were administered, were the most important risk factors for death within 5 years after surgery in glioma patients. The XGBoost model showed that the top 5 factors related to death of glioma patients within 5 years after surgery were WHO grade (30 points), extent of tumor resection (19 points), postoperative radiotherapy and chemotherapy (16 points), KPS (14 points), and age (11 points). The AUC of the XGBoost model for predicting the death of glioma patients within 5 years after surgery was 0.803 [95% confidence interval (CI): 0.718-0.832], and the sensitivity and specificity were 0.894 and 0.581, respectively. The Youden index was 0.475. The AUC of the multivariate logistic regression model was 0.738 (95% CI: 0.704-0.781), the sensitivity and specificity were 0.785 and 0.632, respectively, and the Youden index was 0.417.
Compared with multivariate logistic regression model, XGBoost model has better performance in predicting the risk of death within 5 years after surgery in patients with glioma.
胶质瘤是最常见的原发性颅内肿瘤,预后较差。胶质瘤预后的预测尚未得到充分研究。XGBoost算法已广泛应用于数据分析。XGBoost算法在胶质瘤中的预测价值仍不明确。本研究采用XGBoost算法构建胶质瘤患者术后预后的预测模型。
回顾性纳入2006年1月至2017年4月接受手术的胶质瘤患者。收集临床和随访数据。采用XGBoost模型和多因素logistic回归分析模型筛选与术后预后相关的因素,并比较两种模型的结果。计算受试者工作特征(ROC)曲线下面积(AUC)、敏感性、特异性和约登指数,以评估XGBoost模型的预测价值。
共纳入638例患者。其中,336例(52.7%)在术后5年内死亡。多因素logistic回归分析显示,年龄、性别、世界卫生组织(WHO)分级、肿瘤切除范围、卡诺夫斯基功能状态评分(KPS)、肿瘤直径以及是否进行术后放疗和化疗,是胶质瘤患者术后5年内死亡的最重要危险因素。XGBoost模型显示,与胶质瘤患者术后5年内死亡相关的前5个因素分别为WHO分级(30分)、肿瘤切除范围(19分)、术后放疗和化疗(16分)、KPS(14分)和年龄(11分)。XGBoost模型预测胶质瘤患者术后5年内死亡的AUC为0.803[95%置信区间(CI):0.718-0.832],敏感性和特异性分别为0.894和0.581。约登指数为0.475。多因素logistic回归模型的AUC为0.738(95%CI:0.704-0.781),敏感性和特异性分别为0.785和0.632,约登指数为0.417。
与多因素logistic回归模型相比,XGBoost模型在预测胶质瘤患者术后5年内死亡风险方面具有更好的性能。