Xiao Wei, Yang Binbin, Ke Shanbao
Department of Gastroenterology, Henan Provincial People's Hospital, Zhengzhou University People's Hospital, Zhengzhou, Henan, China.
Division of Gastroenterology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.
Front Oncol. 2025 Sep 1;15:1644141. doi: 10.3389/fonc.2025.1644141. eCollection 2025.
Patients with unresectable pancreatic cancer have poor outcomes despite chemoradiotherapy (CRT). Traditional prognostic tools lack accuracy in predicting survival. This study aimed to develop an artificial intelligence (AI)-based model to improve survival prediction.
We retrospectively included 214 patients treated with CRT between 2018 and 2024. Five models-Cox, LASSO, RSF, SVM, and XGBoost-were trained to predict overall survival. Model performance was evaluated using the C-index, time-dependent ROC, calibration, and decision curve analysis. SHAP was used to interpret feature importance.
The median overall survival (mOS) for the entire cohort was 18.4 months (95% CI, 16.3-28.1). XGBoost showed the best performance (C-index = 0.949). It also achieved higher area under the receiver operating characteristic curves at 6 and 12 months (0.751 and 0.732) compared to other models. Calibration and clinical benefit were superior. SHAP analysis identified CA199, tumor size, platelet count, and age as the most important predictors. The model stratified patients into risk groups with significant survival differences (p < 0.001).
The XGBoost-based model accurately predicted survival in unresectable pancreatic cancer patients receiving CRT. It may serve as a useful tool for personalized risk assessment and treatment planning.
尽管接受了放化疗(CRT),不可切除胰腺癌患者的预后仍然较差。传统的预后工具在预测生存率方面缺乏准确性。本研究旨在开发一种基于人工智能(AI)的模型,以改善生存预测。
我们回顾性纳入了2018年至2024年间接受CRT治疗的214例患者。训练了五个模型——Cox、LASSO、随机生存森林(RSF)、支持向量机(SVM)和极端梯度提升(XGBoost)——来预测总生存期。使用C指数、时间依赖的ROC曲线、校准和决策曲线分析评估模型性能。使用SHAP来解释特征重要性。
整个队列的中位总生存期(mOS)为18.4个月(95%CI,16.3 - 28.1)。XGBoost表现最佳(C指数 = 0.949)。与其他模型相比,它在6个月和12个月时的受试者操作特征曲线下面积也更高(分别为0.751和0.732)。校准和临床获益更优。SHAP分析确定CA199、肿瘤大小、血小板计数和年龄是最重要的预测因素。该模型将患者分层为具有显著生存差异的风险组(p < 0.001)。
基于XGBoost的模型准确预测了接受CRT的不可切除胰腺癌患者的生存情况。它可能成为个性化风险评估和治疗计划的有用工具。