Huang Ze-Ning, Zhang Hao-Xiang, Sun Yu-Qin, Zhang Xing-Qi, Lin Yi-Fen, Weng Cai-Ming, Zheng Chao-Hui, Wang Jia-Bin, Chen Qi-Yue, Cao Long-Long, Lin Mi, Tu Ru-Hong, Huang Chang-Ming, Lin Jian-Xian, Xie Jian-Wei
Department of Gastric Surgery, Fujian Medical University Union Hospital, No. 29 Xin-quan Road, Fuzhou, Fujian Province, 350001, China.
Key Laboratory of Ministry of Education of Gastrointestinal Cancer, Fujian Medical University, Fuzhou, China.
J Transl Med. 2025 Mar 24;23(1):362. doi: 10.1186/s12967-025-06363-z.
Neoadjuvant immunotherapy has been shown to improve survival in patients with gastric cancer. This study sought to develop and validate a radiomics-based machine learning (ML) model for patients with locally advanced gastric cancer (LAGC), specifically to predict whether patients will achieve a major pathological response (MPR) following neoadjuvant immunotherapy. With its predictive capabilities, this tool shows promise for enhancing clinical decision-making processes in the future.
This study utilized a multicenter cohort design, retrospectively gathering clinical data and computed tomography (CT) images from 268 patients diagnosed with advanced gastric cancer who underwent neoadjuvant immunotherapy between January 2019 and December 2023 from two medical centers. Radiomic features were extracted from CT images, and a multi-step feature selection procedure was applied to identify the top 20 representative features. Nine ML algorithms were implemented to build prediction models, with the optimal algorithm selected for the final prediction model. The hyperparameters of the chosen model were fine-tuned using Bayesian optimization and grid search. The performance of the model was evaluated using several metrics, including the area under the curve (AUC), accuracy, and Cohen's kappa coefficient.
Three cohorts were included in this study: the development cohort (DC, n = 86), the internal validation cohort (IVC, n = 59), and the external validation cohort (EVC, n = 52). Nine ML models were developed using DC cases. Among these, an optimized Bayesian-LightGBM model, demonstrated robust predictive performance for MPR following neoadjuvant immunotherapy in LAGC patients across all cohorts. Specifically, within DC, the LightGBM model attained an AUC of 0.828, an overall accuracy of 0.791, a Cohen's kappa coefficient of 0.552, a sensitivity of 0.742, a specificity of 0.818, a positive predictive value (PPV) of 0.586, a negative predictive value (NPV) of 0.867, a Matthews correlation coefficient (MCC) of 0.473, and a balanced accuracy of 0.780. Comparable performance metrics were validated in both the IVC and the EVC, with AUC values of 0.777 and 0.714, and overall accuracies of 0.729 and 0.654, respectively. These results suggested good fitness and generalization of the Bayesian-LightGBM model. Shapley Additive Explanations (SHAP) analysis identified significant radiomic features contributing to the model's predictive capability. The SHAP values of the features wavelet.LLH_gldm_SmallDependenceLowGrayLevelEmphasis, wavelet.HHL_glrlm_RunVariance, and wavelet.LLH_glszm_LargeAreaHighGrayLevelEmphasis were ranked among the top three, highlighting their significant contribution to the model's predictive performance. In contrast to existing radiomic models that exclusively focus on neoadjuvant chemotherapy, our model integrates both neoadjuvant immunotherapy and chemotherapy, thereby offering more precise predictive capabilities.
The radiomics-based ML model demonstrated significant efficacy in predicting the pathological response to neoadjuvant immunotherapy in LAGC patients, thereby providing a foundation for personalized treatment strategies.
新辅助免疫疗法已被证明可提高胃癌患者的生存率。本研究旨在开发并验证一种基于影像组学的机器学习(ML)模型,用于局部晚期胃癌(LAGC)患者,特别是预测患者在新辅助免疫治疗后是否会达到主要病理缓解(MPR)。凭借其预测能力,该工具有望在未来增强临床决策过程。
本研究采用多中心队列设计,回顾性收集了2019年1月至2023年12月期间在两个医疗中心接受新辅助免疫治疗的268例晚期胃癌患者的临床数据和计算机断层扫描(CT)图像。从CT图像中提取影像组学特征,并应用多步骤特征选择程序来识别前20个代表性特征。实施了九种ML算法来构建预测模型,并为最终预测模型选择了最优算法。使用贝叶斯优化和网格搜索对所选模型的超参数进行微调。使用包括曲线下面积(AUC)、准确率和科恩kappa系数在内的多个指标评估模型的性能。
本研究纳入了三个队列:开发队列(DC,n = 86)、内部验证队列(IVC,n = 59)和外部验证队列(EVC,n = 52)。使用DC病例开发了九个ML模型。其中,优化后的贝叶斯 - LightGBM模型在所有队列的LAGC患者新辅助免疫治疗后的MPR预测中表现出强大的预测性能。具体而言,在DC中,LightGBM模型的AUC为0.828,总体准确率为0.791,科恩kappa系数为0.552,灵敏度为0.742,特异性为0.818,阳性预测值(PPV)为0.586,阴性预测值(NPV)为0.867,马修斯相关系数(MCC)为0.473,平衡准确率为0.780。在IVC和EVC中也验证了类似的性能指标,AUC值分别为0.777和0.714,总体准确率分别为0.729和0.654。这些结果表明贝叶斯 - LightGBM模型具有良好的拟合度和泛化能力。Shapley值相加解释(SHAP)分析确定了对模型预测能力有显著贡献的影像组学特征。特征小波.LLH_gldm_SmallDependenceLowGrayLevelEmphasis、小波.HHL_glrlm_RunVariance和小波.LLH_glszm_LargeAreaHighGrayLevelEmphasis的SHAP值排名前三,突出了它们对模型预测性能的显著贡献。与现有的仅专注于新辅助化疗的影像组学模型不同,我们的模型整合了新辅助免疫疗法和化疗,从而提供了更精确的预测能力。
基于影像组学的ML模型在预测LAGC患者对新辅助免疫治疗的病理反应方面显示出显著疗效,从而为个性化治疗策略提供了基础。