Ling Tong, Zuo Zhichao, Huang Mingwei, Ma Jie, Wu Liucheng
Department of Gastrointestinal Surgery, Guangxi Medical University Cancer Hospital, Nanning, Guangxi, China.
School of Mathematics and Computational Science, Xiangtan University, Xiangtan, Hunan province, China.
BMC Cancer. 2025 May 6;25(1):834. doi: 10.1186/s12885-025-14259-w.
The early prediction of lymph node positivity (LN+) after neoadjuvant chemotherapy (NAC) is crucial for optimizing individualized treatment strategies. This study aimed to integrate radiomic features and clinical biomarkers through machine learning (ML) approaches to enhance prediction accuracy by focusing on patients with locally advanced gastric cancer (LAGC).
We retrospectively enrolled 277 patients with LAGC and randomly divided them into training (n = 193) and validation (n = 84) sets at a 7:3 ratio. In total, 1,130 radiomics features were extracted from pre-treatment portal venous phase computed tomography scans. These features were linearly combined to develop a radiomics score (rad score) through feature engineering. Then, using the rad score and clinical biomarkers as input features, we applied simple statistical strategies (relying on a single ML model) and integrated statistical strategies (including classification model integration techniques, such as hard voting, soft voting, and stacking) to predict LN+ post-NAC. The diagnostic performance of the model was assessed using receiver operating characteristic curves with corresponding areas under the curve (AUC).
Of all ML models, the stacking classifier, an integrated statistical strategy, exhibited the best performance, achieving an AUC of 0.859 for predicting LN+ in patients with LAGC. This predictive model was transformed into a publicly available online risk calculator.
We developed a stacking classifier that integrates radiomics and clinical biomarkers to predict LN+ in patients with LAGC undergoing surgical resection, providing personalized treatment insights.
新辅助化疗(NAC)后淋巴结阳性(LN+)的早期预测对于优化个体化治疗策略至关重要。本研究旨在通过机器学习(ML)方法整合影像组学特征和临床生物标志物,以提高局部晚期胃癌(LAGC)患者的预测准确性。
我们回顾性纳入了277例LAGC患者,并以7:3的比例将他们随机分为训练集(n = 193)和验证集(n = 84)。总共从治疗前门静脉期计算机断层扫描中提取了1130个影像组学特征。通过特征工程将这些特征进行线性组合以生成影像组学评分(rad评分)。然后,将rad评分和临床生物标志物作为输入特征,我们应用简单统计策略(依赖单一ML模型)和整合统计策略(包括分类模型整合技术,如硬投票、软投票和堆叠)来预测NAC后的LN+。使用受试者操作特征曲线及其相应的曲线下面积(AUC)评估模型诊断性能。
在所有ML模型中,整合统计策略的堆叠分类器表现最佳,预测LAGC患者LN+的AUC为0.859。该预测模型被转化为一个可公开获取的在线风险计算器。
我们开发了一种整合影像组学和临床生物标志物的堆叠分类器,以预测接受手术切除的LAGC患者的LN+,提供个性化治疗见解。