Liu Yuan, Shang Xingchen, Du Wenyi, Shen Wei, Zhu Yanfei
Department of General Surgery, Wuxi Medical Center of Nanjing Medical University, Wuxi, People's Republic of China.
Department of General Surgery, Tengzhou Central People's Hospital, Jining Medical College, Shandong, People's Republic of China.
Int J Gen Med. 2024 Oct 29;17:4999-5014. doi: 10.2147/IJGM.S485347. eCollection 2024.
The reappearance of gastric cancer, a frequent postoperative complication following radical gastric cancer surgery, substantially impacts the near-term and far-reaching medical outlook of patients. The objective of this research was to create a machine learning algorithm that could recognize high-risk factors for gastric cancer recurrence and anticipate the correlation between gastric cancer recurrence and Helicobacter pylori (H. pylori) infection.
This investigation comprised 1234 patients diagnosed with gastric cancer, and 37 characteristic variables were obtained. Four machine learning algorithms, namely, extreme gradient boosting (XGBoost), random forest (RF), k-nearest neighbor algorithm (KNN), and multilayer perceptron (MLP), were implemented to develop the models. The k-fold cross-validation technique was utilized to perform internal validation of the four models, while independent datasets were employed for external validation of the models.
In contrast to the other machine learning models, the XGBoost algorithm demonstrated superior predictive ability regarding high-risk factors for gastric cancer recurrence. The outcomes of Shapley additive explanation (SHAP) analysis revealed that tumor invasion depth, tumor lymph node metastasis, H. pylori infection, postoperative carcinoembryonic antigen (CEA), tumor size, and tumor number were risk elements for gastric cancer recurrence in patients, with H. pylori infection being the primary high-risk factor.
Out of the four machine learning models, the XGBoost algorithm exhibited superior performance in predicting the recurrence of gastric cancer. In addition, machine learning models can help clinicians identify key prognostic factors that are clinically meaningful for the application of personalized patient monitoring and immunotherapy.
胃癌复发是胃癌根治术后常见的术后并发症,对患者的近期和远期医学前景有重大影响。本研究的目的是创建一种机器学习算法,该算法能够识别胃癌复发的高危因素,并预测胃癌复发与幽门螺杆菌(H. pylori)感染之间的相关性。
本研究纳入了1234例确诊为胃癌的患者,并获取了37个特征变量。实施了四种机器学习算法,即极端梯度提升(XGBoost)、随机森林(RF)、k近邻算法(KNN)和多层感知器(MLP)来开发模型。采用k折交叉验证技术对这四种模型进行内部验证,同时使用独立数据集对模型进行外部验证。
与其他机器学习模型相比,XGBoost算法在胃癌复发高危因素的预测能力方面表现更优。夏普利加性解释(SHAP)分析结果显示,肿瘤浸润深度、肿瘤淋巴结转移、幽门螺杆菌感染、术后癌胚抗原(CEA)、肿瘤大小和肿瘤数量是患者胃癌复发的危险因素,其中幽门螺杆菌感染是主要的高危因素。
在四种机器学习模型中,XGBoost算法在预测胃癌复发方面表现出更优的性能。此外,机器学习模型可以帮助临床医生识别对个性化患者监测和免疫治疗应用具有临床意义的关键预后因素。