Department of Anesthesiology, Pain and Perioperative Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450000, China.
School of Medicine, Southeast University, Nanjing, 210009, China.
Sci Rep. 2021 Jan 15;11(1):1571. doi: 10.1038/s41598-021-81188-6.
To explore the predictive performance of machine learning on the recurrence of patients with gastric cancer after the operation. The available data is divided into two parts. In particular, the first part is used as a training set (such as 80% of the original data), and the second part is used as a test set (the remaining 20% of the data). And we use fivefold cross-validation. The weight of recurrence factors shows the top four factors are BMI, Operation time, WGT and age in order. In training group:among the 5 machine learning models, the accuracy of gbm was 0.891, followed by gbm algorithm was 0.876; The AUC values of the five machine learning algorithms are from high to low as forest (0.962), gbm (0.922), GradientBoosting (0.898), DecisionTree (0.790) and Logistic (0.748). And the precision of the forest is the highest 0.957, followed by the GradientBoosting algorithm (0.878). At the same time, in the test group is as follows: the highest accuracy of Logistic was 0.801, followed by forest algorithm and gbm; the AUC values of the five algorithms are forest (0.795), GradientBoosting (0.774), DecisionTree (0.773), Logistic (0.771) and gbm (0.771), from high to low. Among the five machine learning algorithms, the highest precision rate of Logistic is 1.000, followed by the gbm (0.487). Machine learning can predict the recurrence of gastric cancer patients after an operation. Besides, the first four factors affecting postoperative recurrence of gastric cancer were BMI, Operation time, WGT and age.
探索机器学习对胃癌患者手术后复发的预测性能。可用数据分为两部分。特别是,第一部分用作训练集(例如原始数据的 80%),第二部分用作测试集(其余 20%的数据)。我们使用五重交叉验证。复发因素的权重显示前四个因素按顺序为 BMI、手术时间、WGT 和年龄。在训练组中:在 5 种机器学习模型中,gbm 的准确率最高为 0.891,其次是 gbm 算法为 0.876;五种机器学习算法的 AUC 值从高到低依次为森林(0.962)、gbm(0.922)、梯度提升(0.898)、决策树(0.790)和逻辑(0.748)。森林的精度最高为 0.957,其次是梯度提升算法(0.878)。同时,在测试组中如下:逻辑的准确率最高为 0.801,其次是森林算法和 gbm;五种算法的 AUC 值从高到低依次为森林(0.795)、梯度提升(0.774)、决策树(0.773)、逻辑(0.771)和 gbm(0.771)。在五种机器学习算法中,逻辑的精度最高为 1.000,其次是 gbm(0.487)。机器学习可以预测胃癌患者手术后的复发情况。此外,影响胃癌患者术后复发的前四个因素为 BMI、手术时间、WGT 和年龄。