Department of Anesthesiology, Pain and Perioperative Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China.
Department of Pathology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China.
Sci Rep. 2021 Jan 14;11(1):1300. doi: 10.1038/s41598-020-80582-w.
To construct a machine learning algorithm model of lymph node metastasis (LNM) in patients with poorly differentiated-type intramucosal gastric cancer. 1169 patients with postoperative gastric cancer were divided into a training group and a test group at a ratio of 7:3. The model for lymph node metastasis was established with python machine learning. The Gbdt algorithm in the machine learning results finds that number of resected nodes, lymphovascular invasion and tumor size are the primary 3 factors that account for the weight of LNM. Effect of the LNM model of PDC gastric cancer patients in the training group: Among the 7 algorithm models, the highest accuracy rate was that of GBDT (0.955); The AUC values for the 7 algorithms were, from high to low, XGB (0.881), RF (0.802), GBDT (0.798), LR (0.778), XGB + LR (0.739), RF + LR (0.691) and GBDT + LR (0.626). Results of the LNM model of PDC gastric cancer patients in test group : Among the 7 algorithmic models, XGB had the highest accuracy rate (0.952); Among the 7 algorithms, the AUC values, from high to low, were GBDT (0.788), RF (0.765), XGB (0.762), LR (0.750), RF + LR (0.678), GBDT + LR (0.650) and XGB + LR (0.619). Single machine learning algorithm can predict LNM in poorly differentiated-type intramucosal gastric cancer, but fusion algorithm can not improve the effect of machine learning in predicting LNM.
建立用于预测低分化型黏膜内胃癌患者淋巴结转移(LNM)的机器学习算法模型。将 1169 例术后胃癌患者按 7∶3 的比例分为训练组和测试组,采用 python 机器学习建立淋巴结转移模型。机器学习结果中的 Gbdt 算法发现,切除淋巴结数目、血管淋巴管侵犯和肿瘤大小是导致 LNM 的主要 3 个因素。在训练组中,PDC 胃癌患者 LNM 模型的效果:在 7 个算法模型中,GBDT 的准确率最高(0.955);7 种算法的 AUC 值从高到低依次为 XGB(0.881)、RF(0.802)、GBDT(0.798)、LR(0.778)、XGB+LR(0.739)、RF+LR(0.691)和 GBDT+LR(0.626)。PDC 胃癌患者 LNM 模型在测试组中的效果:在 7 个算法模型中,XGB 的准确率最高(0.952);7 种算法的 AUC 值从高到低依次为 GBDT(0.788)、RF(0.765)、XGB(0.762)、LR(0.750)、RF+LR(0.678)、GBDT+LR(0.650)和 XGB+LR(0.619)。单一的机器学习算法可以预测低分化型黏膜内胃癌的 LNM,但融合算法并不能提高机器学习预测 LNM 的效果。