Ashitomi Yuya, Takahashi Ryosuke, Okazaki Shinji, Sugawara Shuichiro, Kamio Yukinori, Musha Hiroaki, Motoi Fuyuhiko
First Department of Surgery, Faculty of Medicine, Yamagata University, 2-2-2 Iida-Nishi, Yamagata City, Yamagata, 990-9585, Japan.
Surg Today. 2025 Aug 18. doi: 10.1007/s00595-025-03110-1.
Machine learning (ML) is a method of creating models by learning latent patterns and features from collected data to predict and classify new unknown data. We constructed an ML model to predict postoperative complications using various pre- and intraoperative factors from electronic medical records and examined its prediction accuracy.
Data of 617 patients who underwent major organ resection were included in this study. Patient information was collected from the medical records. Consequently, we created Dataset 1, which included all of the data, and Dataset 2, which was adjusted for the factors. The ML models were applied to the two datasets, and the performances of the ML models were compared.
In Dataset 1, the Logistic Regression model showed the best performance, with correct predictions of serious postoperative complications (accuracy), area under the receiver operating characteristic curve (AUROC), and area under the precision-recall curve (AUPRC) values of 0.798, 0.671, and 0.374, respectively. The random forest (RF) model performed best in Dataset 2, with accuracy, AUROC, and AUPRC values of 0.855, 0.725, and 0.412, respectively.
The RF model performed well in predicting serious postoperative complications of gastrointestinal surgery. Further studies are required to improve the accuracy of this ML model for clinical applications.
机器学习(ML)是一种通过从收集的数据中学习潜在模式和特征来创建模型,以预测和分类新的未知数据的方法。我们构建了一个ML模型,使用电子病历中的各种术前和术中因素来预测术后并发症,并检验其预测准确性。
本研究纳入了617例行主要器官切除术患者的数据。从病历中收集患者信息。因此,我们创建了包含所有数据的数据集1和针对这些因素进行调整的数据集2。将ML模型应用于这两个数据集,并比较ML模型的性能。
在数据集1中,逻辑回归模型表现最佳,对严重术后并发症的正确预测(准确率)、受试者工作特征曲线下面积(AUROC)和精确召回率曲线下面积(AUPRC)值分别为0.798、0.671和0.374。随机森林(RF)模型在数据集2中表现最佳,准确率、AUROC和AUPRC值分别为0.855、0.725和0.412。
RF模型在预测胃肠手术严重术后并发症方面表现良好。需要进一步研究以提高该ML模型在临床应用中的准确性。