Yu Miao, Yuan Zihan, Li Ruijie, Shi Bo, Wan Daiwei, Dong Xiaoqiang
Department of General Surgery, The First Affiliated Hospital of Soochow University, Suzhou, China.
Front Oncol. 2024 Feb 6;14:1337219. doi: 10.3389/fonc.2024.1337219. eCollection 2024.
Laparoscopic total mesorectal excision (LaTME) is standard surgical methods for rectal cancer, and LaTME operation is a challenging procedure. This study is intended to use machine learning to develop and validate prediction models for surgical difficulty of LaTME in patients with rectal cancer and compare these models' performance.
We retrospectively collected the preoperative clinical and MRI pelvimetry parameter of rectal cancer patients who underwent laparoscopic total mesorectal resection from 2017 to 2022. The difficulty of LaTME was defined according to the scoring criteria reported by Escal. Patients were randomly divided into training group (80%) and test group (20%). We selected independent influencing features using the least absolute shrinkage and selection operator (LASSO) and multivariate logistic regression method. Adopt synthetic minority oversampling technique (SMOTE) to alleviate the class imbalance problem. Six machine learning model were developed: light gradient boosting machine (LGBM); categorical boosting (CatBoost); extreme gradient boost (XGBoost), logistic regression (LR); random forests (RF); multilayer perceptron (MLP). The area under receiver operating characteristic curve (AUROC), accuracy, sensitivity, specificity and F1 score were used to evaluate the performance of the model. The Shapley Additive Explanations (SHAP) analysis provided interpretation for the best machine learning model. Further decision curve analysis (DCA) was used to evaluate the clinical manifestations of the model.
A total of 626 patients were included. LASSO regression analysis shows that tumor height, prognostic nutrition index (PNI), pelvic inlet, pelvic outlet, sacrococcygeal distance, mesorectal fat area and angle 5 (the angle between the apex of the sacral angle and the lower edge of the pubic bone) are the predictor variables of the machine learning model. In addition, the correlation heatmap shows that there is no significant correlation between these seven variables. When predicting the difficulty of LaTME surgery, the XGBoost model performed best among the six machine learning models (AUROC=0.855). Based on the decision curve analysis (DCA) results, the XGBoost model is also superior, and feature importance analysis shows that tumor height is the most important variable among the seven factors.
This study developed an XGBoost model to predict the difficulty of LaTME surgery. This model can help clinicians quickly and accurately predict the difficulty of surgery and adopt individualized surgical methods.
腹腔镜全直肠系膜切除术(LaTME)是直肠癌的标准手术方法,而LaTME手术是一项具有挑战性的操作。本研究旨在使用机器学习开发并验证直肠癌患者LaTME手术难度的预测模型,并比较这些模型的性能。
我们回顾性收集了2017年至2022年接受腹腔镜全直肠系膜切除术的直肠癌患者的术前临床和MRI骨盆测量参数。LaTME的难度根据Escal报告的评分标准定义。患者被随机分为训练组(80%)和测试组(20%)。我们使用最小绝对收缩和选择算子(LASSO)和多变量逻辑回归方法选择独立影响因素。采用合成少数过采样技术(SMOTE)来缓解类别不平衡问题。开发了六种机器学习模型:轻梯度提升机(LGBM);分类提升(CatBoost);极端梯度提升(XGBoost)、逻辑回归(LR);随机森林(RF);多层感知器(MLP)。采用受试者操作特征曲线下面积(AUROC)、准确率、灵敏度、特异度和F1分数来评估模型的性能。Shapley加法解释(SHAP)分析为最佳机器学习模型提供了解释。进一步的决策曲线分析(DCA)用于评估模型的临床表现。
共纳入626例患者。LASSO回归分析显示,肿瘤高度、预后营养指数(PNI)、骨盆入口、骨盆出口、骶尾距离、直肠系膜脂肪面积和角度5(骶角顶点与耻骨下缘之间的角度)是机器学习模型的预测变量。此外,相关热图显示这七个变量之间无显著相关性。在预测LaTME手术难度时,XGBoost模型在六种机器学习模型中表现最佳(AUROC=0.855)。基于决策曲线分析(DCA)结果,XGBoost模型也更具优势,特征重要性分析显示肿瘤高度是七个因素中最重要的变量。
本研究开发了一种XGBoost模型来预测LaTME手术的难度。该模型可帮助临床医生快速准确地预测手术难度,并采用个体化手术方法。