Department of Anesthesiology, The Affiliated Changzhou Second People's Hospital of Nanjing Medical University, Changzhou, Jiangsu, China.
The Third Affiliated Hospital of Soochow University, Changzhou, Jiangsu, China.
Biomol Biomed. 2024 May 2;24(3):593-605. doi: 10.17305/bb.2023.9519.
Postoperative sore throat (POST) is a prevalent complication after general anesthesia and targeting high-risk patients helps in its prevention. This study developed and validated a machine learning model to predict POST. A total number of 834 patients who underwent general anesthesia with endotracheal intubation were included in this study. Data from a cohort of 685 patients was used for model development and validation, while a cohort of 149 patients served for external validation. The prediction performance of random forest (RF), neural network (NN), and extreme gradient boosting (XGBoost) models was compared using comprehensive performance metrics. The Local Interpretable Model-Agnostic Explanations (LIME) methods elucidated the best-performing model. POST incidences across training, validation, and testing cohorts were 41.7%, 38.4%, and 36.2%, respectively. Five predictors were age, sex, endotracheal tube cuff pressure, endotracheal tube insertion depth, and the time interval between extubation and the first drinking of water after extubation. After incorporating these variables, the NN model demonstrated superior generalization capabilities in predicting POST when compared to the XGBoost and RF models in external validation, achieving an area under the receiver operating characteristic curve (AUROC) of 0.81 (95% CI 0.74-0.89) and a precision-recall curve (AUPRC) of 0.77 (95% CI 0.66-0.86). The model also showed good calibration and clinical usage values. The NN model outperforms the XGBoost and RF models in predicting POST, with potential applications in the healthcare industry for reducing the incidence of this common postoperative complication.
术后咽喉痛(POST)是全身麻醉后常见的并发症,针对高危患者有助于预防。本研究开发并验证了一种用于预测 POST 的机器学习模型。本研究共纳入 834 例全身麻醉气管插管患者。其中 685 例患者的数据用于模型开发和验证,149 例患者的数据用于外部验证。使用综合性能指标比较了随机森林(RF)、神经网络(NN)和极端梯度提升(XGBoost)模型的预测性能。使用局部可解释模型不可知解释(LIME)方法阐明了表现最佳的模型。训练、验证和测试队列中的 POST 发生率分别为 41.7%、38.4%和 36.2%。五个预测因子是年龄、性别、气管导管套囊压力、气管导管插入深度和拔管后第一次饮水的时间间隔。在纳入这些变量后,与 XGBoost 和 RF 模型相比,NN 模型在外部验证中表现出更好的预测 POST 的泛化能力,在接受者操作特征曲线(AUROC)下面积为 0.81(95%置信区间 0.74-0.89)和精度-召回曲线(AUPRC)为 0.77(95%置信区间 0.66-0.86)。该模型还表现出良好的校准和临床应用价值。NN 模型在预测 POST 方面优于 XGBoost 和 RF 模型,有望在医疗保健行业中应用,以降低这种常见术后并发症的发生率。