Emergency Center, Hubei Clinical Research Center for Emergency and Resuscitation, Zhongnan Hospital of Wuhan University, Wuhan, China.
Wuhan University School of Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China.
Surg Innov. 2024 Dec;31(6):583-597. doi: 10.1177/15533506241273449. Epub 2024 Aug 16.
The development of emergency department (ED) triage systems remains challenging in accurately differentiating patients with acute abdominal pain (AAP) who are critical and urgent for surgery due to subjectivity and limitations. We use machine learning models to predict emergency surgical abdominal pain patients in triage, and then compare their performance with conventional Logistic regression models.
Using 38 214 patients presenting with acute abdominal pain at Zhongnan Hospital of Wuhan University between March 1, 2014, and March 1, 2022, we identified all adult patients (aged ≥18 years). We utilized routinely available triage data in electronic medical records as predictors, including structured data (eg, triage vital signs, gender, and age) and unstructured data (chief complaints and physical examinations in free-text format). The primary outcome measure was whether emergency surgery was performed. The dataset was randomly sampled, with 80% assigned to the training set and 20% to the test set. We developed 5 machine learning models: Light Gradient Boosting Machine (Light GBM), eXtreme Gradient Boosting (XGBoost), Deep Neural Network (DNN), and Random Forest (RF). Logistic regression (LR) served as the reference model. Model performance was calculated for each model, including the area under the receiver-work characteristic curve (AUC) and net benefit (decision curve), as well as the confusion matrix.
Of all the 38 214 acute abdominal pain patients, 4208 underwent emergency abdominal surgery while 34 006 received non-surgical treatment. In the surgery outcome prediction, all 4 machine learning models outperformed the reference model (eg, AUC, 0.899 [95%CI 0.891-0.903] in the Light GBM vs. 0.885 [95%CI 0.876-0.891] in the reference model), Similarly, most machine learning models exhibited significant improvements in net reclassification compared to the reference model (eg, NRIs of 0.0812[95%CI, 0.055-0.1105] in the XGBoost), with the exception of the RF model. Decision curve analysis shows that across the entire range of thresholds, the net benefits of the XGBoost and the Light GBM models were higher than the reference model. In particular, the Light GBM model performed well in predicting the need for emergency abdominal surgery with higher sensitivity, specificity, and accuracy.
Machine learning models have demonstrated superior performance in predicting emergency abdominal pain surgery compared to traditional models. Modern machine learning improves clinical triage decisions and ensures that critically needy patients receive priority for emergency resources and timely, effective treatment.
急诊科分诊系统的发展仍然具有挑战性,因为其在区分因急性腹痛(AAP)需要紧急手术的危急和紧急患者方面存在主观性和局限性。我们使用机器学习模型来预测分诊中的急诊外科腹痛患者,然后将其性能与传统的逻辑回归模型进行比较。
使用 2014 年 3 月 1 日至 2022 年 3 月 1 日期间在武汉大学中南医院就诊的 38214 名急性腹痛患者,我们确定了所有成年患者(年龄≥18 岁)。我们利用电子病历中常规可用的分诊数据作为预测因素,包括结构化数据(例如,分诊生命体征、性别和年龄)和非结构化数据(自由文本格式的主要抱怨和体格检查)。主要结局指标是是否进行急诊手术。数据集被随机抽样,80%分配给训练集,20%分配给测试集。我们开发了 5 种机器学习模型:Light Gradient Boosting Machine(Light GBM)、eXtreme Gradient Boosting(XGBoost)、Deep Neural Network(DNN)和 Random Forest(RF)。逻辑回归(LR)作为参考模型。为每个模型计算模型性能,包括接收者工作特征曲线下的面积(AUC)和净收益(决策曲线),以及混淆矩阵。
在所有 38214 例急性腹痛患者中,4208 例行急诊腹部手术,34006 例接受非手术治疗。在手术结果预测中,所有 4 种机器学习模型的表现均优于参考模型(例如,在 Light GBM 中的 AUC 为 0.899[95%CI 0.891-0.903],而参考模型中的 AUC 为 0.885[95%CI 0.876-0.891])。同样,大多数机器学习模型在与参考模型相比时,在净再分类方面表现出显著改善(例如,在 XGBoost 中的 NRI 为 0.0812[95%CI,0.055-0.1105]),除了 RF 模型。决策曲线分析表明,在整个阈值范围内,XGBoost 和 Light GBM 模型的净收益均高于参考模型。特别是,Light GBM 模型在预测急诊腹部手术需求方面表现出色,具有更高的敏感性、特异性和准确性。
与传统模型相比,机器学习模型在预测急诊腹痛手术方面表现出优异的性能。现代机器学习可以改善临床分诊决策,并确保有紧急需要的患者优先获得急诊资源和及时、有效的治疗。