Ward Logan Morgan, Lindskou Tim Alex, Mogensen Mads Lause, Christensen Erika Frischknecht, Søvsø Morten Breinholt
Treat Systems ApS, Hasserisvej 125, 9000, Aalborg, Denmark.
Centre for Prehospital and Emergency Research and Danish Centre for Health Services Research, Aalborg University Hospital and Department of Clinical Medicine, Aalborg University, Selma Lagerløfs Vej 249, 9260, Gistrup, Denmark.
Sci Rep. 2025 Jul 1;15(1):21459. doi: 10.1038/s41598-025-08247-0.
Early warning scores are used to assess acute patients' risk of being in a critical situation, allowing for early appropriate treatment, avoiding critical outcomes. The early warning scores use changes in vital signs to provide an assessment, however they tend to identify a considerable number of false positive cases, especially among prehospital patients. We investigated the development and validation of predictive scores based on machine learning models among patients (aged ≥ 18 years) who used ambulances in the North Denmark Region from July 1, 2016, to December 31, 2020. The machine learning models were compared to standard early warning scores (NEWS2 and DEPT), on 7- and 30-day mortality and intensive care admission. The cohort of 219,323 patients was split into development (n = 175,458 (80%)) and validation (n = 43,865 (20%)) datasets to respectively develop and test the machine learning models. These models were logistic regression, random forest, Bayesian networks, and gradient boosting. The machine learning models outperformed NEWS2 and DEPT, with fewer false positives, reducing the number of patients needed to screen by nearly half, for 7 day mortality. This has the potential to reduce both under- and over-triage, improving the precision of the triage among prehospital patients.
早期预警评分用于评估急性病患者处于危急状况的风险,以便能尽早进行适当治疗,避免出现危急后果。早期预警评分利用生命体征的变化来进行评估,然而,它们往往会识别出相当数量的假阳性病例,尤其是在院前患者中。我们对2016年7月1日至2020年12月31日期间在丹麦北部地区使用救护车的18岁及以上患者中基于机器学习模型的预测评分的开发和验证进行了调查。将机器学习模型与标准早期预警评分(NEWS2和DEPT)在7天和30天死亡率以及重症监护入院方面进行了比较。219323名患者的队列被分为开发数据集(n = 175458(80%))和验证数据集(n = 43865(20%)),分别用于开发和测试机器学习模型。这些模型包括逻辑回归、随机森林、贝叶斯网络和梯度提升。机器学习模型的表现优于NEWS2和DEPT,假阳性更少,对于7天死亡率而言,将需要筛查的患者数量减少了近一半。这有可能减少分诊不足和过度分诊的情况,提高院前患者分诊的准确性。