Caraballo Pedro J, Meehan Anne M, Fischer Karen M, Rahman Parvez, Simon Gyorgy J, Melton Genevieve B, Salehinejad Hojjat, Borah Bijan J
Department of Medicine, Mayo Clinic, Rochester, MN 55905, United States.
Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, United States.
JAMIA Open. 2025 Jan 6;8(1):ooae156. doi: 10.1093/jamiaopen/ooae156. eCollection 2025 Feb.
In the general hospital wards, machine learning (ML)-based early warning systems (EWSs) can identify patients at risk of deterioration to facilitate rescue interventions. We assess subpopulation performance of a ML-based EWS on medical and surgical adult patients admitted to general hospital wards.
We assessed the scores of an EWS integrated into the electronic health record and calculated every 15 minutes to predict a composite adverse event (AE): all-cause mortality, transfer to intensive care, cardiac arrest, or rapid response team evaluation. The distributions of the First Score 3 hours after admission, the Highest Score at any time during the hospitalization, and the Last Score just before an AE or dismissal without an AE were calculated. The Last Score was used to calculate the area under the receiver operating characteristic curve (ROC-AUC) and the precision-recall curve (PRC-AUC).
From August 23, 2021 to March 31, 2022, 35 937 medical admissions had 2173 (6.05%) AE compared to 25 214 surgical admissions with 4984 (19.77%) AE. Medical and surgical admissions had significant different ( <.001) distributions of the First Score, Highest Score, and Last Score among those with an AE and without an AE. The model performed better in the medical group when compared to the surgical group, ROC-AUC 0.869 versus 0.677, and RPC-AUC 0.988 versus 0.878, respectively.
Heterogeneity of medical and surgical patients can significantly impact the performance of a ML-based EWS, changing the model validity and clinical discernment.
Characterization of the target patient subpopulations has clinical implications and should be considered when developing models to be used in general hospital wards.
在综合医院病房中,基于机器学习(ML)的早期预警系统(EWS)可以识别有病情恶化风险的患者,以便于进行救援干预。我们评估了基于ML的EWS在综合医院病房收治的成年内科和外科患者中的亚组表现。
我们评估了集成到电子健康记录中的EWS的分数,每15分钟计算一次,以预测复合不良事件(AE):全因死亡率、转入重症监护、心脏骤停或快速反应小组评估。计算入院后3小时的首次评分、住院期间任何时间的最高评分以及AE发生前或无AE出院前的末次评分的分布情况。末次评分用于计算受试者操作特征曲线下面积(ROC-AUC)和精确召回率曲线下面积(PRC-AUC)。
2021年8月23日至2022年3月31日,35937例内科入院患者中有2173例(6.05%)发生AE,而25214例外科入院患者中有4984例(19.77%)发生AE。在内科和外科入院患者中,有AE和无AE患者的首次评分、最高评分和末次评分分布存在显著差异(P<0.001)。与外科组相比,该模型在内科组中的表现更好,ROC-AUC分别为0.869和0.677,RPC-AUC分别为0.988和0.878。
内科和外科患者的异质性会显著影响基于ML的EWS的性能,改变模型的有效性和临床辨别力。
目标患者亚组的特征具有临床意义,在开发用于综合医院病房的模型时应予以考虑。