Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique IPLESP, Paris, F75012, France.
AP-HP.Sorbonne Université, Public Health Department, Saint-Antoine Hospital, Paris, F75012, France.
BMC Med Inform Decis Mak. 2023 Aug 30;23(1):170. doi: 10.1186/s12911-023-02264-7.
The risk of mortality in intensive care units (ICUs) is currently addressed by the implementation of scores using admission data. Their performances are satisfactory when complications occur early after admission; however, they may become irrelevant in the case of long hospital stays. In this study, we developed predictive models of short-term mortality in the ICU from longitudinal data.
Using data collected throughout patients' stays of at least 48 h from the MIMIC-III database, several statistical learning approaches were compared, including deep neural networks and penalized regression. Missing data were handled using complete-case analysis or multiple imputation.
Complete-case analyses from 19 predictors showed good discrimination (AUC > 0.77 for several approaches) to predict death between 12 and 24 h onward, yet excluded 75% of patients from the initial target cohort, as data was missing for some of the predictors. Multiple imputation allowed us to include 70 predictors and keep 95% of patients, with similar performances.
This proof-of-concept study supports that automated analysis of electronic health records can be of great interest throughout patients' stays as a surveillance tool. Although this framework relies on a large set of predictors, it is robust to data imputation and may be effective early after admission, when data are still scarce.
目前,通过使用入院数据评分来评估重症监护病房(ICU)的死亡率风险。这些评分在入院后早期发生并发症时表现良好;然而,在住院时间较长的情况下,它们可能变得不相关。在这项研究中,我们从纵向数据中开发了 ICU 短期死亡率的预测模型。
使用来自 MIMIC-III 数据库的至少 48 小时的患者住院期间的数据,比较了几种统计学习方法,包括深度神经网络和惩罚回归。使用完整病例分析或多重插补处理缺失数据。
来自 19 个预测因子的完整病例分析显示出良好的区分度(几种方法的 AUC>0.77),可预测 12 至 24 小时后死亡,但排除了初始目标队列中 75%的患者,因为一些预测因子的数据缺失。多重插补允许我们纳入 70 个预测因子并保留 95%的患者,表现相似。
这项概念验证研究支持自动化电子病历分析可以作为一种监测工具,在患者住院期间具有很大的意义。虽然这个框架依赖于大量的预测因子,但它对数据插补具有稳健性,并且可能在入院后早期有效,此时数据仍然较少。