A deep attention model to forecast the Length Of Stay and the in-hospital mortality right on admission from ICD codes and demographic data.

Affiliations

Department of Computer Science, Sangmyung University, Seoul, Republic of Korea.

Graduate School of Information, Yonsei University, Seoul, Republic of Korea.

Publication information

J Biomed Inform. 2021 Jun;118:103778. doi: 10.1016/j.jbi.2021.103778. Epub 2021 Apr 17.

DOI:10.1016/j.jbi.2021.103778

PMID:33872817

Abstract

Leveraging longitudinal Electronic Health Record (EHR) data to produce actionable clinical insights is a central concern of recent studies. Unanticipated extended hospitalizations account for a disproportionate share of resource use, mediocre inpatient care quality, and avoidable fatalities. The ability to predict Length of Stay (LoS) and mortality early in an admission provides opportunities to improve care and avert many preventable losses. Forecasting in-hospital mortality gives clinicians enough insight to make decisions and helps hospitals allocate resources; predicting LoS and mortality within the first day of admission is therefore a difficult but paramount endeavor. The biggest challenge is that little data is available at this point, so the prediction has to draw on the history of previous admissions and the free-text diagnosis recorded immediately on admission. We propose a model that uses multi-modal structured EHR medical codes and key demographic information to classify LoS into three classes, Short LoS (LoS ⩽ 10 days), Medium LoS (10 < LoS ⩽ 30 days), and Long LoS (LoS > 30 days), and to predict mortality as a binary classification of whether the patient dies during the current admission. The prediction may use only data available within 24 h of admission. The key predictors include previous ICD9 diagnosis codes, ICD9 procedure codes, key demographic data, and the free-text diagnosis of the current admission recorded right on admission. We propose a Hierarchical Attention Network model (HAN-LoS and HAN-Mor) and train it on over 45321 admissions recorded in the de-identified MIMIC-III dataset. For improved prediction, our attention mechanisms can focus on the most influential past admissions and the most influential codes within those admissions. For fair performance evaluation, we implemented previous approaches and compared the HAN model against them.
With dataset balancing techniques, HAN-LoS achieved an AUROC of over 0.82 and a Micro-F1 score of 0.24, and HAN-Mor achieved an AUROC of 0.87, outperforming existing baselines that use structured medical codes as well as clinical time series for LoS and mortality forecasting. By predicting mortality and LoS using the same model, we show that with little tuning the proposed model can be used for other clinical predictive tasks such as phenotyping, decompensation, re-admission prediction, and survival analysis.
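The two ideas the abstract describes, the three-class LoS labelling and attention applied first over the codes in each admission and then over admissions, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the plain dot-product attention (no learned projection layer), the embedding size, and the random stand-in embeddings are all assumptions made for illustration. Only the class thresholds come from the abstract.

```python
import numpy as np


def los_class(los_days: float) -> int:
    """Map a length of stay in days to the three classes given in the abstract."""
    if los_days <= 10:
        return 0  # Short LoS: LoS <= 10 days
    if los_days <= 30:
        return 1  # Medium LoS: 10 < LoS <= 30 days
    return 2      # Long LoS: LoS > 30 days


def softmax(x: np.ndarray) -> np.ndarray:
    """Numerically stable softmax over a 1-D array."""
    z = np.exp(x - x.max())
    return z / z.sum()


def attention_pool(items: np.ndarray, context: np.ndarray):
    """Score each row against a context vector, then return the
    attention-weighted sum of the rows together with the weights."""
    weights = softmax(items @ context)
    return weights @ items, weights


def encode_patient(admissions, code_context, visit_context):
    """Hypothetical two-level encoder: codes -> admission -> patient."""
    # Level 1: pool the code embeddings inside each past admission.
    visit_vecs = np.stack(
        [attention_pool(codes, code_context)[0] for codes in admissions]
    )
    # Level 2: pool the admission vectors into one patient vector.
    return attention_pool(visit_vecs, visit_context)


# Toy usage: three past admissions with 3, 5, and 2 codes each.
# Random vectors stand in for learned code embeddings and context vectors.
rng = np.random.default_rng(0)
dim = 8
admissions = [rng.normal(size=(n, dim)) for n in (3, 5, 2)]
patient_vec, visit_weights = encode_patient(
    admissions, rng.normal(size=dim), rng.normal(size=dim)
)
```

In a trained model, `visit_weights` would indicate which past admissions influenced the prediction most, and `patient_vec` would feed a classifier head for either the LoS classes or the binary mortality label.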

Similar articles

A deep attention model to forecast the Length Of Stay and the in-hospital mortality right on admission from ICD codes and demographic data.

J Biomed Inform. 2021 Jun;118:103778. doi: 10.1016/j.jbi.2021.103778. Epub 2021 Apr 17.

Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records.

Lancet Digit Health. 2020 Apr;2(4):e179-e191. doi: 10.1016/S2589-7500(20)30018-2. Epub 2020 Mar 12.

Leveraging electronic health record data to inform hospital resource management : A systematic data mining approach.

Health Care Manag Sci. 2021 Dec;24(4):716-741. doi: 10.1007/s10729-021-09554-4. Epub 2021 May 24.

Multi-modal learning for inpatient length of stay prediction.

Comput Biol Med. 2024 Mar;171:108121. doi: 10.1016/j.compbiomed.2024.108121. Epub 2024 Feb 9.

The impact of a new emergency admission avoidance system for older people on length of stay and same-day discharges.

Age Ageing. 2014 Jan;43(1):116-21. doi: 10.1093/ageing/aft086. Epub 2013 Aug 1.

Predictors of in-hospital length of stay among cardiac patients: A machine learning approach.

Int J Cardiol. 2019 Aug 1;288:140-147. doi: 10.1016/j.ijcard.2019.01.046. Epub 2019 Jan 19.

What diagnoses may make patients more seriously ill than they first appear? Mortality according to the Simple Clinical Score Risk Class at the time of admission compared to the observed mortality of different ICD9 codes identified on death or discharge.

Eur J Intern Med. 2009 Jan;20(1):89-93. doi: 10.1016/j.ejim.2008.04.012. Epub 2008 Jun 10.

Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data.

Lancet Digit Health. 2022 Jun;4(6):e415-e425. doi: 10.1016/S2589-7500(22)00049-8. Epub 2022 Apr 21.

Quantifying the impact of addressing data challenges in prediction of length of stay.

BMC Med Inform Decis Mak. 2021 Oct 30;21(1):298. doi: 10.1186/s12911-021-01660-1.

M3T-LM: A multi-modal multi-task learning model for jointly predicting patient length of stay and mortality.

Comput Biol Med. 2024 Dec;183:109237. doi: 10.1016/j.compbiomed.2024.109237. Epub 2024 Oct 7.

Cited by

Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review.

J Med Internet Res. 2025 Mar 18;27:e57358. doi: 10.2196/57358.

Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke.

ACM BCB. 2023 Sep;2023. doi: 10.1145/3584371.3613002. Epub 2023 Oct 4.

PSO-XnB: a proposed model for predicting hospital stay of CAD patients.

Front Artif Intell. 2024 May 3;7:1381430. doi: 10.3389/frai.2024.1381430. eCollection 2024.

A hybrid modeling framework for generalizable and interpretable predictions of ICU mortality across multiple hospitals.

Sci Rep. 2024 Mar 8;14(1):5725. doi: 10.1038/s41598-024-55577-6.

Towards Predicting Length of Stay and Identification of Cohort Risk Factors Using Self-Attention-Based Transformers and Association Mining: COVID-19 as a Phenotype.

Diagnostics (Basel). 2023 May 17;13(10):1760. doi: 10.3390/diagnostics13101760.

Artificial intelligence for clinical decision support for monitoring patients in cardiovascular ICUs: A systematic review.

Front Med (Lausanne). 2023 Mar 31;10:1109411. doi: 10.3389/fmed.2023.1109411. eCollection 2023.

Time-to-event modeling for hospital length of stay prediction for COVID-19 patients.

Mach Learn Appl. 2022 Sep 15;9:100365. doi: 10.1016/j.mlwa.2022.100365. Epub 2022 Jun 18.
