Kosmidis Dimitrios, Simopoulos Dimitrios, Kosmidis Nestoras, Anastassopoulos George
Department of Nursing, Democritus University of Thrace, Greece, Alexandroupolis, GRC.
Department of Medicine, Democritus University of Thrace, Alexandroupolis, GRC.
Cureus. 2025 Aug 12;17(8):e89903. doi: 10.7759/cureus.89903. eCollection 2025 Aug.
Background and aim Predicting the Length of Stay (LOS) for patients in the Intensive Care Unit (ICU) can aid in improving care management and resource allocation. Compared to traditional scoring systems, machine learning methods usually provide more accurate LOS predictions, highlighting the need for improved precision. This study aims to introduce and assess the stacked ensemble machine learning model capabilities to predict ICU LOS for both short- and long-term patient groups, using Acute Physiology and Chronic Health Evaluation (APACHE) IV-derived features. Methods We used approximately 148,000 patient records from the eICU Collaborative Research Database. To predict patient LOS, we first divided the patients into two groups (short- and long-term), based on their actual ICU LOS. Subsequently, we developed two stacked ensemble learning models for each patient group. Results For short-term patients, the Mean Absolute Error (MAE) was 1.037, whereas for long-term patients it was 1.997. Particularly, for patients with long-term actual ICU LOS (median 10.6 days), the respective predictions were clinically acceptable, suggesting the model's dynamics in real ICU environment applications. The clinical utility of these predictions can help clinicians in managing patient care in the ICU. Validation of these findings on external datasets may increase the applicability in daily clinical practice. Limitations include the use of data only from United States ICUs, selection of input features, and lack of external validation, which may affect the further generalizability and applicability of our model to different clinical settings. Conclusion The clinical utility of these predictions can help clinicians in managing patient care in the ICU. Future validation of these findings on external datasets may increase the applicability in daily clinical practice.
背景与目的 预测重症监护病房(ICU)患者的住院时长(LOS)有助于改善护理管理和资源分配。与传统评分系统相比,机器学习方法通常能提供更准确的住院时长预测,这凸显了提高预测精度的必要性。本研究旨在引入并评估堆叠集成机器学习模型预测短期和长期患者群体ICU住院时长的能力,使用急性生理学与慢性健康状况评估(APACHE)IV衍生特征。方法 我们使用了来自电子ICU协作研究数据库的约148,000份患者记录。为预测患者住院时长,我们首先根据患者实际的ICU住院时长将其分为两组(短期和长期)。随后,我们为每个患者群体开发了两个堆叠集成学习模型。结果 对于短期患者,平均绝对误差(MAE)为1.037,而对于长期患者,该误差为1.997。特别是,对于长期实际ICU住院时长(中位数为10.6天)的患者,相应的预测在临床上是可接受的,这表明了该模型在实际ICU环境应用中的动态性。这些预测的临床实用性有助于临床医生在ICU中管理患者护理。在外部数据集上对这些发现进行验证可能会增加其在日常临床实践中的适用性。局限性包括仅使用来自美国ICU的数据、输入特征的选择以及缺乏外部验证,这可能会影响我们模型在不同临床环境中的进一步通用性和适用性。结论 这些预测的临床实用性有助于临床医生在ICU中管理患者护理。未来在外部数据集上对这些发现进行验证可能会增加其在日常临床实践中的适用性。