Hospital Length of Stay Prediction for Planned Admissions Using Observational Medical Outcomes Partnership Common Data Model: Retrospective Study.

Affiliations

Department of Biomedical Informatics and Data Science, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, MD, United States.

Office of eHealth Research and Businesses, Seoul National University Bundang Hospital, Seongnam-si, Republic of Korea.

Publication information

J Med Internet Res. 2024 Nov 22;26:e59260. doi: 10.2196/59260.

DOI:10.2196/59260

PMID:39576284

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11624451/

Abstract

BACKGROUND

Accurate hospital length of stay (LoS) prediction enables efficient resource management. Conventional LoS prediction models with limited covariates and nonstandardized data have limited reproducibility when applied to the general population.

OBJECTIVE

In this study, we developed and validated a machine learning (ML)-based LoS prediction model for planned admissions using the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM).

METHODS

Retrospective patient-level prediction models used electronic health record (EHR) data converted to the OMOP CDM (version 5.3) from Seoul National University Bundang Hospital (SNUBH) in South Korea. The study included 137,437 hospital admission episodes between January 2016 and December 2020. Covariates from the patient, condition occurrence, medication, observation, measurement, procedure, and visit occurrence tables were included in the analysis. To perform feature selection, we applied Lasso regularization in the logistic regression. The primary outcome was an LoS of 7 days or longer, while the secondary outcome was an LoS of 3 days or longer. The prediction models were developed using 6 ML algorithms, with the training and test set split in a 7:3 ratio. The performance of each model was evaluated based on the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC). Shapley Additive Explanations (SHAP) analysis measured feature importance, while calibration plots assessed the reliability of the prediction models. External validation of the developed models occurred at an independent institution, the Seoul National University Hospital.
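The outcome definitions, the 7:3 split, and the two evaluation metrics described above can be illustrated with a small sketch. This is not the authors' pipeline: the function names (`label_outcomes`, `split_train_test`, `auroc`, `auprc`) are hypothetical, both outcomes are treated as simple binary thresholds, the split is a plain random shuffle, and the metric code assumes untied scores and small inputs.

```python
import random


def label_outcomes(los_days, primary_cutoff=7, secondary_cutoff=3):
    """Return (primary, secondary) binary labels for one stay, in days.

    Assumption: both outcomes are plain thresholds (LoS >= 7 and LoS >= 3).
    """
    return int(los_days >= primary_cutoff), int(los_days >= secondary_cutoff)


def split_train_test(rows, train_fraction=0.7, seed=0):
    """Shuffle a copy of rows and split it 7:3 (a simple random split)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


def auroc(y_true, scores):
    """AUROC as the chance a positive outranks a negative (ties count 0.5).

    Quadratic in the number of samples, so only suitable for small examples.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def auprc(y_true, scores):
    """Average precision: mean precision at the rank of each positive.

    Assumes scores are not tied; used here as a common AUPRC estimate.
    """
    ranked = sorted(zip(scores, y_true), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    if hits == 0:
        raise ValueError("no positive samples")
    return total / hits


if __name__ == "__main__":
    # Toy lengths of stay (days) and made-up model scores, for illustration only.
    los = [1, 2, 3, 5, 8, 10, 12, 4, 7, 2]
    train, test = split_train_test(los)
    print(len(train), len(test))
    y = [label_outcomes(d)[0] for d in los]
    scores = [0.1, 0.2, 0.3, 0.6, 0.9, 0.8, 0.7, 0.4, 0.5, 0.05]
    print(auroc(y, scores), auprc(y, scores))
```

In practice a library implementation of these metrics would normally be preferred; the hand-written versions are shown only to make the definitions explicit.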

RESULTS

The final sample included 129,938 patient entry events in the planned admissions. The Extreme Gradient Boosting (XGB) model achieved the best performance in binary classification for predicting an LoS of 7 days or longer, with an AUROC of 0.891 (95% CI 0.887-0.894) and an AUPRC of 0.819 (95% CI 0.813-0.826) on the internal test set. The Light Gradient Boosting (LGB) model performed the best in the multiclass classification for predicting an LoS of 3 days or more, with an AUROC of 0.901 (95% CI 0.898-0.904) and an AUPRC of 0.770 (95% CI 0.762-0.779). The most important features contributing to the models were the operation performed, frequency of previous outpatient visits, patient admission department, age, and day of admission. The random forest (RF) model showed robust performance in the external validation set, achieving an AUROC of 0.804 (95% CI 0.802-0.807).

CONCLUSIONS

The use of the OMOP CDM in predicting hospital LoS for planned admissions demonstrates promising predictive capabilities for stays of varying durations. It underscores the advantage of standardized data in achieving reproducible results. This approach should serve as a model for enhancing operational efficiency and patient care coordination across health care settings.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/678a/11624451/0d9e48d2c832/jmir_v26i1e59260_fig1.jpg

Similar articles

Developing a Machine Learning Model for Predicting 30-Day Major Adverse Cardiac and Cerebrovascular Events in Patients Undergoing Noncardiac Surgery: Retrospective Study.

J Med Internet Res. 2025 Apr 9;27:e66366. doi: 10.2196/66366.

Development and Validation of a Machine Learning Model for Early Prediction of Delirium in Intensive Care Units Using Continuous Physiological Data: Retrospective Study.

J Med Internet Res. 2025 Apr 2;27:e59520. doi: 10.2196/59520.

Predicting In-Hospital Fall Risk Using Machine Learning With Real-Time Location System and Electronic Medical Records.

J Cachexia Sarcopenia Muscle. 2025 Feb;16(1):e13713. doi: 10.1002/jcsm.13713.

Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records.

Lancet Digit Health. 2020 Apr;2(4):e179-e191. doi: 10.1016/S2589-7500(20)30018-2. Epub 2020 Mar 12.

Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.

Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.

Development and Validation of a Prognostic Classification Model Predicting Postoperative Adverse Outcomes in Older Surgical Patients Using a Machine Learning Algorithm: Retrospective Observational Network Study.

J Med Internet Res. 2023 Nov 13;25:e42259. doi: 10.2196/42259.

Development and Validation of a Routine Electronic Health Record-Based Delirium Prediction Model for Surgical Patients Without Dementia: Retrospective Case-Control Study.

JMIR Perioper Med. 2025 Jan 9;8:e59422. doi: 10.2196/59422.

Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.

JMIR Aging. 2024 Jul 26;7:e54872. doi: 10.2196/54872.

Predictors of in-hospital length of stay among cardiac patients: A machine learning approach.

Int J Cardiol. 2019 Aug 1;288:140-147. doi: 10.1016/j.ijcard.2019.01.046. Epub 2019 Jan 19.

Cited by

Predicting 30-day hospital readmissions using ClinicalT5 with structured and unstructured electronic health records.

PLoS One. 2025 Sep 2;20(9):e0328848. doi: 10.1371/journal.pone.0328848. eCollection 2025.
