Havranek Michael M, Hwang Aljoscha B, Funke Ilona, Kuhlen Dominique, Liedtke Daniel, Boes Stefan
Competence Center for Health Data Science, Faculty of Health Science and Medicine, University of Lucerne, Lucerne, Switzerland.
Medical Department, Hirslanden Group, Zurich, Switzerland.
PLoS One. 2025 Sep 4;20(9):e0331263. doi: 10.1371/journal.pone.0331263. eCollection 2025.
Hospital readmissions prolong patient suffering and increase healthcare expenditures. While several studies have attempted to develop prediction models to reduce readmissions, most have demonstrated modest predictive accuracy. To improve upon prior approaches, we conducted an overview of systematic reviews to identify the most relevant predictor variables, then subsequently developed machine learning models in a retrospective, multisite study across eight hospitals. The patient sample comprised 200,799 inpatient stays from eligible hospitalizations, based on the Centers for Medicare and Medicaid Services (CMS) definition of unplanned readmissions within 30 days of discharge. We constructed random forest models and evaluated out-of-sample performance using the area under the receiver operating characteristic curve (AUC) across different train-test splits. The hospital-wide sample was divided into medical and surgical cohorts to investigate predictor importance across different patient populations. The average AUC score was 0.78 ± 0.01 (mean ± standard deviation [SD]). Patients' diagnoses were the most important predictor variables (contributing 18.4% ± 0.15 to the model's decision, mean ± standard error [SE]), followed by nursing assessments (11.2% ± 0.04, mean ± SE) and procedural information (10.8% ± 0.09, mean ± SE). Comparing medical and surgical patients, we found that medications and prior healthcare use (e.g., prior emergency encounters) were more important in the medical compared with the surgical cohort, whereas procedural information and healthcare provider information (e.g., physician caseload) were more relevant in the surgical relative to the medical cohort. In conclusion, we have established the feasibility of using Swiss electronic medical record (EMR) data to accurately predict unplanned readmissions. The reported variable importances may guide future research and inform development of clinical decision support systems aimed at reducing readmissions.
医院再入院会延长患者的痛苦并增加医疗费用。虽然有几项研究试图开发预测模型以减少再入院情况,但大多数研究显示预测准确性一般。为了改进先前的方法,我们对系统评价进行了概述,以确定最相关的预测变量,随后在一项涵盖八家医院的回顾性多地点研究中开发了机器学习模型。患者样本包括符合条件的住院治疗中的200,799次住院,这是根据医疗保险和医疗补助服务中心(CMS)对出院后30天内非计划再入院的定义确定的。我们构建了随机森林模型,并使用不同训练-测试分割下的受试者工作特征曲线下面积(AUC)评估样本外性能。将全院样本分为内科和外科队列,以研究不同患者群体中预测因素的重要性。平均AUC得分为0.78±0.01(均值±标准差[SD])。患者的诊断是最重要的预测变量(对模型决策的贡献为18.4%±0.15,均值±标准误[SE]),其次是护理评估(11.2%±0.04,均值±SE)和手术信息(10.8%±0.09,均值±SE)。比较内科和外科患者,我们发现与外科队列相比,药物治疗和先前的医疗使用情况(如先前的急诊就诊)在内科患者中更为重要,而手术信息和医疗服务提供者信息(如医生工作量)在外科患者中相对于内科队列更为相关。总之,我们已经证明了使用瑞士电子病历(EMR)数据准确预测非计划再入院的可行性。报告的变量重要性可能会指导未来的研究,并为旨在减少再入院的临床决策支持系统的开发提供信息。