Zhang Yiye, Huang Yufang, Rosen Anthony, Jiang Lynn G, McCarty Matthew, RoyChoudhury Arindam, Han Jin Ho, Wright Adam, Ancker Jessica S, Steel Peter Ad
Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, United States of America.
Department of Emergency Medicine, Weill Cornell Medicine, New York, New York, United States of America.
PLOS Digit Health. 2024 Sep 27;3(9):e0000606. doi: 10.1371/journal.pdig.0000606. eCollection 2024 Sep.
Return visit admissions (RVA), which are instances where patients discharged from the emergency department (ED) rapidly return and require hospital admission, have been associated with quality issues and adverse outcomes. We developed and validated a machine learning model to predict 72-hour RVA using electronic health records (EHR) data. Study data were extracted from EHR data in 2019 from three urban EDs. The development and independent validation datasets included 62,154 patients from two EDs and 73,453 patients from one ED, respectively. Multiple machine learning algorithms were evaluated, including deep significance clustering (DICE), regularized logistic regression (LR), Gradient Boosting Decision Tree, and XGBoost. These machine learning models were also compared against an existing clinical risk score. To support clinical actionability, clinician investigators conducted manual chart reviews of the cases identified by the model. Chart reviews categorized predicted cases across index ED discharge diagnosis and RVA root cause classifications. The best-performing model achieved an AUC of 0.87 in the development site (test set) and 0.75 in the independent validation set. The model, which combined DICE and LR, boosted predictive performance while providing well-defined features. The model was relatively robust to sensitivity analyses regarding performance across age, race, and by varying predictor availability but less robust across diagnostic groups. Clinician examination demonstrated discrete model performance characteristics within clinical subtypes of RVA. This machine learning model demonstrated a strong predictive performance for 72- RVA. Despite the limited clinical actionability potentially due to model complexity, the rarity of the outcome, and variable relevance, the clinical examination offered guidance on further variable inclusion for enhanced predictive accuracy and actionability.
复诊入院(RVA)是指从急诊科(ED)出院的患者迅速返回并需要住院治疗的情况,它与质量问题和不良后果相关。我们开发并验证了一种机器学习模型,用于使用电子健康记录(EHR)数据预测72小时复诊入院情况。研究数据于2019年从三个城市急诊科的EHR数据中提取。开发数据集和独立验证数据集分别包括来自两个急诊科的62154名患者和来自一个急诊科的73453名患者。评估了多种机器学习算法,包括深度显著性聚类(DICE)、正则化逻辑回归(LR)、梯度提升决策树和XGBoost。这些机器学习模型还与现有的临床风险评分进行了比较。为了支持临床可操作性,临床研究人员对模型识别出的病例进行了人工病历审查。病历审查根据索引急诊科出院诊断和复诊入院根本原因分类对预测病例进行了分类。表现最佳的模型在开发地点(测试集)的AUC为0.87,在独立验证集中为0.75。该模型结合了DICE和LR,在提供明确特征的同时提高了预测性能。该模型在年龄、种族以及不同预测变量可用性方面的性能敏感性分析中相对稳健,但在不同诊断组中稳健性较差。临床检查显示了复诊入院临床亚型内不同的模型性能特征。这种机器学习模型对72小时复诊入院情况表现出了很强的预测性能。尽管由于模型复杂性、结果的罕见性和相关性的变化,临床可操作性有限,但临床检查为进一步纳入变量以提高预测准确性和可操作性提供了指导。