Wang Hairong, Zhang Xingyu
Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Master of Science in Analytics Program, Georgia Institute of Technology, Atlanta, GA 30394, USA.
J Pers Med. 2025 Aug 6;15(8):358. doi: 10.3390/jpm15080358.
: Electrocardiograms (EKGs) are essential tools in emergency medicine, often used to evaluate chest pain, dyspnea, and other symptoms suggestive of cardiac dysfunction. Yet, EKGs are not universally administered to all emergency department (ED) patients. Understanding and predicting which patients receive an EKG may offer insights into clinical decision making, resource allocation, and potential disparities in care. This study examines whether integrating structured clinical data with free-text patient narratives can improve prediction of EKG utilization in the ED. : We conducted a retrospective observational study to predict electrocardiogram (EKG) utilization using data from 13,115 adult emergency department (ED) visits in the nationally representative 2021 National Hospital Ambulatory Medical Care Survey-Emergency Department (NHAMCS-ED), leveraging both structured features-demographics, vital signs, comorbidities, arrival mode, and triage acuity, with the most influential selected via Lasso regression-and unstructured patient narratives transformed into numerical embeddings using Clinical-BERT. Four supervised learning models-Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF) and Extreme Gradient Boosting (XGB)-were trained on three inputs (structured data only, text embeddings only, and a late-fusion combined model); hyperparameters were optimized by grid search with 5-fold cross-validation; performance was evaluated via AUROC, accuracy, sensitivity, specificity and precision; and interpretability was assessed using SHAP values and Permutation Feature Importance. : EKGs were administered in 30.6% of adult ED visits. Patients who received EKGs were more likely to be older, White, Medicare-insured, and to present with abnormal vital signs or higher triage severity. Across all models, the combined data approach yielded superior predictive performance. The SVM and LR achieved the highest area under the ROC curve (AUC = 0.860 and 0.861) when using both structured and unstructured data, compared to 0.772 with structured data alone and 0.823 and 0.822 with unstructured data alone. Similar improvements were observed in accuracy, sensitivity, and specificity. : Integrating structured clinical data with patient narratives significantly enhances the ability to predict EKG utilization in the emergency department. These findings support a personalized medicine framework by demonstrating how multimodal data integration can enable individualized, real-time decision support in the ED.
心电图(EKG)是急诊医学中的重要工具,常用于评估胸痛、呼吸困难以及其他提示心脏功能障碍的症状。然而,并非所有急诊科(ED)患者都会接受心电图检查。了解并预测哪些患者会接受心电图检查,可能有助于洞察临床决策、资源分配以及潜在的护理差异。本研究旨在探讨将结构化临床数据与自由文本患者叙述相结合,是否能提高急诊科心电图使用情况的预测能力。
我们进行了一项回顾性观察研究,利用具有全国代表性的2021年国家医院门诊医疗护理调查 - 急诊科(NHAMCS - ED)中13115例成人急诊科就诊数据来预测心电图(EKG)的使用情况,同时利用结构化特征——人口统计学、生命体征、合并症、就诊方式和分诊 acuity,通过套索回归选择最具影响力的特征——以及使用Clinical - BERT将非结构化患者叙述转化为数值嵌入。四个监督学习模型——逻辑回归(LR)、支持向量机(SVM)、随机森林(RF)和极端梯度提升(XGB)——在三种输入上进行训练(仅结构化数据、仅文本嵌入以及后期融合组合模型);通过5折交叉验证的网格搜索优化超参数;通过AUROC、准确性、敏感性、特异性和精确性评估性能;并使用SHAP值和排列特征重要性评估可解释性。
在30.6%的成人急诊科就诊中进行了心电图检查。接受心电图检查的患者更有可能年龄较大、为白人、有医疗保险,并且生命体征异常或分诊严重程度较高。在所有模型中,组合数据方法产生了更好的预测性能。当同时使用结构化和非结构化数据时,支持向量机(SVM)和逻辑回归(LR)达到了最高的ROC曲线下面积(AUC = 0.860和0.861),相比之下,仅使用结构化数据时为0.772,仅使用非结构化数据时为0.823和0.822。在准确性、敏感性和特异性方面也观察到了类似的改善。
将结构化临床数据与患者叙述相结合,显著增强了预测急诊科心电图使用情况的能力。这些发现通过展示多模态数据整合如何在急诊科实现个性化、实时决策支持,支持了个性化医疗框架。