Abujaber Ahmad A, Yaseen Said, Nashwan Abdulqadir J, Akhtar Naveed, Imam Yahia
Nursing Department, Hamad Medical Corporation (HMC), Doha, Qatar.
School of Medicine, Jordan University of Science and Technology, Irbid, Jordan.
J Stroke Cerebrovasc Dis. 2025 Feb;34(2):108200. doi: 10.1016/j.jstrokecerebrovasdis.2024.108200. Epub 2024 Dec 12.
Stroke-associated Hospital Acquired Pneumonia (HAP) significantly impacts patient outcomes. This study explores the utility of machine learning models in predicting HAP in stroke patients, leveraging national registry data and SHapley Additive exPlanations (SHAP) analysis to identify key predictive factors.
We collected data from a national stroke registry covering January 2014 to July 2022, including 9,840 patients diagnosed with ischemic and hemorrhagic strokes. Five machine learning models were trained and evaluated: XGBoost, Random Forest, Support Vector Machine (SVM), Logistic Regression, and Artificial Neural Network (ANN). Performance was assessed using accuracy, precision, recall, F1-score, AUC, log loss, and Brier score. SHAP analysis was conducted to interpret model outputs.
The ANN model demonstrated superior performance, with an F1-score of 0.86 and an AUC of 0.94. SHAP analysis identified key predictors: stroke severity, admission location, Glasgow Coma score (GCS), systolic and diastolic blood pressure at admission, ethnicity, stroke type, mode of arrival, and age. Patients with higher stroke severity, dysphagia, and those arriving by ambulance were at increased risk for HAP.
This study enhances our understanding of early predictive factors for HAP in stroke patients and underlines the potential of machine learning to improve clinical decision-making and personalized care.
卒中相关性医院获得性肺炎(HAP)对患者预后有显著影响。本研究利用国家登记数据和夏普利值加法解释(SHAP)分析来确定关键预测因素,探讨机器学习模型在预测卒中患者发生HAP方面的效用。
我们收集了2014年1月至2022年7月全国卒中登记处的数据,包括9840例诊断为缺血性和出血性卒中的患者。训练并评估了五种机器学习模型:XGBoost、随机森林、支持向量机(SVM)、逻辑回归和人工神经网络(ANN)。使用准确率、精确率、召回率、F1分数、AUC、对数损失和布里尔分数评估性能。进行SHAP分析以解释模型输出。
ANN模型表现出卓越的性能,F1分数为0.86,AUC为0.94。SHAP分析确定了关键预测因素:卒中严重程度、入院地点、格拉斯哥昏迷评分(GCS)、入院时的收缩压和舒张压、种族、卒中类型、到达方式和年龄。卒中严重程度较高、吞咽困难以及通过救护车到达的患者发生HAP的风险增加。
本研究增进了我们对卒中患者发生HAP早期预测因素的理解,并强调了机器学习在改善临床决策和个性化医疗方面的潜力。