Cho Younghee, Lee Hyang Kyu, Kim Joungyoun, Yoo Ki-Bong, Choi Jongrim, Lee Yongseok, Choi Mona
College of Nursing, Yonsei University, Seoul, Republic of Korea.
Department of Digital Health, Samsung SDS, Seoul, Republic of Korea.
BMC Infect Dis. 2024 May 2;24(1):466. doi: 10.1186/s12879-024-09358-1.
Hospital-acquired influenza (HAI) is under-recognized despite its high morbidity and poor health outcomes. The early detection of HAI is crucial for curbing its transmission in hospital settings.
This study aimed to investigate factors related to HAI, develop predictive models, and subsequently compare them to identify the best performing machine learning algorithm for predicting the occurrence of HAI.
This retrospective observational study was conducted in 2022 and included 111 HAI and 73,748 non-HAI patients from the 2011-2012 and 2019-2020 influenza seasons. General characteristics, comorbidities, vital signs, laboratory and chest X-ray results, and room information within the electronic medical record were analysed. Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGB), and Artificial Neural Network (ANN) techniques were used to construct the predictive models. Employing randomized allocation, 80% of the dataset constituted the training set, and the remaining 20% comprised the test set. The performance of the developed models was assessed using metrics such as the area under the receiver operating characteristic curve (AUC), the count of false negatives (FN), and the determination of feature importance.
Patients with HAI demonstrated notable differences in general characteristics, comorbidities, vital signs, laboratory findings, chest X-ray result, and room status compared to non-HAI patients. Among the developed models, the RF model demonstrated the best performance taking into account both the AUC (83.3%) and the occurrence of FN (four). The most influential factors for prediction were staying in double rooms, followed by vital signs and laboratory results.
This study revealed the characteristics of patients with HAI and emphasized the role of ventilation in reducing influenza incidence. These findings can aid hospitals in devising infection prevention strategies, and the application of machine learning-based predictive models especially RF can enable early intervention to mitigate the spread of influenza in healthcare settings.
医院获得性流感(HAI)尽管发病率高且健康结局不佳,但仍未得到充分认识。早期发现医院获得性流感对于遏制其在医院环境中的传播至关重要。
本研究旨在调查与医院获得性流感相关的因素,开发预测模型,随后对其进行比较,以确定预测医院获得性流感发生的最佳性能机器学习算法。
这项回顾性观察研究于2022年进行,纳入了2011 - 2012年和2019 - 2020年流感季节的111例医院获得性流感患者和73748例非医院获得性流感患者。分析了电子病历中的一般特征、合并症、生命体征、实验室和胸部X线检查结果以及病房信息。使用逻辑回归(LR)、随机森林(RF)、极端梯度提升(XGB)和人工神经网络(ANN)技术构建预测模型。采用随机分配,80%的数据集构成训练集,其余20%构成测试集。使用受试者操作特征曲线下面积(AUC)、假阴性计数(FN)和特征重要性测定等指标评估所开发模型的性能。
与非医院获得性流感患者相比,医院获得性流感患者在一般特征、合并症、生命体征、实验室检查结果、胸部X线检查结果和病房状况方面存在显著差异。在所开发的模型中,考虑到AUC(83.3%)和FN的发生情况(4例),随机森林模型表现最佳。预测的最有影响因素是住在双人间,其次是生命体征和实验室检查结果。
本研究揭示了医院获得性流感患者的特征,并强调了通风在降低流感发病率中的作用。这些发现可帮助医院制定感染预防策略,基于机器学习的预测模型尤其是随机森林的应用能够实现早期干预,以减轻流感在医疗环境中的传播。