Suppr超能文献

构建一个筛查模型,以识别入院时医院获得性流感高危患者。

Constructing a screening model to identify patients at high risk of hospital-acquired influenza on admission to hospital.

作者信息

Zhang Shangshu, Li Peng, Qiao Bo, Qin Hongying, Wu Zhenzhen, Guo Leilei

机构信息

Department of Disease Prevention and Control, Zhengzhou Central Hospital Affiliated to Zhengzhou University, Zhengzhou, Henan, China.

Department of Hospital Infection Control, Henan Provincial People's Hospital, People's Hospital of Zhengzhou University, Zhengzhou, China.

出版信息

Front Public Health. 2025 Apr 16;13:1495794. doi: 10.3389/fpubh.2025.1495794. eCollection 2025.

Abstract

OBJECTIVE

To develop a machine learning (ML)-based admission screening model for hospital-acquired (HA) influenza using routinely available data to support early clinical intervention.

METHODS

The study focused on hospitalized patients from January 2021 to May 2024. The case group consisted of patients with HA influenza, while the control group comprised non-HA influenza patients admitted to the same ward in the HA influenza unit within 2 weeks. The 953 subjects were divided into the training set and the validation set in a 7:3 ratio. Feature screening was performed using least absolute shrinkage and selection operator (LASSO) and the Boruta algorithm. Subsequently eight ML algorithms were applied to analyze and identify the optimal model using a 5-fold cross-validation methodology. And the area under the curve (AUC), area under the precision-recall curve (AP), F1 score, calibration curve and decision curve analysis (DCA) were applied to comprehensively assess the predictive effectiveness of the selected models. Feature factors were selected and feature importance's were assessed using SHapley's additive interpretation (SHAP). Furthermore, an interactive web-based platform was additionally developed to visualize and demonstrate the predictive model.

RESULTS

Age, pneumonia on admission, Chronic renal failure, Malignant tumor, hypoproteinemia, glucocorticoid use, admission to ICU, lymphopenia, BMI were identified as key variables. For the eight ML algorithms, ROC values ranging from 0.548 to 0.812 were observed in the validation set. A comprehensive analysis showed that the XGBoost model predicted the highest accuracy (AUC: 0.812) with an F1 score of 0.590 and the highest A value (0.655). Evaluating the optimal model, the AUC values were 0.995, 0.826, and 0.781 for the training, validation and test sets. The XGBoost model showed strong robust. SHapley's additive interpretation (SHAP) was utilized to analyze the contribution of explanatory variables to the model and their correlation with HA influenza. In addition, we developed a practical online prediction tool to calculate the risk of HA influenza occurrence.

CONCLUSION

Based on the routine data, the XGBoost model demonstrated excellent calibration among all ML algorithms and accurately predicted the risk of HA influenza, thereby serving as an effective tool for early screening of HA influenza.

摘要

目的

利用常规可得数据开发一种基于机器学习(ML)的医院获得性(HA)流感入院筛查模型,以支持早期临床干预。

方法

该研究聚焦于2021年1月至2024年5月期间的住院患者。病例组由HA流感患者组成,对照组包括在HA流感病房同一病房2周内入院的非HA流感患者。953名受试者按7:3的比例分为训练集和验证集。使用最小绝对收缩和选择算子(LASSO)及博鲁塔算法进行特征筛选。随后应用八种ML算法,采用五折交叉验证方法分析并确定最优模型。并应用曲线下面积(AUC)、精确召回率曲线下面积(AP)、F1分数、校准曲线和决策曲线分析(DCA)全面评估所选模型的预测有效性。使用夏普利加法解释(SHAP)选择特征因素并评估特征重要性。此外,还额外开发了一个基于网络的交互式平台,以可视化和展示预测模型。

结果

年龄、入院时肺炎、慢性肾衰竭、恶性肿瘤、低蛋白血症、糖皮质激素使用、入住重症监护病房、淋巴细胞减少、体重指数被确定为关键变量。对于八种ML算法,在验证集中观察到的ROC值范围为0.548至0.812。综合分析表明,XGBoost模型预测准确率最高(AUC:0.812),F1分数为0.590,A值最高(0.655)。评估最优模型时,训练集、验证集和测试集的AUC值分别为0.995、0.826和0.781。XGBoost模型表现出很强的稳健性。利用夏普利加法解释(SHAP)分析解释变量对模型的贡献及其与HA流感的相关性。此外,我们开发了一种实用的在线预测工具来计算HA流感发生的风险。

结论

基于常规数据,XGBoost模型在所有ML算法中表现出出色的校准能力,能够准确预测HA流感的风险,从而成为HA流感早期筛查的有效工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64f7/12041216/552d229a0b53/fpubh-13-1495794-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验