Wei Junying, Cao Heshan, Peng Mingling, Zhang Yinzhou, Li Sibei, Ma Wuhua, Li Yuhui
First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China.
Department of Neurology, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China.
PLoS One. 2025 Jan 7;20(1):e0316526. doi: 10.1371/journal.pone.0316526. eCollection 2025.
Ventilator-associated pneumonia (VAP) is a common nosocomial infection in ICU, significantly associated with poor outcomes. However, there is currently a lack of reliable and interpretable tools for assessing the risk of in-hospital mortality in VAP patients. This study aims to develop an interpretable machine learning (ML) prediction model to enhance the assessment of in-hospital mortality risk in VAP patients.
This study extracted VAP patient data from versions 2.2 and 3.1 of the MIMIC-IV database, using version 2.2 for model training and validation, and version 3.1 for external testing. Feature selection was conducted using the Boruta algorithm, and 14 ML models were constructed. The optimal model was identified based on the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, and specificity across both validation and test cohorts. SHapley Additive exPlanations (SHAP) analysis was applied for global and local interpretability.
A total of 1,894 VAP patients were included, with 12 features ultimately selected for model construction: 24-hour urine output, blood urea nitrogen, age, diastolic blood pressure, platelet count, anion gap, body temperature, bicarbonate level, sodium level, body mass index, and whether combined with congestive heart failure and cerebrovascular disease. The random forest (RF) model showed the best performance, achieving an AUC of 0.780 in internal validation and 0.724 in external testing, outperforming other ML models and common clinical scoring systems.
The RF model demonstrated robust and reliable performance in predicting in-hospital mortality risk for VAP patients. The developed online tool can assist clinicians in efficiently assessing VAP in-hospital mortality risk, supporting clinical decision-making.
呼吸机相关性肺炎(VAP)是重症监护病房(ICU)常见的医院感染,与不良预后显著相关。然而,目前缺乏可靠且可解释的工具来评估VAP患者的院内死亡风险。本研究旨在开发一种可解释的机器学习(ML)预测模型,以加强对VAP患者院内死亡风险的评估。
本研究从MIMIC-IV数据库的2.2版和3.1版中提取VAP患者数据,使用2.2版进行模型训练和验证,3.1版进行外部测试。采用Boruta算法进行特征选择,并构建了14个ML模型。根据受试者工作特征曲线下面积(AUROC)、准确性、敏感性和特异性,在验证队列和测试队列中确定最佳模型。应用SHapley加性解释(SHAP)分析进行全局和局部可解释性分析。
共纳入1894例VAP患者,最终选择12个特征用于模型构建:24小时尿量、血尿素氮、年龄、舒张压、血小板计数、阴离子间隙、体温、碳酸氢盐水平、钠水平、体重指数,以及是否合并充血性心力衰竭和脑血管疾病。随机森林(RF)模型表现最佳,内部验证的AUC为0.780,外部测试的AUC为0.724,优于其他ML模型和常见的临床评分系统。
RF模型在预测VAP患者院内死亡风险方面表现出稳健可靠的性能。开发的在线工具可协助临床医生有效评估VAP患者的院内死亡风险,支持临床决策。