Cui Chen, Mu Fei, Tang Meng, Lin Rui, Wang Mingming, Zhao Xian, Guan Yue, Wang Jingwen
Department of Pharmacy, Xijing Hospital, Fourth Military Medical University, Xi'an, China.
Front Med (Lausanne). 2022 Jul 25;9:942356. doi: 10.3389/fmed.2022.942356. eCollection 2022.
Pseudomonas aeruginosa is a ubiquitous opportunistic bacterial pathogen, which is a leading cause of nosocomial pneumonia. Early identification of the risk factors is urgently needed for severe infection patients with P. aeruginosa. However, no detailed relevant investigation based on machine learning has been reported, and little research has focused on exploring relationships between key risk clinical variables and clinical outcome of patients. In this study, we collected 571 severe infections with P. aeruginosa patients admitted to the Xijing Hospital of the Fourth Military Medical University from January 2010 to July 2021. Basic clinical information, clinical signs and symptoms, laboratory indicators, bacterial culture, and drug related were recorded. Machine learning algorithm of XGBoost was applied to build a model for predicting mortality risk of P. aeruginosa infection in severe patients. The performance of XGBoost model (AUROC = 0.94 ± 0.01, AUPRC = 0.94 ± 0.03) was greater than the performance of support vector machine (AUROC = 0.90 ± 0.03, AUPRC = 0.91 ± 0.02) and random forest (AUROC = 0.93 ± 0.03, AUPRC = 0.89 ± 0.04). This study also aimed to interpret the model and to explore the impact of clinical variables. The interpretation analysis highlighted the effects of age, high-alert drugs, and the number of drug varieties. Further stratification clarified the necessity of different treatment for severe infection for different populations.
铜绿假单胞菌是一种普遍存在的机会性细菌病原体,是医院获得性肺炎的主要病因。对于铜绿假单胞菌严重感染患者,迫切需要尽早识别危险因素。然而,尚未有基于机器学习的详细相关调查报道,且很少有研究聚焦于探索关键风险临床变量与患者临床结局之间的关系。在本研究中,我们收集了2010年1月至2021年7月在第四军医大学西京医院收治的571例铜绿假单胞菌严重感染患者的资料。记录了基本临床信息、临床症状和体征、实验室指标、细菌培养及药物相关信息。应用XGBoost机器学习算法构建了预测严重患者铜绿假单胞菌感染死亡风险的模型。XGBoost模型的性能(曲线下面积[AUC] = 0.94±0.01,精确率-召回率曲线下面积[AUPRC] = 0.94±0.03)优于支持向量机(AUC = 0.90±0.03,AUPRC = 0.91±0.02)和随机森林(AUC = 0.93±0.03,AUPRC = 0.89±0.04)。本研究还旨在解释该模型并探索临床变量的影响。解释分析突出了年龄、高警示药物和药物品种数量的影响。进一步分层明确了针对不同人群的严重感染进行不同治疗的必要性。