Suppr超能文献

使用机器学习模型预测艰难梭菌感染心力衰竭患者的28天全因死亡率:来自MIMIC-IV数据库的证据

Prediction of 28-Day All-Cause Mortality in Heart Failure Patients with Clostridioides difficile Infection Using Machine Learning Models: Evidence from the MIMIC-IV Database.

作者信息

Shi Caiping, Jie Qiong, Zhang Hongsong, Zhang Xinying, Chu Weijuan, Chen Chen, Zhang Qian, Hu Zhen

机构信息

School of Mathematics, Hohai University, Nanjing, China.

Department of Pharmacy, Nanjing First Hospital, Nanjing Medical University, Nanjing, China.

出版信息

Cardiology. 2025;150(2):133-144. doi: 10.1159/000540994. Epub 2024 Aug 17.

Abstract

INTRODUCTION

Heart failure (HF) may induce bowel hypoperfusion, leading to hypoxia of the villa of the bowel wall and the occurrence of Clostridioides difficile infection (CDI). However, the risk factors for the development of CDI in HF patients have yet to be fully illustrated, especially because of a lack of evidence from real-world data.

METHODS

Clinical data and survival situations of HF patients with CDI admitted to ICU were extracted from the Medical Information Mart for Intensive Care (MIMIC)-IV database. For developing a model that can predict 28-day all-cause mortality in HF patients with CDI, the Recursive Feature Elimination with Cross-Validation (RFE-CV) method was used for feature selection. And nine machine learning (ML) algorithms, including logistic regression (LR), decision tree, Bayesian, adaptive boosting, random forest (RF), gradient boosting decision tree, XGBoost, light gradient boosting machine, and categorical boosting, were applied for model construction. After training and hyperparameter optimization of the models through grid search 5-fold cross-validation, the performance of models was evaluated by the area under curve (AUC), accuracy, sensitivity, specificity, precision, negative predictive value, and F1 score. Furthermore, the SHapley Additive exPlanations (SHAP) method was used to interpret the optimal model.

RESULTS

A total of 526 HF patients with CDI were included in the study, of whom 99 cases (18.8%) experienced death within 28 days. Eighteen of the 57 variables were selected for the model construction algorithm for model construction. Among the ML models considered, the RF model emerged as the optimal model achieving the accuracy, F1-score, and AUC values of 0.821, 0.596, and 0.864, respectively. The net benefit of the model surpassed other models at 16%-22% threshold probabilities based on decision curve analysis. According to the importance of features in the RF model, red blood cell distribution width, blood urea nitrogen, Simplified Acute Physiology Score II, Sequential Organ Failure Assessment, and white blood cell count were highlighted as the five most influential variables.

CONCLUSIONS

We developed ML models to predict 28-day all-cause mortality in HF patients associated with CDI in the ICU, which are more effective than the conventional LR model. The RF model has the best performance among all the ML models employed. It may be useful to help clinicians identify high-risk HF patients with CDI.

摘要

引言

心力衰竭(HF)可能导致肠道灌注不足,进而引起肠壁绒毛缺氧及艰难梭菌感染(CDI)的发生。然而,HF患者发生CDI的危险因素尚未完全阐明,尤其是缺乏来自真实世界数据的证据。

方法

从重症监护医学信息数据库(MIMIC-IV)中提取入住ICU的合并CDI的HF患者的临床资料和生存情况。为建立一个能够预测合并CDI的HF患者28天全因死亡率的模型,采用带交叉验证的递归特征消除法(RFE-CV)进行特征选择。并应用包括逻辑回归(LR)、决策树、贝叶斯、自适应提升、随机森林(RF)、梯度提升决策树、XGBoost、轻量级梯度提升机和分类提升在内的9种机器学习(ML)算法进行模型构建。通过网格搜索5折交叉验证对模型进行训练和超参数优化后,采用曲线下面积(AUC)、准确率、灵敏度、特异度、精准度、阴性预测值和F1分数对模型性能进行评估。此外,使用SHapley加性解释(SHAP)方法对最优模型进行解释。

结果

本研究共纳入526例合并CDI的HF患者,其中99例(18.8%)在28天内死亡。57个变量中的18个被选入模型构建算法用于模型构建。在所考虑的ML模型中,RF模型成为最优模型,其准确率、F1分数和AUC值分别为0.821、0.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验