Suppr超能文献

用于预测重症监护病房心力衰竭死亡率的可解释机器学习和在线计算器。

Explainable machine learning and online calculators to predict heart failure mortality in intensive care units.

作者信息

Chen An-Tian, Zhang Yuhui, Zhang Jian

机构信息

Department of Cardiology, Fuwai Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College/National Center for Cardiovascular Diseases, Beijing, China.

Heart Failure Center, Fuwai Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College/National Center for Cardiovascular Diseases, Beijing, China.

出版信息

ESC Heart Fail. 2025 Feb;12(1):353-368. doi: 10.1002/ehf2.15062. Epub 2024 Sep 19.

Abstract

AIMS

This study aims to develop explainable machine learning models and clinical tools for predicting mortality in patients in the intensive care unit (ICU) with heart failure (HF).

METHODS

Patients diagnosed with HF who experienced their first ICU stay lasting between 24 h and 28 days were selected from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database. The primary outcome was all-cause mortality within 28 days. Data analysis was performed using Python and R, with feature selection conducted via least absolute shrinkage and selection operator (LASSO) regression. Fifteen models were evaluated, and the most effective model was rendered explainable through the Shapley additive explanations (SHAP) approach. A nomogram was developed based on logistic regression to facilitate interpretation. For external validation, the eICU database was utilized.

RESULTS

After selection, the study included 2343 records, with 1808 surviving and 535 deceased patients. The median age of the study population was 70.00, with ~3/5 males (60.31%). The median length of stay in the ICU was 6.00 days. The median age of the survival group was younger than the non-survival group (69.00 vs. 73.00), and non-survival patients spent longer time in the ICU. Seventy-five features were initially selected, including basic information, vital signs, laboratory tests, haemodynamics and oxygen status. LASSO regression determined the shrinkage parameter α = 0.020, and 44 features were chosen for model construction. The linear discriminant analysis (LDA) model showed the best performance, and the accuracy reached 0.8354 in the training cohort and 0.8563 in the testing cohort. It showed satisfying area under the curve (AUC), recall, precision, F1 score, Cohen's kappa score and Matthew's correlation coefficient. The concordance index (c-index) reached 0.7972 in the training cohort and 0.8125 in the testing cohort. In external validation, the LDA model achieved approximately 0.9 in accuracy, precision, recall and F1 score, with an AUC of 0.79. Univariable analysis was performed in the training cohort. Features that differed significantly between the survival and non-survival groups were subjected to multiple logistic regression. The nomogram built on multiple logistic regression included 14 features and demonstrated excellent performance. The AUC of the nomogram is 0.852 in the training cohort, 0.855 in the internal validation cohort and 0.770 in the external validation cohort. The calibration curve showed good consistency.

CONCLUSIONS

The study developed an LDA and a nomogram model for predicting mortality in HF patients in the ICU. The SHAP approach was employed to elucidate the LDA model, enhancing its utility for clinicians. These models were made accessible online for clinical application.

摘要

目的

本研究旨在开发可解释的机器学习模型和临床工具,用于预测重症监护病房(ICU)中心力衰竭(HF)患者的死亡率。

方法

从重症监护医学信息集市IV(MIMIC-IV)数据库中选取首次入住ICU持续24小时至28天且被诊断为HF的患者。主要结局是28天内的全因死亡率。使用Python和R进行数据分析,通过最小绝对收缩和选择算子(LASSO)回归进行特征选择。评估了15个模型,并通过夏普利加法解释(SHAP)方法使最有效的模型具有可解释性。基于逻辑回归开发了列线图以方便解读。为进行外部验证,使用了eICU数据库。

结果

筛选后,研究纳入2343条记录,其中1808例存活,535例死亡患者。研究人群的中位年龄为70.00岁,约五分之三为男性(60.31%)。在ICU的中位住院时间为6.00天。存活组的中位年龄低于非存活组(69.00对73.00),非存活患者在ICU的停留时间更长。最初选择了75个特征,包括基本信息、生命体征、实验室检查、血流动力学和氧状态。LASSO回归确定收缩参数α = 0.020,并选择44个特征用于模型构建。线性判别分析(LDA)模型表现最佳,在训练队列中的准确率达到0.8354,在测试队列中达到0.8563。其曲线下面积(AUC)、召回率、精确率、F1分数、科恩kappa分数和马修斯相关系数均令人满意。一致性指数(c指数)在训练队列中达到0.7972,在测试队列中达到0.8125。在外部验证中,LDA模型在准确率、精确率、召回率和F1分数方面达到约0.9,AUC为0.79。在训练队列中进行单变量分析。对存活组和非存活组之间有显著差异的特征进行多因素逻辑回归。基于多因素逻辑回归构建的列线图包括14个特征,表现出色。列线图在训练队列中的AUC为0.852,在内部验证队列中为0.855,在外部验证队列中为0.770。校准曲线显示出良好的一致性。

结论

本研究开发了用于预测ICU中HF患者死亡率的LDA模型和列线图。采用SHAP方法阐明LDA模型,提高了其对临床医生的实用性。这些模型可在线获取以供临床应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e56/11769656/f423f33d3912/EHF2-12-353-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验