Development, Validation, and Deployment of a Time-Dependent Machine Learning Model for Predicting One-Year Mortality Risk in Critically Ill Patients with Heart Failure

Authors

Wang Jiuyi, Kang Qingxia, Tian Shiqi, Zhang Shunli, Wang Kai, Feng Guibo

Affiliations

Department of General Medicine, The Affiliated Yongchuan Hospital of Chongqing Medical University, Chongqing 402160, China.

Department of Cardiology, The Affiliated Yongchuan Hospital of Chongqing Medical University, Chongqing 402160, China.

Publication information

Bioengineering (Basel). 2025 May 12;12(5):511. doi: 10.3390/bioengineering12050511.

DOI:10.3390/bioengineering12050511

PMID:40428130

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12108603/

Abstract

Heart failure (HF) is among the leading causes of death worldwide and is particularly prevalent and consequential in intensive care units (ICUs). This study sought to develop, validate, and deploy a time-dependent machine learning model for predicting one-year all-cause mortality risk in ICU patients diagnosed with HF, to support precise prognostic evaluation and risk stratification.

The cohort comprised 8960 ICU patients with HF drawn from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database (version 3.1). Version 3.1 extends version 2.2 (which covers 2008 to 2019) with data from 2020 to 2022; accordingly, data from 2008 to 2019 (n = 5748) were used as the training set and data from 2020 to 2022 (n = 3212) were reserved as the test set. The primary endpoint was one-year all-cause mortality. Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to select predictive features from an initial pool of 64 candidate variables (demographic characteristics, vital signs, comorbidities and complications, therapeutic interventions, routine laboratory data, and disease severity scores). Four predictive models were developed and compared: Cox proportional hazards, random survival forest (RSF), Cox proportional hazards deep neural network (DeepSurv), and eXtreme Gradient Boosting (XGBoost). Model performance was assessed using the concordance index (C-index) and Brier score, and model interpretability was addressed through SHapley Additive exPlanations (SHAP) and time-dependent Survival SHapley Additive exPlanations (SurvSHAP(t)).

The one-year mortality rate in the study population was 46.1%. In the training set, LASSO selected 24 features for the model. In the test set, the XGBoost model showed the best predictive performance, with a C-index of 0.772 and a Brier score of 0.161, outperforming the Cox model (C-index: 0.740, Brier score: 0.175), the RSF model (C-index: 0.747, Brier score: 0.178), and the DeepSurv model (C-index: 0.723, Brier score: 0.183). Decision curve analysis supported the clinical utility of the XGBoost model across a broad range of risk thresholds. Feature importance analysis identified the red cell distribution width-to-albumin ratio (RAR), Charlson Comorbidity Index, Simplified Acute Physiology Score II (SAPS II), Acute Physiology Score III (APS III), and the age-bilirubin-INR-creatinine (ABIC) score as the top five predictive factors. An online risk prediction tool based on this model has been developed and is publicly accessible. The time-dependent XGBoost model demonstrated robust predictive capability for one-year all-cause mortality risk in critically ill HF patients. The model offers a useful tool for early risk identification and can support timely intervention.
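The two performance metrics reported above can be illustrated with a minimal sketch. The abstract does not say how the authors computed them, so the code below is not their implementation. It is a plain-Python illustration of Harrell's C-index and of a simplified Brier score at a fixed horizon. The function names and the toy data are hypothetical. The Brier version simply drops patients censored before the horizon instead of applying inverse-probability-of-censoring weights, which a full survival analysis would normally use.

```python
from itertools import combinations


def harrell_c_index(times, events, risks):
    """Harrell's concordance index for right-censored data.

    times  : observed follow-up time per patient (e.g. days)
    events : 1 if death was observed, 0 if the patient was censored
    risks  : predicted risk score (higher = expected to die sooner)
    Simplification (assumption): pairs with tied follow-up times are skipped.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that `a` is the patient with the shorter time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Comparable only if the shorter time is an observed death.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # tied predictions count as half
    return concordant / comparable


def brier_score_at(horizon, times, events, death_probs):
    """Simplified Brier score at a fixed horizon (no censoring weights).

    death_probs : predicted probability of death by `horizon`
    Patients censored before the horizon have unknown status and are dropped.
    """
    squared_errors = []
    for t, e, p in zip(times, events, death_probs):
        if t <= horizon and e:
            outcome = 1.0  # died within the horizon
        elif t >= horizon:
            outcome = 0.0  # known to be alive at the horizon
        else:
            continue  # censored before the horizon
        squared_errors.append((p - outcome) ** 2)
    return sum(squared_errors) / len(squared_errors)


if __name__ == "__main__":
    # Hypothetical toy cohort: follow-up in days, capped at one year.
    times = [30, 120, 200, 365, 365, 90]
    events = [1, 1, 0, 0, 0, 1]
    death_probs = [0.9, 0.6, 0.5, 0.2, 0.1, 0.55]
    print("C-index:", harrell_c_index(times, events, death_probs))
    print("Brier score at 365 days:", brier_score_at(365, times, events, death_probs))
```

A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, while a lower Brier score indicates better-calibrated, more accurate probabilities. This is the sense in which the XGBoost model's 0.772 and 0.161 compare favorably with the other three models.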

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/174c/12108603/712cce8d06fc/bioengineering-12-00511-g001.jpg

Similar articles

Interpretable machine learning for 28-day all-cause in-hospital mortality prediction in critically ill patients with heart failure combined with hypertension: A retrospective cohort study based on medical information mart for intensive care database-IV and eICU databases.

Front Cardiovasc Med. 2022 Oct 12;9:994359. doi: 10.3389/fcvm.2022.994359. eCollection 2022.

Machine learning-based in-hospital mortality risk prediction tool for intensive care unit patients with heart failure.

Front Cardiovasc Med. 2023 Apr 3;10:1119699. doi: 10.3389/fcvm.2023.1119699. eCollection 2023.

[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.

Machine Learning for the Prediction of Acute Kidney Injury in Critically Ill Patients With Coronary Heart Disease: Algorithm Development and Validation.

JMIR Med Inform. 2025 May 28;13:e72349. doi: 10.2196/72349.

Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.

J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.

Explainable machine learning and online calculators to predict heart failure mortality in intensive care units.

ESC Heart Fail. 2025 Feb;12(1):353-368. doi: 10.1002/ehf2.15062. Epub 2024 Sep 19.

Construction of a random survival forest model based on a machine learning algorithm to predict early recurrence after hepatectomy for adult hepatocellular carcinoma.

BMC Cancer. 2024 Dec 25;24(1):1575. doi: 10.1186/s12885-024-13366-4.

An interpretable machine learning model for predicting 28-day mortality in patients with sepsis-associated liver injury.

PLoS One. 2024 May 20;19(5):e0303469. doi: 10.1371/journal.pone.0303469. eCollection 2024.

Prediction of 28-Day All-Cause Mortality in Heart Failure Patients with Clostridioides difficile Infection Using Machine Learning Models: Evidence from the MIMIC-IV Database.

Cardiology. 2025;150(2):133-144. doi: 10.1159/000540994. Epub 2024 Aug 17.

