Identification and Validation of an Explainable Prediction Model of Sepsis in Patients With Intracerebral Hemorrhage: Multicenter Retrospective Study.

Author information

Liu Xianglin, Huang Zhihua, Guo Yizhi, Li Yandeng, Zhu Jianming, Wen Jun, Gao Yunchun, Liu Jianyi

Affiliations

Changde Hospital, Xiangya School of Medicine, Central South University (The First People's Hospital of Changde City), Changde, China.

Publication information

J Med Internet Res. 2025 Apr 28;27:e71413. doi: 10.2196/71413.

DOI:10.2196/71413

PMID:40293793

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12070006/

Abstract

BACKGROUND

Sepsis is a life-threatening condition frequently observed in patients with intracerebral hemorrhage (ICH) who are critically ill. Early and accurate identification and prediction of sepsis are crucial. Machine learning (ML)-based predictive models exhibit promising sepsis prediction capabilities in emergency settings. However, their application in predicting sepsis among patients with ICH is still limited.

OBJECTIVE

This study aimed to develop an ML-driven risk calculator for the early prediction of sepsis in patients with ICH who are critically ill, and to quantify feature importance and explain the model using the Shapley Additive Explanations method.
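As a rough illustration of what a Shapley-based explanation attributes to each feature, the sketch below computes exact Shapley values by brute-force enumeration for a toy risk function. Everything in it is an assumption made for illustration: the function `toy_risk`, its three inputs, and its coefficients are hypothetical and are not the features or the model reported in this study, and brute-force enumeration is not how Shapley Additive Explanations are computed for tree ensembles in practice.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for instance x relative to a baseline instance."""
    n = len(x)
    phi = [0.0] * n

    def value(subset):
        # Features in `subset` come from x; all others come from the baseline.
        mixed = [x[i] if i in subset else baseline[i] for i in range(n)]
        return predict(mixed)

    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_risk(v):
    """Hypothetical risk score with one interaction term (illustration only)."""
    ventilation, gcs_deficit, wbc_excess = v
    return (0.10 + 0.30 * ventilation + 0.02 * gcs_deficit
            + 0.01 * wbc_excess + 0.05 * ventilation * gcs_deficit)


patient = [1, 5, 4]      # hypothetical patient
reference = [0, 0, 0]    # hypothetical reference patient
phi = shapley_values(toy_risk, patient, reference)
```

The attributions sum to the difference between the patient's score and the reference score, and the interaction term is split equally between the two features involved.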

METHODS

Patients with ICH admitted to the intensive care unit (ICU) between 2008 and 2022 were identified in the Medical Information Mart for Intensive Care IV database and divided into training and internal test sets. External testing used the eICU Collaborative Research Database, which includes over 200,000 ICU admissions across the United States between 2014 and 2015. Sepsis after ICU admission was defined according to the Sepsis-3 criteria: an increase of ≥2 points in the Sequential Organ Failure Assessment (SOFA) score together with suspected infection. The Boruta algorithm was used for feature selection and confirmed 29 features. Nine ML algorithms were used to construct prediction models, and their performance was compared using several evaluation metrics, including the area under the receiver operating characteristic curve (AUC). The Shapley Additive Explanations technique was used to interpret the final model, and a web-based risk calculator was built for clinical use.
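The outcome definition above (a rise of at least 2 points in the Sequential Organ Failure Assessment score together with suspected infection) can be sketched as a simple labeling rule. This is a minimal sketch under stated assumptions: the `IcuStay` record and its field names are hypothetical, and the abstract does not say how the baseline score or the time window around suspected infection were determined.

```python
from dataclasses import dataclass


@dataclass
class IcuStay:
    """Hypothetical per-stay summary; field names are illustrative only."""
    baseline_sofa: int          # assumed score before the infection window
    max_sofa: int               # assumed highest score within the window
    suspected_infection: bool   # assumed flag for suspected infection


def meets_sepsis3(stay: IcuStay, min_increase: int = 2) -> bool:
    """Label a stay as sepsis if the score rose by >= min_increase with suspected infection."""
    rise = stay.max_sofa - stay.baseline_sofa
    return stay.suspected_infection and rise >= min_increase


stays = [
    IcuStay(baseline_sofa=2, max_sofa=5, suspected_infection=True),   # rise 3, infection
    IcuStay(baseline_sofa=2, max_sofa=3, suspected_infection=True),   # rise 1, infection
    IcuStay(baseline_sofa=1, max_sofa=6, suspected_infection=False),  # rise 5, no infection
]
labels = [meets_sepsis3(s) for s in stays]
```

Only the first stay satisfies both conditions; a large score increase without suspected infection does not count.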

RESULTS

Overall, 2414 patients with ICH were enrolled from the Medical Information Mart for Intensive Care IV database, with 1689 and 725 patients assigned to the training and internal test sets, respectively. An external test set of 2806 patients with ICH from the eICU database was used. Among the 9 ML models tested, the categorical boosting (CatBoost) model demonstrated the best discriminative ability. After reducing features based on their importance, an explainable final CatBoost model was developed using 8 features. The final model showed good discrimination for sepsis in both the internal (AUC=0.812) and external (AUC=0.771) test sets.
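For readers less familiar with the AUC values reported above, the metric can be read as the probability that a randomly chosen patient with sepsis receives a higher predicted risk than a randomly chosen patient without sepsis. The sketch below computes it directly from that pairwise definition. The labels and scores are made-up illustration data, not study data, and the quadratic loop is chosen for clarity rather than speed.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up example: 1 = sepsis, 0 = no sepsis, with hypothetical predicted risks.
example_labels = [1, 1, 0, 0, 1, 0]
example_scores = [0.9, 0.6, 0.4, 0.7, 0.8, 0.2]
auc = roc_auc(example_labels, example_scores)
```

In this toy example 8 of the 9 positive-negative pairs are ordered correctly, so the AUC is 8/9. A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking.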

CONCLUSIONS

We constructed a web-based risk calculator with 8 features, based on the CatBoost model, to help clinicians identify patients with ICH who are critically ill and at high risk of sepsis.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3c2/12070006/25f2230077f2/jmir_v27i1e71413_fig1.jpg
