School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, Shenzhen, Guangdong, 518055, China.
Department of Biomedical Engineering, Hong Kong Polytechnic University, Hong Kong SAR, China.
BMC Med Inform Decis Mak. 2023 Sep 15;23(1):185. doi: 10.1186/s12911-023-02279-0.
This study aimed to construct a mortality model for the risk stratification of intensive care unit (ICU) patients with sepsis by applying a machine learning algorithm.
Adult patients who were diagnosed with sepsis during admission to ICU were extracted from MIMIC-III, MIMIC-IV, eICU, and Zigong databases. MIMIC-III was used for model development and internal validation. The other three databases were used for external validation. Our proposed model was developed based on the Extreme Gradient Boosting (XGBoost) algorithm. The generalizability, discrimination, and validation of our model were evaluated. The Shapley Additive Explanation values were used to interpret our model and analyze the contribution of individual features.
A total of 16,741, 15,532, 22,617, and 1,198 sepsis patients were extracted from the MIMIC-III, MIMIC-IV, eICU, and Zigong databases, respectively. The proposed model had an area under the receiver operating characteristic curve (AUROC) of 0.84 in the internal validation, which outperformed all the traditional scoring systems. In the external validations, the AUROC was 0.87 in the MIMIC-IV database, better than all the traditional scoring systems; the AUROC was 0.83 in the eICU database, higher than the Simplified Acute Physiology Score III and Sequential Organ Failure Assessment (SOFA),equal to 0.83 of the Acute Physiology and Chronic Health Evaluation IV (APACHE-IV), and the AUROC was 0.68 in the Zigong database, higher than those from the systemic inflammatory response syndrome and SOFA. Furthermore, the proposed model showed the best discriminatory and calibrated capabilities and had the best net benefit in each validation.
The proposed algorithm based on XGBoost and SHAP-value feature selection had high performance in predicting the mortality of sepsis patients within 24 h of ICU admission.
本研究旨在应用机器学习算法为 ICU 脓毒症患者构建风险分层的死亡率模型。
从 MIMIC-III、MIMIC-IV、eICU 和 Zigong 数据库中提取入住 ICU 期间被诊断为脓毒症的成年患者。MIMIC-III 用于模型开发和内部验证。其余三个数据库用于外部验证。我们提出的模型是基于极端梯度提升(XGBoost)算法开发的。评估了我们模型的泛化能力、区分能力和验证能力。使用 Shapley 加法解释值来解释我们的模型并分析单个特征的贡献。
从 MIMIC-III、MIMIC-IV、eICU 和 Zigong 数据库中分别提取了 16741、15532、22617 和 1198 例脓毒症患者。该模型在内部验证中的受试者工作特征曲线下面积(AUROC)为 0.84,优于所有传统评分系统。在外部验证中,MIMIC-IV 数据库中的 AUROC 为 0.87,优于所有传统评分系统;eICU 数据库中的 AUROC 为 0.83,高于简化急性生理学评分 III 和序贯器官衰竭评估(SOFA),与急性生理学和慢性健康评估 IV(APACHE-IV)的 0.83 持平,Zigong 数据库中的 AUROC 为 0.68,高于全身炎症反应综合征和 SOFA。此外,该模型在每种验证中均表现出最佳的区分能力和校准能力,且净收益最佳。
基于 XGBoost 和 SHAP 值特征选择的算法在预测 ICU 入住 24 小时内脓毒症患者死亡率方面具有较高的性能。