Faculty of Anesthesiology, Changhai Hospital, Naval Medical University of PLA, Shanghai 200433, China; Department of Anesthesiology, Naval Medical Center, Naval Medical University of PLA, Shanghai 200052, China.
Faculty of Anesthesiology, Changhai Hospital, Naval Medical University of PLA, Shanghai 200433, China.
Comput Methods Programs Biomed. 2023 Nov;241:107772. doi: 10.1016/j.cmpb.2023.107772. Epub 2023 Aug 20.
Interpretable and real-time prediction of sepsis and risk factor analysis could enable timely treatment by clinicians and improve patient outcomes. To develop an interpretable machine-learning model for the prediction and risk factor analysis of sepsis and septic death.
This is a retrospective observational cohort study based on the Medical Information Mart for Intensive Care (MIMIC-IV) dataset; 69,619 patients from the database were screened. The two outcomes include patients diagnosed with sepsis and the death of septic patients. Clinical variables from ICU admission to outcomes were analyzed: demographic data, vital signs, Glasgow Coma Scale scores, laboratory test results, and results for arterial blood gasses (ABGs). Model performance was compared using the area under the receiver operating characteristic curve (AUROC). Model interpretations were based on the Shapley additive explanations (SHAP), and the clustered analysis was based on the combination of K-means and dimensionality reduction algorithms of t-SNE and PCA.
For the analysis of sepsis and septic death, 47,185 and 2480 patients were enrolled, respectively. The XGBoost model achieved a predictive value of area under the curve (AUC): 0.745 [0.731-0.759] for sepsis prediction and 0.8 [0.77, 0.828] for septic death prediction. The real-time prediction model was trained to predict by day and visualize the individual or combined risk factor effects on the outcomes based on SHAP values. Clustered analysis separated the two phenotypes with distinct risk factors among patients with septic death.
The proposed real-time, clustered prediction model for sepsis and septic death exhibited superior performance in predicting the outcomes and visualizing the risk factors in a real-time and interpretable manner to distinguish and mitigate patient risks, thus promising immense potential in effective clinical decision making and comprehensive understanding of complex diseases such as sepsis.
对脓毒症及其危险因素进行可解释和实时预测,可以使临床医生及时进行治疗,改善患者预后。本研究旨在建立一个可解释的机器学习模型,用于脓毒症和脓毒症死亡的预测及危险因素分析。
这是一项基于医疗信息监护数据库(MIMIC-IV)的回顾性观察队列研究,从数据库中筛选出 69619 例患者。本研究的两个结局分别为脓毒症患者和脓毒症死亡患者。分析了从 ICU 入院到结局的临床变量:人口统计学数据、生命体征、格拉斯哥昏迷评分、实验室检查结果和动脉血气分析(ABG)结果。使用受试者工作特征曲线下面积(AUROC)比较模型性能。基于 Shapley 加性解释(SHAP)进行模型解释,聚类分析基于 K-means 与 t-SNE 和 PCA 降维算法的组合。
本研究分别纳入 47185 例和 2480 例患者用于分析脓毒症和脓毒症死亡。XGBoost 模型对脓毒症和脓毒症死亡的预测曲线下面积(AUC)分别为 0.745[0.731-0.759]和 0.8[0.77,0.828]。实时预测模型通过训练可以按日预测,并根据 SHAP 值可视化个体或组合危险因素对结局的影响。聚类分析将死亡患者中具有不同危险因素的两种表型分离。
所提出的脓毒症和脓毒症死亡实时聚类预测模型在预测结局和实时可视化危险因素方面表现出优异的性能,能够区分和减轻患者风险,有望在有效的临床决策和对脓毒症等复杂疾病的全面理解方面具有巨大的潜力。