Luo X, Cheng Y, Wu C, He J
Department of Military Health Statistics, Naval Medical University, Shanghai 200433, China.
Nan Fang Yi Ke Da Xue Xue Bao. 2023 Jul 20;43(7):1241-1247. doi: 10.12122/j.issn.1673-4254.2023.07.21.
To construct an inherent interpretability machine learning model as an explainable boosting machine model (EBM) for predicting one-year risk of death in patients with severe ischemic stroke.
We randomly divided the data of 2369 eligible patients with severe ischemic stroke in the MIMIC-Ⅳ(2.0) database, who were admitted in ICU in 2008 to 2019, into a training dataset (80%) and a test dataset (20%), and assessed the prognosis of the patients using the EBM model. The prediction performance of the model was evaluated by calculating the area under the receiver operating characteristic (AUC) curve. The calibration curve and Brier score were used to evaluate the degree of calibration of the model, and a decision curve was generated to assess the net clinical benefit.
The EBM model constructed in this study had good discrimination power, calibration and net benefit, with an AUC of 0.857 (95% : 0.831-0.887) for predicting prognosis of severe ischemic stroke. Calibration curve analysis showed that the standard curve of the EBM model was the closest to the ideal curve. Decision curve analysis showed that the model had the greatest net benefit rate at the prediction probability threshold of 0.10 to 0.80. The top 5 independent predictive variables based on the EBM model were age, SOFA score, mean heart rate, mechanical ventilation, and mean respiratory rate, whose significance scores ranged from 0.179 to 0.370.
This EBM model has a good performance for predicting the risk of death within one year in patients with severe ischemic stroke and allows clinicians to better understand the contributing factors of the patients' outcomes through the model interpretability.
构建一种具有内在可解释性的机器学习模型,即可解释增强机器模型(EBM),用于预测重症缺血性脑卒中患者的一年死亡风险。
我们将2008年至2019年入住重症监护病房的MIMIC-Ⅳ(2.0)数据库中2369例符合条件的重症缺血性脑卒中患者的数据随机分为训练数据集(80%)和测试数据集(20%),并使用EBM模型评估患者的预后。通过计算受试者操作特征(AUC)曲线下面积来评估模型的预测性能。校准曲线和Brier评分用于评估模型的校准程度,并生成决策曲线以评估净临床效益。
本研究构建的EBM模型具有良好的区分能力、校准和净效益,预测重症缺血性脑卒中预后的AUC为0.857(95%:0.831-0.887)。校准曲线分析表明,EBM模型的标准曲线最接近理想曲线。决策曲线分析表明,该模型在预测概率阈值为0.10至0.80时净效益率最大。基于EBM模型的前5个独立预测变量为年龄、序贯器官衰竭评估(SOFA)评分、平均心率、机械通气和平均呼吸频率,其显著性评分范围为0.179至0.370。
该EBM模型在预测重症缺血性脑卒中患者一年内死亡风险方面具有良好性能,并通过模型可解释性使临床医生更好地了解患者预后的影响因素。