Xie Puguang, Yang Cheng, Yang Gangyi, Jiang Youzhao, He Min, Jiang Xiaoyan, Chen Yan, Deng Liling, Wang Min, Armstrong David G, Ma Yu, Deng Wuquan
Department of Endocrinology and Bioengineering College, Chongqing University Central Hospital, Chongqing Emergency Medical Centre, Chongqing University, NO. 1 Jiankang Road, Yuzhong District, Chongqing, 400014, China.
Department of Endocrinology, The Second Affiliated Hospital, Chongqing Medical University, Chongqing, 400010, China.
Diabetol Metab Syndr. 2023 Mar 11;15(1):44. doi: 10.1186/s13098-023-01020-1.
Experiencing a hyperglycaemic crisis is associated with a short- and long-term increased risk of mortality. We aimed to develop an explainable machine learning model for predicting 3-year mortality and providing individualized risk factor assessment of patients with hyperglycaemic crisis after admission.
Based on five representative machine learning algorithms, we trained prediction models on data from patients with hyperglycaemic crisis admitted to two tertiary hospitals between 2016 and 2020. The models were internally validated by tenfold cross-validation and externally validated using previously unseen data from two other tertiary hospitals. A SHapley Additive exPlanations algorithm was used to interpret the predictions of the best performing model, and the relative importance of the features in the model was compared with the traditional statistical test results.
A total of 337 patients with hyperglycaemic crisis were enrolled in the study, 3-year mortality was 13.6% (46 patients). 257 patients were used to train the models, and 80 patients were used for model validation. The Light Gradient Boosting Machine model performed best across testing cohorts (area under the ROC curve 0.89 [95% CI 0.77-0.97]). Advanced age, higher blood glucose and blood urea nitrogen were the three most important predictors for increased mortality.
The developed explainable model can provide estimates of the mortality and visual contribution of the features to the prediction for an individual patient with hyperglycaemic crisis. Advanced age, metabolic disorders, and impaired renal and cardiac function were important factors that predicted non-survival.
ChiCTR1800015981, 2018/05/04.
经历高血糖危象与短期和长期死亡风险增加相关。我们旨在开发一种可解释的机器学习模型,用于预测3年死亡率,并对入院后发生高血糖危象的患者进行个体化风险因素评估。
基于五种代表性机器学习算法,我们使用2016年至2020年期间两所三级医院收治的高血糖危象患者的数据训练预测模型。模型通过十折交叉验证进行内部验证,并使用另外两所三级医院的未见数据进行外部验证。使用SHapley加法解释算法来解释表现最佳模型的预测结果,并将模型中特征的相对重要性与传统统计测试结果进行比较。
本研究共纳入337例高血糖危象患者,3年死亡率为13.6%(46例患者)。257例患者用于训练模型,80例患者用于模型验证。在测试队列中,轻梯度提升机模型表现最佳(ROC曲线下面积为0.89 [95% CI 0.77 - 0.97])。高龄、较高的血糖和血尿素氮是死亡率增加的三个最重要预测因素。
所开发的可解释模型可以为高血糖危象个体患者提供死亡率估计以及特征对预测的直观贡献。高龄、代谢紊乱以及肾和心功能受损是预测死亡的重要因素。
ChiCTR1800015981,2018年5月4日。