Shen Xiaoli, Shang Dongfeng, Sun Weize, Ru Shuyan
Department of Emergency Medicine, Huishan 3rd People's Hospital of Wuxi City, Wuxi, China.
Department of Clinical Laboratory, Huishan 3rd People's Hospital of Wuxi City, Wuxi, China.
PLoS One. 2025 Apr 9;20(4):e0319519. doi: 10.1371/journal.pone.0319519. eCollection 2025.
This study aimed to develop models for predicting the 30-day mortality of sepsis-associated delirium (SAD) by multiple machine learning (ML) algorithms. On the whole, a cohort of 3,197 SAD patients were collected from the Medical Information Mart for Intensive Care (MIMIC)-IV database. Among them, a total of 659 (20.61%) patients died following SAD. The patients who died were about 73.00 (62.00, 82.00) years old and mostly male (56.75%). Recursive feature elimination (RFE) was used to distinguish risk factors. Subsequently, six ML algorithms including artificial neural network (NNET), gradient boosting machine (GBM), adaptive boosting (Ada), random forest (RF), eXtreme Gradient Boosting (XGB) and logistic regression (LR) were employed to establish models to predict the 30-day mortality of SAD. The performance of models was assessed via both discrimination and calibration by cross-validation with 100 resamples. Overall, 10 independent predictors, including Glasgow Coma Scale (GCS), Sequential Organ Failure Assessment (SOFA), anion gap (AG), continuous renal replacement therapy (CRRT), temperature, mean corpuscular hemoglobin concentration (MCHC), vasopressor, blood urea nitrogen (BUN), base excess (BE), and bicarbonate were identified as independent predictors for the 30-day mortality of SAD. The validation cohort demonstrated that all these six models had relatively favorable differentiation, while among them, the GBM model had the highest area under the curve (AUC) of 0.845 (95% Confidence Interval (CI): 0.816, 0.874). Furthermore, the calibration curve of these six models was close to the diagonal line in the validation sets. As for decision curve analysis, the predictive models were clinically useful as well. Based on real-world research, we developed ML models to provide personalized predictions of delirium-related mortality in sepsis patients, potentially enabling clinicians to identify high-risk SAD patients more promptly.
本研究旨在通过多种机器学习(ML)算法开发预测脓毒症相关性谵妄(SAD)30天死亡率的模型。总体而言,从重症监护医学信息数据库(MIMIC-IV)中收集了3197例SAD患者。其中,共有659例(20.61%)患者在发生SAD后死亡。死亡患者年龄约为73.00(62.00,82.00)岁,且大多为男性(56.75%)。采用递归特征消除(RFE)来区分危险因素。随后,使用包括人工神经网络(NNET)、梯度提升机(GBM)、自适应提升(Ada)、随机森林(RF)、极端梯度提升(XGB)和逻辑回归(LR)在内的六种ML算法建立模型,以预测SAD的30天死亡率。通过100次重采样的交叉验证,从区分度和校准度两方面评估模型性能。总体而言,10个独立预测因子,包括格拉斯哥昏迷量表(GCS)、序贯器官衰竭评估(SOFA)、阴离子间隙(AG)、连续性肾脏替代治疗(CRRT)、体温、平均红细胞血红蛋白浓度(MCHC)、血管活性药物、血尿素氮(BUN)、碱剩余(BE)和碳酸氢盐,被确定为SAD 30天死亡率的独立预测因子。验证队列表明,所有这六个模型都具有相对良好的区分度,其中GBM模型的曲线下面积(AUC)最高,为0.845(95%置信区间(CI):0.816,0.874)。此外,这六个模型的校准曲线在验证集中接近对角线。至于决策曲线分析,这些预测模型在临床上也很有用。基于真实世界研究,我们开发了ML模型,以提供脓毒症患者谵妄相关死亡率的个性化预测,有可能使临床医生更迅速地识别高危SAD患者。