Zhu Hongling, Ye Qing, Wang Shurui, Cai Hongsen, Maimaiti Mairihaba, Lai Jinsheng, Qin Chuan, Zhang Ping, Chen Yanyan, Luo Qiushi, Wu Hong, Chen Danyang, Chen Shiling, Zhu Shudan, Lv Yuting, Xu Yanxiang, Zhang Jian, Hu Benshan, Yin Yuanxiang, Xie Yan, Zhu Dongmei, Ming Xiaoxing, Tang Zhouping, Zeng Hesong
Division of Cardiology, Department of Internal Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei 430030, P.R. China.
Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430030, P.R. China.
Research (Wash D C). 2025 Aug 19;8:0817. doi: 10.34133/research.0817. eCollection 2025.
Current tools for predicting the thrombolysis risk in patients after stroke exhibit limited event prediction in early post-thrombolysis hemorrhagic events. This highlights an unmet medical need to improve the tools for stroke management. We developed an explainable 2-stage machine learning model for stroke risk stratification to predict the risk of bleeding, composite complications, and all-cause death in patients before and after thrombolysis therapy. The model integrated LightGBM, XGBoost, random forest model (RF), decision tree model (DT), and logistic regression model (LR), and was trained on data from 5,333 patients from Tongji Hospital, achieving improved predictive accuracy in the post-thrombolysis stage compared to the pre-thrombolysis stage. The model exhibited increased area under the curve (AUC) of 0.7581 [95% confidence interval (CI), 0.6955 to 0.8177] and 0.7234 (0.6527 to 0.7909) (bleeding), 0.7625 (0.7324 to 0.7936) and 0.7035 (0.6685 to 0.7392) (composite complications), and 0.9264 (0.8736 to 0.9660) and 0.845 (0.7454 to 0.9375) (death) in post-thrombolysis stage than in pre-thrombolysis stage. External validation using data of 526 patients across 2 different hospitals confirmed the robustness of the model. Key predictors such as temperature, vital signs, and demographic factors were identified. A prototype embedding the best-performing model was constructed. This model enhances thrombolysis risk prediction and supports personalized patient care management, demonstrating its potential for clinical decision support system integration into stroke management strategies.
目前用于预测中风患者溶栓风险的工具在溶栓后早期出血事件中的事件预测能力有限。这凸显了改善中风管理工具这一未满足的医疗需求。我们开发了一种可解释的两阶段机器学习模型用于中风风险分层,以预测溶栓治疗前后患者的出血风险、复合并发症风险和全因死亡风险。该模型整合了LightGBM、XGBoost、随机森林模型(RF)、决策树模型(DT)和逻辑回归模型(LR),并基于来自同济医院的5333例患者的数据进行训练,与溶栓前阶段相比,在溶栓后阶段实现了更高的预测准确性。该模型在溶栓后阶段的曲线下面积(AUC)有所增加,出血方面分别为0.7581 [95%置信区间(CI),0.6955至0.8177]和0.7234(0.6527至0.7909),复合并发症方面为0.7625(0.7324至0.7936)和0.7035(0.6685至0.7392),死亡方面为0.9264(0.8736至0.9660)和0.845(0.7454至0.9375),均高于溶栓前阶段。使用来自2家不同医院的526例患者的数据进行外部验证证实了该模型的稳健性。识别出了诸如体温、生命体征和人口统计学因素等关键预测因素。构建了一个嵌入性能最佳模型的原型。该模型增强了溶栓风险预测,并支持个性化的患者护理管理,展示了其集成到中风管理策略中的临床决策支持系统的潜力。