Ma Jiangping, Xie Yuanjie
Department of Neurology, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Department of Neurology, Shanghai Tenth People's Hospital, Tongji University School of Medicine, Shanghai, China.
J Neuroeng Rehabil. 2025 Feb 1;22(1):19. doi: 10.1186/s12984-025-01548-5.
This study aimed to develop and validate a machine learning-based predictive model for gait recovery in patients with acute anterior circulation ischemic stroke.
Between May and November 2023, 237 patients with acute anterior circulation ischemic stroke were enrolled. Patients were randomly divided into training and validation sets at a 7:3 ratio. Thirty-one medical characteristics were collected, and the Least Absolute Shrinkage and Selection Operator (LASSO) regression was applied to screen predictor variables. Predictive models were developed using the Random Survival Forest (RSF) and COX regression methods. The optimal model was identified based on C-index values. The SHapley Additive exPlanations (SHAP) method was employed to interpret the RSF model globally and locally.
Ten predictors were identified through LASSO regression, including age, gender, periventricular white matter hyperintensities (PVWMH), Montreal Cognitive Assessment (MoCA), National Institutes of Health Stroke Scale (NIHSS), enlarged perivascular spaces in basal ganglia (BG-EPVS), lacunes, parietal infarction, basal ganglia infarction, and Timed Up & Go (TUG) test score. The C-index values of the COX regression and RSF models were 0.741 and 0.761 in the training set and 0.705 and 0.725 in the validation set, respectively. SHAP analysis of the RSF model identified BG-EPVS, TUG, MoCA, age, and PVWMH as the top five most influential predictors of gait recovery.
The RSF model demonstrated superior performance to the COX regression model in predicting gait recovery, offering a reliable tool for clinical decision-making regarding stroke patients' prognoses.
本研究旨在开发并验证一种基于机器学习的急性前循环缺血性脑卒中患者步态恢复预测模型。
2023年5月至11月,纳入237例急性前循环缺血性脑卒中患者。患者按7:3的比例随机分为训练集和验证集。收集31项医学特征,并应用最小绝对收缩和选择算子(LASSO)回归筛选预测变量。使用随机生存森林(RSF)和COX回归方法开发预测模型。根据C指数值确定最佳模型。采用SHapley加性解释(SHAP)方法对RSF模型进行全局和局部解释。
通过LASSO回归确定了10个预测因素,包括年龄、性别、脑室周围白质高信号(PVWMH)、蒙特利尔认知评估(MoCA)、美国国立卫生研究院卒中量表(NIHSS)、基底节区血管周围间隙增宽(BG-EPVS)、腔隙、顶叶梗死、基底节梗死和计时起立行走(TUG)测试得分。COX回归模型和RSF模型在训练集的C指数值分别为0.741和0.761,在验证集的C指数值分别为0.705和0.725。RSF模型的SHAP分析确定BG-EPVS、TUG、MoCA、年龄和PVWMH是步态恢复的前五个最有影响力的预测因素。
在预测步态恢复方面,RSF模型表现优于COX回归模型,为中风患者预后的临床决策提供了可靠工具。