Kitagawa Kyota, Maki Satoshi, Furuya Takeo, Shiratani Yuki, Nagashima Yuki, Maruyama Juntaro, Toki Yasunori, Iwata Shuhei, Inoue Masahiro, Shiga Yasuhiro, Inage Kazuhide, Orita Sumihisa, Ohtori Seiji
Department of Orthopaedic Surgery, Graduate School of Medicine, Chiba University, Chiba, Japan.
Department of Orthopaedic Surgery, Graduate School of Medicine, Chiba University, Chiba, Japan; Center for Frontier Medical Engineering, Chiba University, Chiba, Japan.
Spine J. 2025 Jan 31. doi: 10.1016/j.spinee.2025.01.005.
Spinal cord injury (SCI) is a devastating condition with profound physical, psychological, and socioeconomic consequences. Despite advances in SCI treatment, accurately predicting functional recovery remains a significant challenge. Conventional prognostic methods often fall short in capturing the complex interplay of factors influencing SCI outcomes. There is an urgent demand for more precise and comprehensive prognostic tools that can guide clinical decision-making and improve patient care in SCI.
This study aims to develop and validate a machine learning (ML) model for predicting American Spinal Injury Association (ASIA) Impairment Scale (AIS) at discharge in SCI patients. We also aim to convert this model into an open-access web application.
STUDY DESIGN/SETTING: This was a retrospective cohort study enrolling traumatic SCI patients from 1991 to 2015, analyzed in 2023. Data were obtained from the Japan Rehabilitation Database (JARD), a comprehensive nationwide database that includes SCI patients from specialized SCI centers and rehabilitation hospitals across Japan.
4,108 SCI cases from JARD were reviewed, excluding 405 cases, patients caused by nontraumatic injuries, patients who were graded as AIS E at admission, and patients without data of AIS at discharge, resulting in 3,703 cases being included in the study. Patient demographics and specific SCI injury characteristics at admission were utilized for model training and prediction.
Model performance was evaluated based on R, accuracy, and the weighted Kappa coefficient. Shapley additive explanations (SHAP) values highlighted significant features influencing the model's output.
The primary outcome was AIS at discharge, treated as a continuous variable (0-4) to capture the ordinal nature and clinical significance of potential misclassifications. Data preprocessing included multicollinearity removal, feature selection using the Boruta algorithm, and iterative imputation for missing data. The dataset was split using the hold-out method with a 7:3 ratio resulting in 2,592 cases for training and 1,111 cases for testing the regression model. A best performing model was defined as the highest R using PyCaret's automated model comparison. Final predictions of regression model were discretized to the original AIS categories for clinical interpretation.
The Gradient Boosting Regressor (GBR) was identified as the optimal model. The GBR model showed an R² of 0.869, accuracy of 0.814, and weighted Kappa of 0.940. Eleven key variables, including AIS at admission, the day from injury to admission, and the motor score of L3, were identified as significant based on SHAP values. This model was then adapted into a web application via Streamlit.
We developed a high-accuracy ML model for predicting the AIS at discharge, which effectively captures the ordinal nature of the AIS scale, using 11 key variables. This model demonstrated its performance to provide reliable prognostic information. The model has been integrated into a user-friendly, open-access web application (http://3.138.174.54:8502/). This tool has the potential to aid in resource allocation and enhance treatment for each patient.
脊髓损伤(SCI)是一种具有严重身体、心理和社会经济后果的灾难性疾病。尽管SCI治疗取得了进展,但准确预测功能恢复仍然是一项重大挑战。传统的预后方法在捕捉影响SCI结果的因素的复杂相互作用方面往往存在不足。迫切需要更精确和全面的预后工具,以指导临床决策并改善SCI患者的护理。
本研究旨在开发并验证一种用于预测SCI患者出院时美国脊髓损伤协会(ASIA)损伤量表(AIS)的机器学习(ML)模型。我们还旨在将该模型转换为一个开放获取的网络应用程序。
研究设计/设置:这是一项回顾性队列研究,纳入了1991年至2015年的创伤性SCI患者,并于2023年进行分析。数据来自日本康复数据库(JARD),这是一个全国性的综合数据库,包括来自日本各地专门的SCI中心和康复医院的SCI患者。
对JARD中的4108例SCI病例进行了审查,排除了405例由非创伤性损伤引起的患者、入院时评定为AIS E级的患者以及出院时没有AIS数据的患者,最终有3703例病例纳入研究。患者的人口统计学特征和入院时特定的SCI损伤特征用于模型训练和预测。
基于R、准确率和加权Kappa系数评估模型性能。夏普利加法解释(SHAP)值突出了影响模型输出的显著特征。
主要结局是出院时的AIS,将其视为连续变量(0 - 4)以捕捉潜在错误分类的顺序性质和临床意义。数据预处理包括去除多重共线性、使用Boruta算法进行特征选择以及对缺失数据进行迭代插补。使用留出法以7:3的比例分割数据集,得到2592例用于训练回归模型的病例和1111例用于测试回归模型的病例。使用PyCaret的自动模型比较将表现最佳的模型定义为具有最高R值的模型。回归模型的最终预测结果被离散化为原始的AIS类别以便于临床解释。
梯度提升回归器(GBR)被确定为最优模型。GBR模型的R²为0.869,准确率为0.814,加权Kappa为0.940。基于SHAP值,确定了11个关键变量,包括入院时的AIS、受伤至入院的天数以及L3的运动评分。然后通过Streamlit将该模型改编为网络应用程序。
我们开发了一种高精度的ML模型,用于预测出院时的AIS,该模型使用11个关键变量有效地捕捉了AIS量表的顺序性质。该模型展示了其提供可靠预后信息的性能。该模型已集成到一个用户友好的、开放获取的网络应用程序(http://3.138.174.54:8502/)中。这个工具有可能有助于资源分配并加强对每个患者的治疗。