Yasin Parhat, Ding Liwen, Mamat Mardan, Guo Wei, Song Xinghua
Department of Spine Surgery, The Sixth Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang, 830000, People's Republic of China.
College of Pediatrics, Xinjiang Medical University, Urumqi, Xinjiang, 830000, People's Republic of China.
Infect Drug Resist. 2025 May 31;18:2797-2821. doi: 10.2147/IDR.S520062. eCollection 2025.
Tuberculosis spondylitis (TS), also known as Pott's disease, is the most common destructive form of musculoskeletal tuberculosis and poses significant clinical challenges, particularly when complicated by osteoporosis. Osteoporosis exacerbates surgical outcomes and increases the risk of complications, making its accurate prediction crucial for effective patient management.
This retrospective study included 906 TS patients from two medical centers between January 2016 and November 2022. We collected demographic information and blood test data from routine examinations. To address class imbalance, the synthetic minority oversampling technique (SMOTE) was applied. Feature selection was performed using LASSO, Boruta, and Recursive Feature Elimination (RFE) to identify key predictors of osteoporosis. Multiple machine learning (ML) algorithms, including logistic regression, random forest, and XGBoost, were trained and optimized using nested cross-validation and hyperparameter tuning. The optimal model was further refined through threshold tuning to enhance performance metrics. Model interpretability was achieved using SHapley Additive exPlanations (SHAP), and an online web application was developed for real-time clinical use.
Out of 906 patients, 60 were diagnosed with osteoporosis based on Dual-energy X-ray absorptiometry (DXA) measurements. Feature selection identified hemoglobin (HB), estimated glomerular filtration rate (eGFR), and cystatin C (CYS_C) as significant predictors. The logistic regression model exhibited the highest performance with an area under the receiver operating characteristic curve (AUC) of 0.826, which was externally validated with an AUC of 0.796. Threshold tuning optimized the decision threshold to 0.32, improving the F1-score and balancing sensitivity and specificity. SHAP analysis highlighted the critical roles of HB, eGFR, and CYS_C in osteoporosis prediction. The developed web application facilitates the model's integration into clinical workflows, enabling healthcare professionals to make informed decisions at the bedside.
This study successfully developed and validated an ML-based tool for predicting osteoporosis in TS patients using readily available clinical data. The model demonstrated robust predictive performance and was effectively integrated into a user-friendly online application, offering a practical solution to enhance surgical decision-making and improve patient outcomes in real-time clinical settings.
脊柱结核(TS),也称为波特氏病,是肌肉骨骼结核最常见的破坏性形式,带来了重大的临床挑战,尤其是在合并骨质疏松症时。骨质疏松症会加剧手术结果并增加并发症风险,因此准确预测对于有效的患者管理至关重要。
这项回顾性研究纳入了2016年1月至2022年11月期间来自两个医疗中心的906例TS患者。我们从常规检查中收集了人口统计学信息和血液检测数据。为了解决类别不平衡问题,应用了合成少数过采样技术(SMOTE)。使用套索回归(LASSO)、博鲁塔算法(Boruta)和递归特征消除法(RFE)进行特征选择,以识别骨质疏松症的关键预测因素。使用逻辑回归、随机森林和XGBoost等多种机器学习(ML)算法,通过嵌套交叉验证和超参数调整进行训练和优化。通过阈值调整进一步优化最佳模型,以提高性能指标。使用夏普利值加法解释(SHAP)实现模型可解释性,并开发了一个在线网络应用程序以供临床实时使用。
在906例患者中,根据双能X线吸收法(DXA)测量,有60例被诊断为骨质疏松症。特征选择确定血红蛋白(HB)、估计肾小球滤过率(eGFR)和胱抑素C(CYS_C)为重要预测因素。逻辑回归模型表现出最高性能,受试者工作特征曲线(AUC)下面积为0.826,外部验证的AUC为0.796。阈值调整将决策阈值优化至0.32,提高了F1分数并平衡了敏感性和特异性。SHAP分析突出了HB、eGFR和CYS_C在骨质疏松症预测中的关键作用。开发的网络应用程序有助于将该模型整合到临床工作流程中,使医疗保健专业人员能够在床边做出明智的决策。
本研究成功开发并验证了一种基于机器学习的工具,可利用现成的临床数据预测TS患者的骨质疏松症。该模型表现出强大的预测性能,并有效地集成到一个用户友好的在线应用程序中,为在实时临床环境中加强手术决策和改善患者预后提供了一个实用的解决方案。