Qiu Weizhi, Cui Penglei, Li Shaojie, Tang Zhenzhou, Chen Jiani, Wang Jiayin, Li Yasong
Department of Neurosurgery, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, 362000, Fujian, People's Republic of China.
Sci Rep. 2025 Jul 10;15(1):24932. doi: 10.1038/s41598-025-10905-2.
Lower extremity deep vein thrombosis is one of the important complications of spontaneous intracerebral hemorrhage. We aimed to develop a risk assessment model to predict the risk of lower extremity DVT during hospitalization in patients with spontaneous cerebral hemorrhage. The retrospective study began by randomly dividing the data into a training set and a test set in a 7:3 ratio. Feature selection was performed in the training set, and Boruta and LASSO algorithms were used to screen significant predictors. Five machine learning algorithms were used to construct the prediction model and the model accuracy was evaluated by ROC curves. To validate the model, we constructed calibration curves and compared the calibration of the model using the Brier score. Finally, the clinical value of the model was assessed by Decision Clinical Curve (DCA) and the "black box" model was interpreted by SHAP. The training and test sets did not show significant differences between the individual variables. Screening by the LASSO and Boruta algorithms yielded 15 and 7 potentially relevant variables, respectively, resulting in the identification of six significant predictors associated with DVT. Subsequently, the performance of five machine learning algorithms in DVT prediction was evaluated in the test set. These results suggest that the LGBM model has significant advantages in predicting DVT after cerebral hemorrhage. We developed a model to predict the risk of lower extremity deep vein thrombosis during hospitalization in patients with spontaneous cerebral hemorrhage, and this model can accurately identify high-risk patients.
下肢深静脉血栓形成是自发性脑出血的重要并发症之一。我们旨在建立一个风险评估模型,以预测自发性脑出血患者住院期间发生下肢深静脉血栓形成的风险。回顾性研究首先将数据按7:3的比例随机分为训练集和测试集。在训练集中进行特征选择,使用Boruta和LASSO算法筛选显著预测因子。使用五种机器学习算法构建预测模型,并通过ROC曲线评估模型准确性。为验证模型,我们构建校准曲线并使用Brier评分比较模型的校准情况。最后,通过决策临床曲线(DCA)评估模型的临床价值,并使用SHAP对“黑箱”模型进行解释。训练集和测试集在个体变量之间未显示出显著差异。通过LASSO和Boruta算法筛选分别产生了15个和7个潜在相关变量,从而确定了6个与深静脉血栓形成相关的显著预测因子。随后,在测试集中评估了五种机器学习算法在深静脉血栓形成预测中的性能。这些结果表明,LGBM模型在预测脑出血后深静脉血栓形成方面具有显著优势。我们建立了一个模型来预测自发性脑出血患者住院期间发生下肢深静脉血栓形成的风险,该模型可以准确识别高危患者。