Sun Meng, Sun Xingling, Wang Fei, Liu Li
Department of Neurosurgery, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, Shandong, China.
Department of Nursing, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, Shandong, China.
Front Endocrinol (Lausanne). 2025 Jun 5;16:1614657. doi: 10.3389/fendo.2025.1614657. eCollection 2025.
Diabetic peripheral neuropathy (DPN) is a common and debilitating complication of type 2 diabetes mellitus (T2DM), significantly impacting patients' quality of life and increasing healthcare burdens. Early prediction and intervention are critical to mitigating its impact.
This study analyzed 1,544 diabetic patients from the First Affiliated Hospital of Shandong First Medical University, who were randomly divided into a training cohort (n = 1,082) and a testing cohort (n = 462) using a 7:3 split ratio. Feature selection was performed using both Boruta and LASSO algorithms, and the intersection of the selected variables was used as the final predictor set. Eight key predictors were identified from 23 variables, including diabetes duration, uric acid, HbA1c, NLR, smoking status, SCR, LDH, and hypertension. Nine machine learning models were developed and compared for DPN risk prediction.
Stochastic Gradient Boosting (SGBT) demonstrated the best performance (training AUC: 0.933, 95% CI: 0.921-0.946; testing AUC: 0.811, 95% CI: 0.776-0.843). Shapley Additive Explanations (SHAP) analysis provided interpretability, highlighting the clinical importance of diabetes duration and HbA1c among other predictors.
This study establishes a robust predictive tool for early DPN detection, laying the foundation for improved prevention and management strategies.
糖尿病周围神经病变(DPN)是2型糖尿病(T2DM)常见且使人衰弱的并发症,严重影响患者生活质量并增加医疗负担。早期预测和干预对于减轻其影响至关重要。
本研究分析了山东第一医科大学第一附属医院的1544例糖尿病患者,采用7:3的分割比例将其随机分为训练队列(n = 1082)和测试队列(n = 462)。使用Boruta算法和LASSO算法进行特征选择,并将所选变量的交集用作最终预测变量集。从23个变量中确定了8个关键预测因素,包括糖尿病病程、尿酸、糖化血红蛋白、中性粒细胞与淋巴细胞比值、吸烟状况、血清肌酐、乳酸脱氢酶和高血压。开发并比较了9种机器学习模型用于DPN风险预测。
随机梯度提升(SGBT)表现最佳(训练AUC:0.933,95%CI:0.921 - 0.946;测试AUC:0.811,95%CI:0.776 - 0.843)。Shapley值相加解释(SHAP)分析提供了可解释性,突出了糖尿病病程和糖化血红蛋白等预测因素在其他因素中的临床重要性。
本研究建立了一种强大的早期DPN检测预测工具,为改进预防和管理策略奠定了基础。