School of Integrated Traditional Chinese and Western Medicine, Anhui University of Chinese Medicine, 350 Longzihu Road, Xinzhan District, Hefei City, Anhui Province, 230012, China.
Yunnan University of Chinese Medicine, Kunming, 650500, China.
BMC Endocr Disord. 2024 Sep 20;24(1):196. doi: 10.1186/s12902-024-01728-9.
The primary objective of this study was to investigate the risk factors for diabetic peripheral neuropathy (DPN) and to establish an early diagnostic prediction model for its onset, based on clinical data and biochemical indices.
Retrospective data were collected from 1,446 diabetic patients at the First Affiliated Hospital of Anhui University of Chinese Medicine and were split into training and internal validation sets in a 7:3 ratio. Additionally, 360 diabetic patients from the Second Affiliated Hospital were used as an external validation cohort. Feature selection was conducted within the training set, where univariate logistic regression identified variables with a p-value < 0.05, followed by backward elimination to construct the logistic regression model. Concurrently, the random forest algorithm was applied to the training set to identify the top 10 most important features, with hyperparameter optimization performed via grid search combined with cross-validation. Model performance was evaluated using ROC curves, decision curve analysis, and calibration curves. Model fit was assessed using the Hosmer-Lemeshow test, followed by Brier Score evaluation for the random forest model. Ten-fold cross-validation was employed for further validation, and SHAP analysis was conducted to enhance model interpretability.
A nomogram model was developed using logistic regression with key features: limb numbness, limb pain, diabetic retinopathy, diabetic kidney disease, urinary protein, diastolic blood pressure, white blood cell count, HbA1c, and high-density lipoprotein cholesterol. The model achieved AUCs of 0.91, 0.88, and 0.88 for the training, validation, and test sets, respectively, with a mean AUC of 0.902 across 10-fold cross-validation. Hosmer-Lemeshow test results showed p-values of 0.595, 0.418, and 0.126 for the training, validation, and test sets, respectively. The random forest model demonstrated AUCs of 0.95, 0.88, and 0.88 for the training, validation, and test sets, respectively, with a mean AUC of 0.886 across 10-fold cross-validation. The Brier score indicates a good calibration level, with values of 0.104, 0.143, and 0.142 for the training, validation, and test sets, respectively.
The developed nomogram exhibits promise as an effective tool for the diagnosis of diabetic peripheral neuropathy in clinical settings.
本研究旨在探讨糖尿病周围神经病变(DPN)的危险因素,并基于临床数据和生化指标建立其发病的早期诊断预测模型。
回顾性收集了安徽中医药大学第一附属医院 1446 例糖尿病患者的数据,并按 7:3 的比例分为训练集和内部验证集。此外,还从第二附属医院收集了 360 例糖尿病患者作为外部验证队列。在训练集中进行特征选择,单因素逻辑回归确定 p 值<0.05 的变量,然后采用向后消除法构建逻辑回归模型。同时,将随机森林算法应用于训练集,确定前 10 个最重要的特征,并通过网格搜索与交叉验证相结合进行超参数优化。使用 ROC 曲线、决策曲线分析和校准曲线评估模型性能。使用 Hosmer-Lemeshow 检验评估模型拟合度,然后对随机森林模型进行 Brier 评分评估。采用 10 折交叉验证进行进一步验证,并进行 SHAP 分析以增强模型可解释性。
使用逻辑回归建立了一个基于关键特征的列线图模型:四肢麻木、四肢疼痛、糖尿病视网膜病变、糖尿病肾病、尿蛋白、舒张压、白细胞计数、糖化血红蛋白和高密度脂蛋白胆固醇。该模型在训练集、验证集和测试集的 AUC 分别为 0.91、0.88 和 0.88,10 折交叉验证的平均 AUC 为 0.902。Hosmer-Lemeshow 检验结果显示,训练集、验证集和测试集的 p 值分别为 0.595、0.418 和 0.126。随机森林模型在训练集、验证集和测试集的 AUC 分别为 0.95、0.88 和 0.88,10 折交叉验证的平均 AUC 为 0.886。Brier 评分表明校准水平良好,训练集、验证集和测试集的 Brier 评分为 0.104、0.143 和 0.142。
所开发的列线图模型有望成为临床诊断糖尿病周围神经病变的有效工具。