Suppr超能文献

糖尿病周围神经病变危险因素研究及机器学习预测模型的建立。

Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning.

机构信息

Affiliated Hospital of Nanjing University of Chinese Medicine,Jiangsu Province Hospital of Chinese Medicine, Nanjing, Jiangsu, 210029, China.

School of Artificial Intelligence and Information Technology, Nanjing University of Chinese Medicine, Nanjing, 210023, Jiangsu, China.

出版信息

BMC Med Inform Decis Mak. 2023 Aug 2;23(1):146. doi: 10.1186/s12911-023-02232-1.

Abstract

BACKGROUND

Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials.

METHODS

We retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models' discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model.

RESULTS

The XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors.

CONCLUSIONS

The machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN.

摘要

背景

糖尿病周围神经病变(DPN)是糖尿病的常见并发症。预测发生 DPN 的风险对于临床决策和临床试验设计很重要。

方法

我们回顾性分析了 2020 年至 2022 年在两家中心医院治疗的 1278 例糖尿病患者的数据。数据包括病史、体格检查和生化指标检测结果。经过特征选择和数据平衡后,队列以 7:3 的比例分为训练和内部验证数据集。在基于机器学习的逻辑回归、k 近邻、决策树、朴素贝叶斯、随机森林和极端梯度提升(XGBoost)中进行训练。使用 k 折交叉验证评估模型,采用准确性、精确率、召回率、F1 评分和受试者工作特征曲线下面积(AUC)来验证模型的区分度和临床实用性。使用 SHapley Additive exPlanation(SHAP)解释表现最佳的模型。

结果

XGBoost 模型表现优于其他模型,其准确性为 0·746,精确率为 0·765,召回率为 0·711,F1 评分为 0·736,AUC 为 0·813。SHAP 结果表明,年龄、病程、糖化血红蛋白、胰岛素抵抗指数、24 小时尿蛋白定量和尿蛋白浓度是 DPN 的危险因素,而餐后 2 小时 C 肽与空腹 C 肽(C2/C0)比值、总胆固醇、活化部分凝血活酶时间和肌酐是保护因素。

结论

机器学习方法有助于建立性能良好的 DPN 风险预测模型。该模型确定了与 DPN 最密切相关的因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ec6/10394817/b191e4710f2c/12911_2023_2232_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验