基于优化梯度提升分类器模型的中西医结合预测糖尿病视网膜病变风险研究

Risk prediction of integrated traditional Chinese and western medicine for diabetes retinopathy based on optimized gradient boosting classifier model.

作者信息

Xiao Li, Tang Lixuan, Kuang Wenxuan, Yang Yijing, Deng Ying, Lu Jing, Peng Qinghua, Yan Junfeng

机构信息

School of Chinese Medicine, Hunan University of Chinese Medicine, Changsha, China.

School of Medicine, Hunan University of Chinese Medicine, Changsha, China.

出版信息

Medicine (Baltimore). 2024 Dec 20;103(51):e40896. doi: 10.1097/MD.0000000000040896.

DOI:10.1097/MD.0000000000040896

PMID:39705459

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11666193/

Abstract

In order to take full advantage of traditional Chinese medicine (TCM) and western medicine, combined with machine learning technology, to study the risk factors and better risk prediction model of diabetic retinopathy (DR), and provide basis for the screening and treatment of it. Through a retrospective study of DR cases in the real world, the electronic medical records of patients who met screening criteria were collected. Moreover, Recursive Feature Elimination with Cross-Validation (RFECV) was used for feature selection. Then, the prediction model was built based on Gradient Boosting Machine (GBM) and it was compared with 4 other popular machine learning techniques, including Logistic Regression (LR), K-Nearest Neighbors (KNN), Random Forest, and Support Vector Machine (SVM). The models were evaluated with accuracy, precision, recall, F1 score, and area under the curve (AUC) value as indicators. In addition, grid search was used to optimize the model. To explain the results of the model more intuitively, the Shapley Additive exPlanation (SHAP) method was used. A total of 9034 type 2 diabetes mellitus (T2DM) patients meeting the screening criteria were included in this study, including 1118 patients with DR. 19 features were selected using RFECV in the model construction. We constructed 5 commonly used models, including GBM, LR, KNN, Random Forest, and SVM. By comparing model performance, GBM has the highest accuracy (0.85) and AUC value (0.934), which is the best prediction model. We also carried out hyperparameter optimization of grid search for this model, and the model accuracy reached 0.88, and the AUC value increased to 0.958. Through SHAP analysis, it was found that TCM syndrome types, albumin, low density lipoprotein, triglyceride, total protein, glycosylated hemoglobin were closely related to the increased risk of DR. It can be concluded that TCM syndrome type is the risk factor of DR. The GBM classifier based on grid search optimization, with relevant risk factors of TCM and western medicine as variables, can better predict the risk of DR.

摘要

为充分利用中医和西医，结合机器学习技术，研究糖尿病视网膜病变（DR）的危险因素及更好的风险预测模型，为其筛查和治疗提供依据。通过对现实世界中DR病例的回顾性研究，收集符合筛查标准患者的电子病历。此外，采用带交叉验证的递归特征消除法（RFECV）进行特征选择。然后，基于梯度提升机（GBM）构建预测模型，并将其与其他4种常用机器学习技术进行比较，包括逻辑回归（LR）、K近邻（KNN）、随机森林和支持向量机（SVM）。以准确率、精确率、召回率、F1分数和曲线下面积（AUC）值为指标对模型进行评估。此外，使用网格搜索对模型进行优化。为更直观地解释模型结果，采用了夏普利值附加解释（SHAP）方法。本研究共纳入9034例符合筛查标准的2型糖尿病（T2DM）患者，其中1118例患有DR。在模型构建中使用RFECV选择了19个特征。我们构建了5种常用模型，包括GBM、LR、KNN、随机森林和SVM。通过比较模型性能，GBM具有最高的准确率（0.85）和AUC值（0.934），是最佳预测模型。我们还对该模型进行了网格搜索的超参数优化，模型准确率达到0.88，AUC值增至0.958。通过SHAP分析发现，中医证型、白蛋白、低密度脂蛋白、甘油三酯、总蛋白、糖化血红蛋白与DR风险增加密切相关。可以得出结论，中医证型是DR的危险因素。基于网格搜索优化的GBM分类器，以中医和西医的相关危险因素为变量，能够更好地预测DR风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062c/11666193/1bda1b400469/medi-103-e40896-g001.jpg

相似文献

Risk prediction of integrated traditional Chinese and western medicine for diabetes retinopathy based on optimized gradient boosting classifier model.基于优化梯度提升分类器模型的中西医结合预测糖尿病视网膜病变风险研究

Medicine (Baltimore). 2024 Dec 20;103(51):e40896. doi: 10.1097/MD.0000000000040896.

An enhanced machine learning algorithm for type 2 diabetes prognosis with a detailed examination of Key correlates.一种用于 2 型糖尿病预后的增强机器学习算法，对关键相关因素进行了详细研究。

Sci Rep. 2024 Nov 1;14(1):26355. doi: 10.1038/s41598-024-75898-w.

Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。

Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.

Multi-feature, Chinese-Western medicine-integrated prediction model for diabetic peripheral neuropathy based on machine learning and SHAP.基于机器学习和SHAP的糖尿病周围神经病变多特征中西医结合预测模型

Diabetes Metab Res Rev. 2024 May;40(4):e3801. doi: 10.1002/dmrr.3801.

Predicting diabetic retinopathy based on routine laboratory tests by machine learning algorithms.基于机器学习算法通过常规实验室检查预测糖尿病视网膜病变

Eur J Med Res. 2025 Mar 18;30(1):183. doi: 10.1186/s40001-025-02442-5.

Predictive model and risk analysis for diabetic retinopathy using machine learning: a retrospective cohort study in China.基于机器学习的糖尿病视网膜病变预测模型与风险分析：中国的回顾性队列研究。

BMJ Open. 2021 Nov 26;11(11):e050989. doi: 10.1136/bmjopen-2021-050989.

Diabetic peripheral neuropathy detection of type 2 diabetes using machine learning from TCM features: a cross-sectional study.基于中医特征运用机器学习检测2型糖尿病患者的糖尿病周围神经病变：一项横断面研究

BMC Med Inform Decis Mak. 2025 Feb 18;25(1):90. doi: 10.1186/s12911-025-02932-w.

An interpreting machine learning models to predict amputation risk in patients with diabetic foot ulcers: a multi-center study.一种用于预测糖尿病足溃疡患者截肢风险的解释性机器学习模型：一项多中心研究。

Front Endocrinol (Lausanne). 2025 Mar 25;16:1526098. doi: 10.3389/fendo.2025.1526098. eCollection 2025.

Machine learning algorithms for diabetic kidney disease risk predictive model of Chinese patients with type 2 diabetes mellitus.用于中国2型糖尿病患者糖尿病肾病风险预测模型的机器学习算法

Ren Fail. 2025 Dec;47(1):2486558. doi: 10.1080/0886022X.2025.2486558. Epub 2025 Apr 7.

HHO optimized support vector machine classifier for traditional Chinese medicine syndrome differentiation of diabetic retinopathy.用于糖尿病视网膜病变中医辨证的HHO优化支持向量机分类器

Int J Ophthalmol. 2024 Jun 18;17(6):991-1000. doi: 10.18240/ijo.2024.06.02. eCollection 2024.

本文引用的文献

[Establishment of a prognostic model for non-nephrotic membranous nephropathy based on unbalanced data].[基于不平衡数据建立非肾病性膜性肾病的预后模型]

Zhonghua Yi Xue Za Zhi. 2023 May 16;103(18):1386-1392. doi: 10.3760/cma.j.cn112137-20221115-02399.

Prognostic factors for the development and progression of proliferative diabetic retinopathy in people with diabetic retinopathy.增生性糖尿病性视网膜病变在糖尿病性视网膜病变患者中发展和进展的预测因素。

Cochrane Database Syst Rev. 2023 Feb 22;2(2):CD013775. doi: 10.1002/14651858.CD013775.pub2.

A new strategy for the early detection of alzheimer disease stages using multifractal geometry analysis based on K-Nearest Neighbor algorithm.基于 K-最近邻算法的多重分形几何分析在阿尔茨海默病早期检测阶段的新策略。

Sci Rep. 2022 Dec 26;12(1):22381. doi: 10.1038/s41598-022-26958-6.

Survival Prediction of Children Undergoing Hematopoietic Stem Cell Transplantation Using Different Machine Learning Classifiers by Performing Chi-Square Test and Hyperparameter Optimization: A Retrospective Analysis.采用卡方检验和超参数优化的不同机器学习分类器对接受造血干细胞移植的儿童进行生存预测的回顾性分析。

Comput Math Methods Med. 2022 Sep 25;2022:9391136. doi: 10.1155/2022/9391136. eCollection 2022.

Variability of Grading DR Screening Images among Non-Trained Retina Specialists.非专业视网膜专家对糖尿病视网膜病变筛查图像分级的变异性

J Clin Med. 2022 May 31;11(11):3125. doi: 10.3390/jcm11113125.

Non-alcoholic fatty liver disease and type 2 diabetes: An update.非酒精性脂肪性肝病与 2 型糖尿病：最新进展。

J Diabetes Investig. 2022 Jun;13(6):930-940. doi: 10.1111/jdi.13756. Epub 2022 Feb 14.

Inflammation in obesity, diabetes, and related disorders.肥胖、糖尿病及相关紊乱中的炎症。

Immunity. 2022 Jan 11;55(1):31-55. doi: 10.1016/j.immuni.2021.12.013.

Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study.机器学习在一项基于大规模人群的研究中用于识别可转诊糖尿病视网膜病变的效用。

Front Med (Lausanne). 2021 Dec 9;8:773881. doi: 10.3389/fmed.2021.773881. eCollection 2021.

BMJ Open. 2021 Nov 26;11(11):e050989. doi: 10.1136/bmjopen-2021-050989.

Data Pre-Processing Using Neural Processes for Modeling Personalized Vital-Sign Time-Series Data.使用神经过程进行数据预处理以对个性化生命体征时间序列数据进行建模

IEEE J Biomed Health Inform. 2022 Apr;26(4):1528-1537. doi: 10.1109/JBHI.2021.3107518. Epub 2022 Apr 14.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于优化梯度提升分类器模型的中西医结合预测糖尿病视网膜病变风险研究

Risk prediction of integrated traditional Chinese and western medicine for diabetes retinopathy based on optimized gradient boosting classifier model.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献