Predicting diabetic retinopathy based on routine laboratory tests by machine learning algorithms.

Authors

Wan Xiaohua, Zhang Ruihuan, Wang Yanan, Wei Wei, Song Biao, Zhang Lin, Hu Yanwei

Affiliations

Department of Clinical Laboratory, Beijing Chao-Yang Hospital, Capital Medical University, Beijing, People's Republic of China.

Beijing Center for Clinical Laboratories, Beijing, People's Republic of China.

Publication information

Eur J Med Res. 2025 Mar 18;30(1):183. doi: 10.1186/s40001-025-02442-5.

DOI:10.1186/s40001-025-02442-5

PMID:40102923

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11921716/

Abstract

OBJECTIVES

This study aimed to identify risk factors for diabetic retinopathy (DR) and develop machine learning (ML)-based predictive models using routine laboratory data in patients with type 2 diabetes mellitus (T2DM).

METHODS

Clinical data from 4259 T2DM inpatients at Beijing Tongren Hospital were analyzed and divided into a model construction data set (N = 3936) and an external validation data set (N = 323). A prediction model was built from 39 optimal variables with the eXtreme Gradient Boosting (XGBoost) algorithm and compared with four other algorithms: support vector machine (SVM), gradient boosting decision tree (GBDT), neural network (NN), and logistic regression (LR). The Shapley Additive exPlanation (SHAP) method was employed to interpret the XGBoost model. External validation was performed to assess model performance.
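As a rough illustration of the idea behind the SHAP attributions mentioned in the methods, the sketch below computes exact Shapley values for a toy scoring function by enumerating every feature coalition. Everything in it is an assumption made for illustration: the function `toy_risk`, the feature names `a`, `b`, `c`, and the all-zero baseline are hypothetical and are not taken from the study, and the SHAP library itself relies on faster model-specific algorithms rather than this brute-force enumeration.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features that are absent from a coalition are set to their baseline value.
    """
    names = list(x)
    n = len(names)

    def value(coalition):
        # Evaluate the model with only the coalition's features "switched on".
        point = {k: (x[k] if k in coalition else baseline[k]) for k in names}
        return model(point)

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {name}) - value(s))
        phi[name] = total
    return phi


def toy_risk(p):
    # Hypothetical score: two additive terms plus one interaction term.
    return 2.0 * p["a"] + 1.0 * p["b"] + 0.5 * p["b"] * p["c"]


x = {"a": 1.0, "b": 2.0, "c": 4.0}
baseline = {"a": 0.0, "b": 0.0, "c": 0.0}
phi = shapley_values(toy_risk, x, baseline)
print(phi)
```

In this toy case the interaction term is split evenly between `b` and `c`, and the attributions sum to the difference between the score at `x` and the score at the baseline, which is the additivity property that SHAP-style explanations rely on.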

RESULTS

DR was present in 47.69% (N = 1877) of T2DM patients in the model construction data set. Among the models tested, the XGBoost model performed best, with an AUC of 0.831, accuracy of 0.757, sensitivity of 0.754, specificity of 0.759, and F1-score of 0.752. SHAP explained feature importance for the XGBoost model and identified key risk factors for DR. External validation yielded an accuracy of 0.650 for the XGBoost model.
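For readers less familiar with the reported metrics, the following minimal sketch shows how accuracy, sensitivity, specificity, F1-score, and AUC are conventionally derived. The confusion-matrix counts and the score lists are hypothetical values chosen for easy arithmetic and are not the study's data; only the cohort sizes 1877, 3936, 323, and 4259 come from the abstract.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)          # recall on DR-positive cases
    specificity = tn / (tn + fp)          # recall on DR-negative cases
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


def auc(scores_pos, scores_neg):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in scores_pos:
        for q in scores_neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical confusion matrix (not from the study).
metrics = classification_metrics(tp=75, fp=20, tn=80, fn=25)

# Hypothetical predicted scores for positive and negative cases.
toy_auc = auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.4])

# Counts taken from the abstract: DR prevalence and cohort split.
prevalence_pct = round(100 * 1877 / 3936, 2)
total_patients = 3936 + 323

print(metrics, toy_auc, prevalence_pct, total_patients)
```

The last two lines reproduce the abstract's own arithmetic: 1877 of 3936 patients corresponds to the stated 47.69%, and the two data sets add up to the 4259 inpatients analyzed.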

CONCLUSIONS

The XGBoost-based prediction model effectively assesses DR risk in T2DM patients using routine laboratory data, aiding clinicians in identifying high-risk individuals and guiding personalized management strategies, especially in medically underserved areas.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80a9/11921716/9a7f5a5165e9/40001_2025_2442_Fig1_HTML.jpg

Similar articles

Predicting diabetic retinopathy based on routine laboratory tests by machine learning algorithms.

Eur J Med Res. 2025 Mar 18;30(1):183. doi: 10.1186/s40001-025-02442-5.

Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.

Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.

Machine learning algorithms for diabetic kidney disease risk predictive model of Chinese patients with type 2 diabetes mellitus.

Ren Fail. 2025 Dec;47(1):2486558. doi: 10.1080/0886022X.2025.2486558. Epub 2025 Apr 7.

Interpretable machine learning method to predict the risk of pre-diabetes using a national-wide cross-sectional data: evidence from CHNS.

BMC Public Health. 2025 Mar 26;25(1):1145. doi: 10.1186/s12889-025-22419-7.

Risk prediction of integrated traditional Chinese and western medicine for diabetes retinopathy based on optimized gradient boosting classifier model.

Medicine (Baltimore). 2024 Dec 20;103(51):e40896. doi: 10.1097/MD.0000000000040896.

Predictive model and risk analysis for diabetic retinopathy using machine learning: a retrospective cohort study in China.

BMJ Open. 2021 Nov 26;11(11):e050989. doi: 10.1136/bmjopen-2021-050989.

Development and Validation of a Machine Learning Algorithm for Predicting Diabetes Retinopathy in Patients With Type 2 Diabetes: Algorithm Development Study.

JMIR Med Inform. 2025 Feb 7;13:e58107. doi: 10.2196/58107.

An enhanced machine learning algorithm for type 2 diabetes prognosis with a detailed examination of Key correlates.

Sci Rep. 2024 Nov 1;14(1):26355. doi: 10.1038/s41598-024-75898-w.

Comparison of Machine Learning Algorithms and Nomogram Construction for Diabetic Retinopathy Prediction in Type 2 Diabetes Mellitus Patients.

Ophthalmic Res. 2024;67(1):537-548. doi: 10.1159/000541294. Epub 2024 Sep 4.

Predicting the risk of diabetic retinopathy using explainable machine learning algorithms.

Diabetes Metab Syndr. 2023 Dec;17(12):102919. doi: 10.1016/j.dsx.2023.102919. Epub 2023 Dec 4.

Cited by

Optimized prediction of diabetes complications using ensemble learning with Bayesian optimization: a cost-efficient laboratory-based approach.

Front Endocrinol (Lausanne). 2025 Jun 20;16:1593068. doi: 10.3389/fendo.2025.1593068. eCollection 2025.

Nonlinear association between visceral fat metabolism score and heart failure: insights from LightGBM modeling and SHAP-Driven feature interpretation in NHANES.

BMC Med Inform Decis Mak. 2025 Jul 1;25(1):223. doi: 10.1186/s12911-025-03076-7.
