一家大型医疗机构中糖尿病患者的10年回顾性队列研究：利用多种机器学习模型预测糖尿病肾病

A 10-year retrospective cohort of diabetic patients in a large medical institution: Utilizing multiple machine learning models for diabetic kidney disease prediction.

作者信息

Li Guangpu, Li Jia, Tian Fei, Ren Jingjing, Guo Zuishuang, Pan Shaokang, Liu Dongwei, Duan Jiayu, Liu Zhangsuo

机构信息

Department of Integrated Traditional and Western Nephrology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China.

Research Institute of Nephrology, Zhengzhou University, Zhengzhou, China.

出版信息

Digit Health. 2024 Jul 21;10:20552076241265220. doi: 10.1177/20552076241265220. eCollection 2024 Jan-Dec.

DOI:10.1177/20552076241265220

PMID:39229465

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11369867/

Abstract

OBJECTIVE

As the prevalence of diabetes steadily increases, the burden of diabetic kidney disease (DKD) is also intensifying. In response, we have utilized a 10-year diabetes cohort from our medical center to train machine learning-based models for predicting DKD and interpreting relevant factors.

METHODS

Employing a large dataset from 73,101 hospitalized type 2 diabetes patients at The First Affiliated Hospital of Zhengzhou University, we analyzed demographic and medication data. Machine learning models, including XGBoost, CatBoost, LightGBM, Random Forest, AdaBoost, GBDT (gradient boosting decision tree), and SGD (stochastic gradient descent), were trained on these data, focusing on interpretability by SHAP. SHAP explains the output of the models by assigning an importance value to each feature for a particular prediction, enabling a clear understanding of how individual features influence the prediction outcomes.

RESULTS

The XGBoost model achieved an area under the curve (AUC) of 0.95 and an area under the precision-recall curve (AUPR) of 0.76, while CatBoost recorded an AUC of 0.97 and an AUPR of 0.84. These results underscore the effectiveness of these models in predicting DKD in patients with type 2 diabetes.

CONCLUSIONS

This study provides a comprehensive approach for predicting DKD in patients with type 2 diabetes, employing machine learning techniques. The findings are crucial for the early detection and intervention of DKD, offering a roadmap for future research and healthcare strategies in diabetes management. Additionally, the presence of non-diabetic kidney diseases and diabetes with complications was identified as significant factors in the development of DKD.

摘要

目的

随着糖尿病患病率稳步上升，糖尿病肾病（DKD）的负担也在加剧。作为应对措施，我们利用了来自我们医疗中心的一个为期10年的糖尿病队列来训练基于机器学习的模型，以预测DKD并解释相关因素。

方法

我们使用了郑州大学第一附属医院73101例住院2型糖尿病患者的大型数据集，分析了人口统计学和用药数据。在这些数据上训练了包括XGBoost、CatBoost、LightGBM、随机森林、AdaBoost、梯度提升决策树（GBDT）和随机梯度下降（SGD）在内的机器学习模型，重点是通过SHAP进行可解释性分析。SHAP通过为特定预测的每个特征分配一个重要性值来解释模型的输出，从而能够清楚地了解各个特征如何影响预测结果。

结果

XGBoost模型的曲线下面积（AUC）为0.95，精确率-召回率曲线下面积（AUPR）为0.76，而CatBoost的AUC为0.97，AUPR为0.84。这些结果强调了这些模型在预测2型糖尿病患者DKD方面的有效性。

结论

本研究提供了一种利用机器学习技术预测2型糖尿病患者DKD的综合方法。这些发现对于DKD的早期检测和干预至关重要，为糖尿病管理的未来研究和医疗保健策略提供了路线图。此外，非糖尿病肾病和伴有并发症的糖尿病的存在被确定为DKD发生的重要因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d30/11369867/aa4d58d94122/10.1177_20552076241265220-fig1.jpg

相似文献

A 10-year retrospective cohort of diabetic patients in a large medical institution: Utilizing multiple machine learning models for diabetic kidney disease prediction.一家大型医疗机构中糖尿病患者的10年回顾性队列研究：利用多种机器学习模型预测糖尿病肾病

Digit Health. 2024 Jul 21;10:20552076241265220. doi: 10.1177/20552076241265220. eCollection 2024 Jan-Dec.

Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study.基于真实世界数据应用机器学习预测 2 型糖尿病患者的糖尿病肾病：一项多中心回顾性研究。

Front Endocrinol (Lausanne). 2023 Jul 4;14:1184190. doi: 10.3389/fendo.2023.1184190. eCollection 2023.

Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records.基于电子病历的机器学习预测糖尿病肾病 3 年风险。

J Transl Med. 2022 Mar 26;20(1):143. doi: 10.1186/s12967-022-03339-1.

Development and internal validation of machine learning algorithms for end-stage renal disease risk prediction model of people with type 2 diabetes mellitus and diabetic kidney disease.机器学习算法在 2 型糖尿病合并糖尿病肾病患者终末期肾病风险预测模型中的开发与内部验证。

Ren Fail. 2022 Dec;44(1):562-570. doi: 10.1080/0886022X.2022.2056053.

Development and External Validation of Machine Learning Models for Diabetic Microvascular Complications: Cross-Sectional Study With Metabolites.机器学习模型在糖尿病微血管并发症中的开发和外部验证：基于代谢物的横断面研究。

J Med Internet Res. 2024 Mar 28;26:e41065. doi: 10.2196/41065.

Prediction model of atrial fibrillation recurrence after Cox-Maze IV procedure in patients with chronic valvular disease and atrial fibrillation based on machine learning algorithm.基于机器学习算法的慢性瓣膜病合并心房颤动患者 Cox-Maze IV 术后心房颤动复发预测模型。

Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2023 Jul 28;48(7):995-1007. doi: 10.11817/j.issn.1672-7347.2023.230018.

Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。

Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.

Design of Machine Learning Algorithms and Internal Validation of a Kidney Risk Prediction Model for Type 2 Diabetes Mellitus.2型糖尿病肾脏风险预测模型的机器学习算法设计与内部验证

Int J Gen Med. 2024 May 20;17:2299-2309. doi: 10.2147/IJGM.S449397. eCollection 2024.

Machine learning-based models for the prediction of breast cancer recurrence risk.基于机器学习的乳腺癌复发风险预测模型。

BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z.

Prediction of diabetic kidney disease risk using machine learning models: A population-based cohort study of Asian adults.利用机器学习模型预测糖尿病肾病风险：亚洲成年人的基于人群队列研究。

Elife. 2023 Sep 14;12:e81878. doi: 10.7554/eLife.81878.

本文引用的文献

Short-term duration of diabetic retinopathy as a predictor for development of diabetic kidney disease.糖尿病视网膜病变的短期病程作为糖尿病肾病发生的预测指标。

J Transl Int Med. 2023 Dec 20;11(4):449-458. doi: 10.2478/jtim-2022-0074. eCollection 2023 Dec.

Hypertension Statistics for US Adults: An Open-Source Web Application for Analysis and Visualization of National Health and Nutrition Examination Survey Data.美国成年人高血压统计数据：一个用于分析和可视化国家健康和营养检查调查数据的开源 Web 应用程序。

Hypertension. 2023 Jun;80(6):1311-1320. doi: 10.1161/HYPERTENSIONAHA.123.20900. Epub 2023 Apr 21.

Molecular pathways that drive diabetic kidney disease.驱动糖尿病肾病的分子通路。

J Clin Invest. 2023 Feb 15;133(4):e165654. doi: 10.1172/JCI165654.

Albuminuric diabetic kidney disease predicts foot ulcers in type 2 diabetes.白蛋白尿型糖尿病肾病可预测 2 型糖尿病患者的足部溃疡。

J Diabetes Complications. 2023 Feb;37(2):108403. doi: 10.1016/j.jdiacomp.2023.108403. Epub 2023 Jan 7.

Interpretable machine learning methods for predictions in systems biology from omics data.用于基于组学数据的系统生物学预测的可解释机器学习方法。

Front Mol Biosci. 2022 Oct 17;9:926623. doi: 10.3389/fmolb.2022.926623. eCollection 2022.

Incidence of Chronic Kidney Disease among Adults with Diabetes, 2015-2020.2015 - 2020年糖尿病成年患者慢性肾脏病的发病率

N Engl J Med. 2022 Oct 13;387(15):1430-1431. doi: 10.1056/NEJMc2207018.

Prevalence and Risk Factors of Hyperuricemia and Gout: A Cross-sectional Survey from 31 Provinces in Mainland China.高尿酸血症和痛风的患病率及危险因素：中国大陆31个省份的横断面调查

J Transl Int Med. 2022 Jul 7;10(2):134-145. doi: 10.2478/jtim-2022-0031. eCollection 2022 Jun.

Artificial intelligence-enabled decision support in nephrology.人工智能辅助肾脏病学决策支持。

Nat Rev Nephrol. 2022 Jul;18(7):452-465. doi: 10.1038/s41581-022-00562-3. Epub 2022 Apr 22.

Chronic kidney disease and neurological disorders: are uraemic toxins the missing piece of the puzzle?慢性肾脏病与神经病变：尿毒症毒素是缺失的一环吗？

Nephrol Dial Transplant. 2021 Dec 28;37(Suppl 2):ii33-ii44. doi: 10.1093/ndt/gfab223.

Performance of prediction models for nephropathy in people with type 2 diabetes: systematic review and external validation study.2 型糖尿病患者肾病预测模型的性能：系统评价和外部验证研究。

BMJ. 2021 Sep 28;374:n2134. doi: 10.1136/bmj.n2134.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一家大型医疗机构中糖尿病患者的10年回顾性队列研究：利用多种机器学习模型预测糖尿病肾病

A 10-year retrospective cohort of diabetic patients in a large medical institution: Utilizing multiple machine learning models for diabetic kidney disease prediction.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献