• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发和验证用于识别正常血糖中的糖尿病前期和糖尿病的机器学习模型。

Development and Validation of Machine Learning Models for Identifying Prediabetes and Diabetes in Normoglycemia.

机构信息

Postgraduate Department, Shandong First Medical University (Shandong Academy of Medical Sciences), Jinan, China.

Department of Anesthesiology, Second Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, China.

出版信息

Diabetes Metab Res Rev. 2024 Nov;40(8):e70003. doi: 10.1002/dmrr.70003.

DOI:10.1002/dmrr.70003
PMID:39497474
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11601146/
Abstract

BACKGROUND

Prediabetes and diabetes are both abnormal states of glucose metabolism (AGM) that can lead to severe complications. Early detection of AGM is crucial for timely intervention and treatment. However, fasting blood glucose (FBG) as a mass population screening method may fail to identify some individuals who are actually AGM but with normoglycemia. This study aimed to develop and validate machine learning (ML) models to identify AGM among individuals with normoglycemia using routine health check-up indicators.

METHODS

According to the American Diabetes Association (ADA) criteria, participants with normoglycemia (FBG ≤ 5.6 mmol/L) were collected from 2019 to 2023, and then divided into AGM and Normal groups using glycosylated haemoglobin (HbA1c) 5.7% as the threshold. Data from 2019 to 2022 were divided into training and internal validation sets at a 7:3 ratio, while data from 2023 were used as the external validation set. Seven ML algorithms-including logistic regression (LR), random forest (RF), support vector machine (SVM), extreme gradient boosting machine, multilayer perceptron (MLP), light gradient boosting machine (LightGBM), and categorical boosting (CatBoost)-were used to build models for identifying AGM in normoglycemia population. Model performance was evaluated using the area under the receiver operating characteristic curve (auROC) and the precision-recall curve (auPR). The feature contributions to the optimal model was visualised using the SHapley Additive exPlanations (SHAP). Finally, an intuitive and user-friendly interactive interface was developed.

RESULTS

A total of 59,259 participants were finally enroled in this study, and then divided into the training set of 32,810, the internal validation set of 14,060, and the external validation set of 12,389. The Catboost model outperformed the others with auROC of 0.806 and 0.794 for the internal and external validation set, respectively. Age was the most important feature influencing the performance of the CatBoost model, followed by fasting blood glucose, red blood cells, haemoglobin, body mass index, and triglyceride-glucose.

CONCLUSION

A well-performed ML model to identify AGM in the normoglycemia population was built, offering significant potential for early intervention and treatment of AGM that would otherwise have been missed.

摘要

背景

糖尿病前期和糖尿病都是葡萄糖代谢异常(AGM)的异常状态,可导致严重的并发症。早期发现 AGM 对于及时干预和治疗至关重要。然而,空腹血糖(FBG)作为一种大规模人群筛查方法,可能无法识别出一些实际上处于 AGM 但血糖正常的个体。本研究旨在开发和验证机器学习(ML)模型,以使用常规健康检查指标识别血糖正常人群中的 AGM。

方法

根据美国糖尿病协会(ADA)标准,收集 2019 年至 2023 年血糖正常(FBG≤5.6mmol/L)的参与者,并使用糖化血红蛋白(HbA1c)5.7%作为阈值将其分为 AGM 和正常组。2019 年至 2022 年的数据按 7:3 的比例分为训练集和内部验证集,而 2023 年的数据用作外部验证集。使用七种 ML 算法——逻辑回归(LR)、随机森林(RF)、支持向量机(SVM)、极端梯度提升机、多层感知机(MLP)、轻梯度提升机(LightGBM)和分类提升机(CatBoost)——构建用于识别血糖正常人群中 AGM 的模型。使用接受者操作特征曲线下的面积(auROC)和精度-召回曲线下的面积(auPR)评估模型性能。使用 SHapley Additive exPlanations(SHAP)可视化最优模型的特征贡献。最后,开发了一个直观且用户友好的交互界面。

结果

本研究共纳入 59259 名参与者,分为训练集 32810 名、内部验证集 14060 名和外部验证集 12389 名。CatBoost 模型的内部验证集和外部验证集的 auROC 分别为 0.806 和 0.794,优于其他模型。年龄是影响 CatBoost 模型性能的最重要特征,其次是空腹血糖、红细胞、血红蛋白、体重指数和甘油三酯-葡萄糖。

结论

构建了一种性能良好的用于识别血糖正常人群中 AGM 的 ML 模型,为早期干预和治疗否则可能被遗漏的 AGM 提供了重要潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/dfbf702b7f78/DMRR-40-e70003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/d185b03a02fe/DMRR-40-e70003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/a176f5469352/DMRR-40-e70003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/6ea7a916f731/DMRR-40-e70003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/dfbf702b7f78/DMRR-40-e70003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/d185b03a02fe/DMRR-40-e70003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/a176f5469352/DMRR-40-e70003-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/6ea7a916f731/DMRR-40-e70003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ed7/11601146/dfbf702b7f78/DMRR-40-e70003-g001.jpg

相似文献

1
Development and Validation of Machine Learning Models for Identifying Prediabetes and Diabetes in Normoglycemia.开发和验证用于识别正常血糖中的糖尿病前期和糖尿病的机器学习模型。
Diabetes Metab Res Rev. 2024 Nov;40(8):e70003. doi: 10.1002/dmrr.70003.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Predictive value of MetS-IR for the glucose status conversion in prediabetes: a multi-center retrospective cohort study.代谢综合征胰岛素抵抗(MetS-IR)对糖尿病前期血糖状态转换的预测价值:一项多中心回顾性队列研究。
BMC Endocr Disord. 2025 Jul 1;25(1):162. doi: 10.1186/s12902-025-01974-5.
6
An artificial intelligence model to predict mortality among hemodialysis patients: A retrospective validated cohort study.一种预测血液透析患者死亡率的人工智能模型:一项回顾性验证队列研究。
Sci Rep. 2025 Jul 29;15(1):27699. doi: 10.1038/s41598-025-06576-8.
7
Development and validation of a machine learning model based on laboratory parameters for preoperative prediction of Ki-67 expression in gliomas.基于实验室参数的机器学习模型用于术前预测胶质瘤中Ki-67表达的开发与验证
J Neurosurg. 2025 Mar 28:1-13. doi: 10.3171/2024.11.JNS241673.
8
Interpretable machine learning for predicting isolated basal septal hypertrophy.用于预测孤立性基底间隔肥厚的可解释机器学习。
PLoS One. 2025 Jun 30;20(6):e0325992. doi: 10.1371/journal.pone.0325992. eCollection 2025.
9
Interpretable XGBoost model identifies idiopathic central precocious puberty in girls using four clinical and imaging features.可解释的XGBoost模型利用四种临床和影像学特征识别女童特发性中枢性性早熟。
BMC Endocr Disord. 2025 Jul 1;25(1):159. doi: 10.1186/s12902-025-01983-4.
10
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.预测冠心病风险的机器学习模型:基于日本人群研究的见解进行开发与验证
JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.

引用本文的文献

1
Applications of Artificial Intelligence and Machine Learning in Prediabetes: A Scoping Review.人工智能和机器学习在糖尿病前期的应用:一项范围综述
J Diabetes Sci Technol. 2025 Jul 8:19322968251351995. doi: 10.1177/19322968251351995.
2
Corrigendum.勘误
Diabetes Metab Res Rev. 2025 Jan;41(1):e70013. doi: 10.1002/dmrr.70013.

本文引用的文献

1
The association between TyG and all-cause/non-cardiovascular mortality in general patients with type 2 diabetes mellitus is modified by age: results from the cohort study of NHANES 1999-2018.在 2 型糖尿病患者中,TyG 与全因/非心血管死亡率之间的关联受年龄影响:来自 NHANES 1999-2018 队列研究的结果。
Cardiovasc Diabetol. 2024 Jan 28;23(1):43. doi: 10.1186/s12933-024-02120-6.
2
A nomogram model for predicting 5-year risk of prediabetes in Chinese adults.用于预测中国成年人糖尿病前期 5 年风险的列线图模型。
Sci Rep. 2023 Dec 18;13(1):22523. doi: 10.1038/s41598-023-50122-3.
3
Triglyceride-glucose index is associated with severe obstructive coronary artery disease and atherosclerotic target lesion failure among young adults.
甘油三酯-葡萄糖指数与青年人群严重阻塞性冠状动脉疾病及动脉粥样硬化靶病变失败相关。
Cardiovasc Diabetol. 2023 Oct 21;22(1):283. doi: 10.1186/s12933-023-02004-1.
4
Differential effect of interventions in patients with prediabetes stratified by a machine learning-based diabetes progression prediction model.基于机器学习的糖尿病进展预测模型分层的糖尿病前期患者干预措施的差异效应。
Diabetes Obes Metab. 2024 Jan;26(1):97-107. doi: 10.1111/dom.15291. Epub 2023 Oct 1.
5
Prediction of postpartum prediabetes by machine learning methods in women with gestational diabetes mellitus.机器学习方法对妊娠期糖尿病女性产后糖尿病前期的预测
iScience. 2023 Sep 9;26(10):107846. doi: 10.1016/j.isci.2023.107846. eCollection 2023 Oct 20.
6
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
7
Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning.糖尿病周围神经病变危险因素研究及机器学习预测模型的建立。
BMC Med Inform Decis Mak. 2023 Aug 2;23(1):146. doi: 10.1186/s12911-023-02232-1.
8
Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021.全球、地区和国家 1990 年至 2021 年糖尿病负担,以及对 2050 年患病率的预测:2021 年全球疾病负担研究的系统分析。
Lancet. 2023 Jul 15;402(10397):203-234. doi: 10.1016/S0140-6736(23)01301-6. Epub 2023 Jun 22.
9
LASSO-based machine learning algorithm to predict the incidence of diabetes in different stages.基于套索的机器学习算法预测不同阶段糖尿病的发病率。
Aging Male. 2023 Dec;26(1):2205510. doi: 10.1080/13685538.2023.2205510.
10
Diagnosis and Management of Prediabetes: A Review.糖尿病前期的诊断与管理:综述。
JAMA. 2023 Apr 11;329(14):1206-1216. doi: 10.1001/jama.2023.4063.