• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于数据驱动的机器学习方法在糖尿病风险预测中的应用。

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction.

机构信息

Department of Computer Engineering and Informatics, University of Patras, 26504 Patras, Greece.

出版信息

Sensors (Basel). 2022 Jul 15;22(14):5304. doi: 10.3390/s22145304.

DOI:10.3390/s22145304
PMID:35890983
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9318204/
Abstract

Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models.

摘要

糖尿病是一种以碳水化合物、脂肪和蛋白质代谢紊乱为特征的慢性疾病。所有类型糖尿病中最典型的紊乱是高血糖,即血糖水平升高。现代生活方式显著增加了糖尿病的发病率。因此,早期诊断疾病是必要的。由于机器学习 (ML) 在开发用于风险预测、预后、治疗和管理各种疾病的高效工具方面具有很高的潜力,因此在医疗保健提供者和医生中得到了广泛的关注。在这项研究中,描述了一种监督学习方法,旨在创建用于 2 型糖尿病发生的高效风险预测工具。进行特征分析以评估其重要性并探索其与糖尿病的关联。这些特征是糖尿病常见的且通常发展缓慢的症状,用于训练和测试几种 ML 模型。根据 Precision、Recall、F-Measure、Accuracy 和 AUC 指标评估各种 ML 模型,并在 10 倍交叉验证和数据分割下进行比较。两种验证方法都强调随机森林和 K-NN 是表现最佳的模型,优于其他模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/8649062f60ff/sensors-22-05304-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/5fed72368151/sensors-22-05304-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/0df2efa0ec6d/sensors-22-05304-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/8b89f5746759/sensors-22-05304-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/e0aa7b70f296/sensors-22-05304-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/d6cbe5528458/sensors-22-05304-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/3ee21b290814/sensors-22-05304-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/2ff5b5b2d3e5/sensors-22-05304-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/8649062f60ff/sensors-22-05304-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/5fed72368151/sensors-22-05304-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/0df2efa0ec6d/sensors-22-05304-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/8b89f5746759/sensors-22-05304-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/e0aa7b70f296/sensors-22-05304-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/d6cbe5528458/sensors-22-05304-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/3ee21b290814/sensors-22-05304-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/2ff5b5b2d3e5/sensors-22-05304-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6811/9318204/8649062f60ff/sensors-22-05304-g008.jpg

相似文献

1
Data-Driven Machine-Learning Methods for Diabetes Risk Prediction.基于数据驱动的机器学习方法在糖尿病风险预测中的应用。
Sensors (Basel). 2022 Jul 15;22(14):5304. doi: 10.3390/s22145304.
2
Machine Learning Methods for Hypercholesterolemia Long-Term Risk Prediction.机器学习方法在高胆固醇血症长期风险预测中的应用。
Sensors (Basel). 2022 Jul 18;22(14):5365. doi: 10.3390/s22145365.
3
Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques.基于舌象特征和机器学习技术的无创糖尿病风险预测模型的建立。
Int J Med Inform. 2021 May;149:104429. doi: 10.1016/j.ijmedinf.2021.104429. Epub 2021 Feb 22.
4
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
5
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
6
Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective.从数据预处理和机器学习角度看糖尿病的预测与诊断
Comput Methods Programs Biomed. 2022 Jun;220:106773. doi: 10.1016/j.cmpb.2022.106773. Epub 2022 Mar 31.
7
Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type.基于生活方式类型的数据驱动糖尿病预测的机器学习模型。
Int J Environ Res Public Health. 2022 Nov 15;19(22):15027. doi: 10.3390/ijerph192215027.
8
Comparative study on risk prediction model of type 2 diabetes based on machine learning theory: a cross-sectional study.基于机器学习理论的 2 型糖尿病风险预测模型的比较研究:一项横断面研究。
BMJ Open. 2023 Aug 29;13(8):e069018. doi: 10.1136/bmjopen-2022-069018.
9
Application of information theoretic feature selection and machine learning methods for the development of genetic risk prediction models.信息论特征选择和机器学习方法在遗传风险预测模型开发中的应用。
Sci Rep. 2021 Dec 2;11(1):23335. doi: 10.1038/s41598-021-00854-x.
10
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

引用本文的文献

1
Enhancing diabetes risk prediction through focal active learning and machine learning models.通过聚焦主动学习和机器学习模型增强糖尿病风险预测
PLoS One. 2025 Jul 8;20(7):e0327120. doi: 10.1371/journal.pone.0327120. eCollection 2025.
2
Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis.数据驱动的糖尿病预测与管理:决策树分类器和人工神经网络模型的比较评估及统计分析
Sci Rep. 2025 Jun 2;15(1):19339. doi: 10.1038/s41598-025-03718-w.
3
Prediction of coronary heart disease based on klotho levels using machine learning.

本文引用的文献

1
Itch in diabetes: a common underestimated problem.糖尿病中的瘙痒:一个普遍被低估的问题。
Postepy Dermatol Alergol. 2021 Apr;38(2):177-183. doi: 10.5114/ada.2019.89712. Epub 2019 Nov 18.
2
Stroke Risk Prediction with Machine Learning Techniques.基于机器学习技术的中风风险预测。
Sensors (Basel). 2022 Jun 21;22(13):4670. doi: 10.3390/s22134670.
3
Machine learning-based prediction of COVID-19 diagnosis based on symptoms.基于症状的新冠肺炎诊断的机器学习预测
基于机器学习利用klotho水平预测冠心病
Sci Rep. 2025 May 27;15(1):18519. doi: 10.1038/s41598-025-03234-x.
4
Machine learning and artificial intelligence in type 2 diabetes prediction: a comprehensive 33-year bibliometric and literature analysis.机器学习与人工智能在2型糖尿病预测中的应用:一项为期33年的全面文献计量学与文献分析
Front Digit Health. 2025 Mar 27;7:1557467. doi: 10.3389/fdgth.2025.1557467. eCollection 2025.
5
Next-generation diabetes diagnosis and personalized diet-activity management: A hybrid ensemble paradigm.下一代糖尿病诊断与个性化饮食-运动管理:一种混合集成范式。
PLoS One. 2025 Jan 8;20(1):e0307718. doi: 10.1371/journal.pone.0307718. eCollection 2025.
6
Risk Prediction of high blood glucose among women (15-49 years) and men (15-54 years) in India: An analysis from National Family Health Survey-5 (2019-21).印度15至49岁女性和15至54岁男性高血糖的风险预测:来自第五轮全国家庭健康调查(2019 - 2021年)的分析
J Family Med Prim Care. 2024 Nov;13(11):5312-5319. doi: 10.4103/jfmpc.jfmpc_929_24. Epub 2024 Nov 18.
7
Comparative study of ten machine learning algorithms for short-term forecasting in gas warning systems.气体预警系统中十种机器学习算法用于短期预测的比较研究
Sci Rep. 2024 Sep 20;14(1):21969. doi: 10.1038/s41598-024-67283-4.
8
Proactive Identification of Patients with Diabetes at Risk of Uncontrolled Outcomes during a Diabetes Management Program: Conceptualization and Development Study Using Machine Learning.在糖尿病管理项目中主动识别有出现控制不佳结果风险的糖尿病患者:使用机器学习的概念化与开发研究
JMIR Form Res. 2024 Apr 26;8:e54373. doi: 10.2196/54373.
9
A feature optimization study based on a diabetes risk questionnaire.一项基于糖尿病风险问卷的特征优化研究。
Front Public Health. 2024 Feb 23;12:1328353. doi: 10.3389/fpubh.2024.1328353. eCollection 2024.
10
Diabetes risk prediction model based on community follow-up data using machine learning.基于社区随访数据并运用机器学习的糖尿病风险预测模型
Prev Med Rep. 2023 Aug 20;35:102358. doi: 10.1016/j.pmedr.2023.102358. eCollection 2023 Oct.
NPJ Digit Med. 2021 Jan 4;4(1):3. doi: 10.1038/s41746-020-00372-6.
4
The Effects of Type 2 Diabetes Mellitus on Organ Metabolism and the Immune System.2 型糖尿病对器官代谢和免疫系统的影响。
Front Immunol. 2020 Jul 22;11:1582. doi: 10.3389/fimmu.2020.01582. eCollection 2020.
5
Recent advancement in cancer detection using machine learning: Systematic survey of decades, comparisons and challenges.利用机器学习进行癌症检测的最新进展:数十年的系统调查、比较和挑战。
J Infect Public Health. 2020 Sep;13(9):1274-1289. doi: 10.1016/j.jiph.2020.06.033. Epub 2020 Aug 2.
6
Early detection of type 2 diabetes mellitus using machine learning-based prediction models.使用基于机器学习的预测模型进行 2 型糖尿病的早期检测。
Sci Rep. 2020 Jul 20;10(1):11981. doi: 10.1038/s41598-020-68771-z.
7
Machine Learning and Prediction of All-Cause Mortality in COPD.机器学习与 COPD 全因死亡率预测。
Chest. 2020 Sep;158(3):952-964. doi: 10.1016/j.chest.2020.02.079. Epub 2020 Apr 27.
8
Diabetes mellitus is associated with increased mortality and severity of disease in COVID-19 pneumonia - A systematic review, meta-analysis, and meta-regression.糖尿病与COVID-19肺炎患者的死亡率增加及疾病严重程度相关——一项系统评价、荟萃分析和Meta回归分析
Diabetes Metab Syndr. 2020 Jul-Aug;14(4):395-403. doi: 10.1016/j.dsx.2020.04.018. Epub 2020 Apr 17.
9
Logistic regression was as good as machine learning for predicting major chronic diseases.逻辑回归在预测主要慢性病方面与机器学习一样出色。
J Clin Epidemiol. 2020 Jun;122:56-69. doi: 10.1016/j.jclinepi.2020.03.002. Epub 2020 Mar 10.
10
Sociodemographic and lifestyle-related risk factors for identifying vulnerable groups for type 2 diabetes: a narrative review with emphasis on data from Europe.社会人口学和生活方式相关的 2 型糖尿病高危人群识别风险因素:以欧洲数据为主的叙述性综述。
BMC Endocr Disord. 2020 Mar 12;20(Suppl 1):134. doi: 10.1186/s12902-019-0463-3.