• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

糖尿病研究中的机器学习与数据挖掘方法

Machine Learning and Data Mining Methods in Diabetes Research.

作者信息

Kavakiotis Ioannis, Tsave Olga, Salifoglou Athanasios, Maglaveras Nicos, Vlahavas Ioannis, Chouvarda Ioanna

机构信息

Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece; Institute of Applied Biosciences, CERTH, Thessaloniki, Greece.

Laboratory of Inorganic Chemistry, Department of Chemical Engineering, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece.

出版信息

Comput Struct Biotechnol J. 2017 Jan 8;15:104-116. doi: 10.1016/j.csbj.2016.12.005. eCollection 2017.

DOI:10.1016/j.csbj.2016.12.005
PMID:28138367
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5257026/
Abstract

The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.

摘要

生物技术和健康科学的显著进步带来了大量数据的产生,例如从大型电子健康记录(EHR)中生成的高通量基因数据和临床信息。为此,目前在生物科学中应用机器学习和数据挖掘方法比以往任何时候都更加重要和不可或缺,这些方法有助于将所有可用信息智能地转化为有价值的知识。糖尿病(DM)被定义为一组对全球人类健康造成巨大压力的代谢紊乱疾病。在糖尿病各个方面(诊断、病因病理生理学、治疗等)的广泛研究导致了大量数据的产生。本研究的目的是对机器学习、数据挖掘技术和工具在糖尿病研究领域的应用进行系统综述,涉及以下方面:a)预测与诊断;b)糖尿病并发症;c)遗传背景与环境;e)医疗保健与管理,其中第一类应用似乎最为普遍。研究采用了多种机器学习算法。总体而言,所使用的算法中85%采用监督学习方法,15%采用无监督学习方法,更具体地说是关联规则。支持向量机(SVM)是最成功且应用最广泛的算法。关于数据类型,主要使用临床数据集。所选文章中的标题应用展示了提取有价值知识的有用性,这些知识能够产生新的假设,从而更深入地理解糖尿病并进行进一步研究。

相似文献

1
Machine Learning and Data Mining Methods in Diabetes Research.糖尿病研究中的机器学习与数据挖掘方法
Comput Struct Biotechnol J. 2017 Jan 8;15:104-116. doi: 10.1016/j.csbj.2016.12.005. eCollection 2017.
2
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
3
Comparing different supervised machine learning algorithms for disease prediction.比较不同的监督机器学习算法在疾病预测中的应用。
BMC Med Inform Decis Mak. 2019 Dec 21;19(1):281. doi: 10.1186/s12911-019-1004-8.
4
A survey on diabetes risk prediction using machine learning approaches.一项关于使用机器学习方法进行糖尿病风险预测的调查。
J Family Med Prim Care. 2022 Nov;11(11):6929-6934. doi: 10.4103/jfmpc.jfmpc_502_22. Epub 2022 Dec 16.
5
A systematic review of data mining and machine learning for air pollution epidemiology.空气污染流行病学中数据挖掘与机器学习的系统综述。
BMC Public Health. 2017 Nov 28;17(1):907. doi: 10.1186/s12889-017-4914-3.
6
A review on current advances in machine learning based diabetes prediction.基于机器学习的糖尿病预测研究进展综述。
Prim Care Diabetes. 2021 Jun;15(3):435-443. doi: 10.1016/j.pcd.2021.02.005. Epub 2021 Feb 26.
7
Assessment of vector-host-pathogen relationships using data mining and machine learning.利用数据挖掘和机器学习评估媒介-宿主-病原体关系。
Comput Struct Biotechnol J. 2020 Jun 25;18:1704-1721. doi: 10.1016/j.csbj.2020.06.031. eCollection 2020.
8
Potential value and impact of data mining and machine learning in clinical diagnostics.数据挖掘和机器学习在临床诊断中的潜在价值和影响。
Crit Rev Clin Lab Sci. 2021 Jun;58(4):275-296. doi: 10.1080/10408363.2020.1857681. Epub 2021 Mar 19.
9
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
10
Environmental exposures in machine learning and data mining approaches to diabetes etiology: A scoping review.机器学习和数据挖掘方法在糖尿病病因学中的环境暴露:范围综述。
Artif Intell Med. 2023 Jan;135:102461. doi: 10.1016/j.artmed.2022.102461. Epub 2022 Nov 30.

引用本文的文献

1
Gut Microbiome Signatures in Multiple Sclerosis: A Case-Control Study with Machine Learning and Global Data Integration.多发性硬化症中的肠道微生物组特征:一项基于机器学习和全球数据整合的病例对照研究。
Biomedicines. 2025 Jul 23;13(8):1806. doi: 10.3390/biomedicines13081806.
2
Predicting cancer risk using machine learning on lifestyle and genetic data.利用机器学习对生活方式和基因数据进行癌症风险预测。
Sci Rep. 2025 Aug 19;15(1):30458. doi: 10.1038/s41598-025-15656-8.
3
When Health Behavior Meets Technology: Artificial Neural Network-Based Prediction of Breast Self-Examination Intentions Among Filipino Female College Students.

本文引用的文献

1
The adipogenic potential of Cr(III). A molecular approach exemplifying metal-induced enhancement of insulin mimesis in diabetes mellitus II.三价铬的成脂潜能。一种分子学方法,例证金属诱导的II型糖尿病中胰岛素模拟作用增强。
J Inorg Biochem. 2016 Oct;163:323-331. doi: 10.1016/j.jinorgbio.2016.07.015. Epub 2016 Jul 27.
2
Implications of Big Data Analytics on Population Health Management.大数据分析对人口健康管理的影响。
Big Data. 2013 Sep;1(3):152-9. doi: 10.1089/big.2013.0019. Epub 2013 Sep 5.
3
Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.
当健康行为遇上科技:基于人工神经网络对菲律宾女大学生乳房自我检查意愿的预测
J Cancer Educ. 2025 Aug 16. doi: 10.1007/s13187-025-02690-3.
4
Analysis and Mapping of Machine Learning in the Context of Diabetes.糖尿病背景下机器学习的分析与映射
Health Sci Rep. 2025 Aug 13;8(8):e71167. doi: 10.1002/hsr2.71167. eCollection 2025 Aug.
5
Mapping the landscape of machine learning in chronic disease management: A comprehensive bibliometric study.绘制机器学习在慢性病管理中的全景:一项全面的文献计量学研究。
Digit Health. 2025 Jul 23;11:20552076251361614. doi: 10.1177/20552076251361614. eCollection 2025 Jan-Dec.
6
How machine learning on real world clinical data improves adverse event recording for endoscopy.基于真实世界临床数据的机器学习如何改善内镜检查不良事件记录。
NPJ Digit Med. 2025 Jul 10;8(1):424. doi: 10.1038/s41746-025-01826-5.
7
Obesity Biomarkers: Exploring Factors, Ramification, Machine Learning, and AI-Unveiling Insights in Health Research.肥胖生物标志物:探索健康研究中的因素、影响、机器学习及人工智能——揭示见解
MedComm (2020). 2025 Jun 22;6(7):e70169. doi: 10.1002/mco2.70169. eCollection 2025 Jul.
8
Digital horizons in non-communicable disease care: a bibliometric exploration of intervention impact and innovation.非传染性疾病护理中的数字前沿:干预影响与创新的文献计量学探索
Front Digit Health. 2025 Jun 5;7:1528711. doi: 10.3389/fdgth.2025.1528711. eCollection 2025.
9
Advancing breast cancer prediction: Comparative analysis of ML models and deep learning-based multi-model ensembles on original and synthetic datasets.推进乳腺癌预测:基于原始数据集和合成数据集对机器学习模型与深度学习多模型集成的比较分析
PLoS One. 2025 Jun 18;20(6):e0326221. doi: 10.1371/journal.pone.0326221. eCollection 2025.
10
Cost-effectiveness of the 3E model in diabetes management: a machine learning approach to assess long-term economic impact.3E模型在糖尿病管理中的成本效益:一种评估长期经济影响的机器学习方法
Front Public Health. 2025 May 23;13:1571546. doi: 10.3389/fpubh.2025.1571546. eCollection 2025.
从索赔数据预测 2 型糖尿病的人群水平及危险因素分析。
Big Data. 2015 Dec;3(4):277-87. doi: 10.1089/big.2015.0020.
4
Non-invasive detection of fasting blood glucose level via electrochemical measurement of saliva.通过唾液的电化学测量无创检测空腹血糖水平。
Springerplus. 2016 May 23;5(1):701. doi: 10.1186/s40064-016-2339-6. eCollection 2016.
5
A Predictive Metabolic Signature for the Transition From Gestational Diabetes Mellitus to Type 2 Diabetes.从妊娠期糖尿病转变为2型糖尿病的预测性代谢特征
Diabetes. 2016 Sep;65(9):2529-39. doi: 10.2337/db15-1720. Epub 2016 Jun 23.
6
Data analytics identify glycated haemoglobin co-markers for type 2 diabetes mellitus diagnosis.数据分析识别出用于2型糖尿病诊断的糖化血红蛋白共同标志物。
Comput Biol Med. 2016 Aug 1;75:90-7. doi: 10.1016/j.compbiomed.2016.05.005. Epub 2016 May 13.
7
Using Random Forest Models to Identify Correlates of a Diabetic Peripheral Neuropathy Diagnosis from Electronic Health Record Data.使用随机森林模型从电子健康记录数据中识别糖尿病周围神经病变诊断的相关因素。
Pain Med. 2017 Jan 1;18(1):107-115. doi: 10.1093/pm/pnw096.
8
Learning statistical models of phenotypes using noisy labeled training data.使用带有噪声标签的训练数据学习表型的统计模型。
J Am Med Inform Assoc. 2016 Nov;23(6):1166-1173. doi: 10.1093/jamia/ocw028. Epub 2016 May 12.
9
Type 2 Diabetes Mellitus Trajectories and Associated Risks.2型糖尿病病程及相关风险
Big Data. 2016 Mar 1;4(1):25-30. doi: 10.1089/big.2015.0029.
10
A flexible data-driven comorbidity feature extraction framework.一个灵活的数据驱动的共病特征提取框架。
Comput Biol Med. 2016 Jun 1;73:165-72. doi: 10.1016/j.compbiomed.2016.04.014. Epub 2016 Apr 20.