A Novel Approach for Feature Selection and Classification of Diabetes Mellitus: Machine Learning Methods.

Affiliations

CSE Department, Gautam Buddha University, Greater Noida, India.

Cedargate Technologies, Kathmandu, Nepal.

Publication information

Comput Intell Neurosci. 2022 Apr 15;2022:3820360. doi: 10.1155/2022/3820360. eCollection 2022.

DOI:10.1155/2022/3820360

PMID:35463255

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9033325/

Abstract

Diabetes prediction is an active research area in which medical experts are trying to predict the disease more accurately. Surveys conducted by the WHO have shown a remarkable increase in the number of diabetic patients. Diabetes generally remains dormant and aggravates other conditions a patient may be diagnosed with, such as damage to the kidney vessels, problems in the retina of the eye, and cardiac problems; if unidentified, it can cause metabolic disorders and many complications in the body. The main objective of our study is to compare different classifiers and feature selection methods in order to predict diabetes with greater accuracy. In this paper, we studied multilayer perceptron, decision tree, K-nearest neighbour, and random forest classifiers, and applied a few feature selection techniques to the classifiers to detect diabetes at an early stage. The raw data were preprocessed by removing outliers and imputing missing values with the mean, followed by hyperparameter optimization. Experiments were conducted on the PIMA Indians diabetes dataset using Weka 3.9, and the accuracy achieved for multilayer perceptron is 77.60%, for decision trees is 76.07%, for K-nearest neighbour is 78.58%, and for random forest is , which is by far the best accuracy for random forest classifier.
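The preprocessing step named in the abstract (outlier removal plus mean imputation) can be illustrated with a minimal Python sketch. This is not the authors' Weka workflow. The abstract does not say how outliers were detected or in which order the two steps ran, so the interquartile-range (IQR) fence, the choice to drop outliers before imputing, the `None` encoding of missing values, the toy rows, and all function names below are assumptions made purely for illustration.

```python
from statistics import mean, quantiles


def impute_mean(column):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in column]


def iqr_bounds(observed, k=1.5):
    """Return (low, high) fences placed k * IQR beyond the quartiles.

    The IQR rule is an assumed outlier criterion, not one taken from the paper.
    """
    q1, _, q3 = quantiles(observed, n=4)
    spread = q3 - q1
    return q1 - k * spread, q3 + k * spread


def drop_outlier_rows(rows, k=1.5):
    """Keep rows whose observed values all lie inside their column's fences."""
    bounds = []
    for col in zip(*rows):
        bounds.append(iqr_bounds([v for v in col if v is not None], k))
    return [
        row
        for row in rows
        if all(v is None or lo <= v <= hi for v, (lo, hi) in zip(row, bounds))
    ]


def preprocess(rows, k=1.5):
    """Drop outlier rows first, then mean-impute what is still missing."""
    kept = drop_outlier_rows(rows, k)
    columns = [impute_mean(list(col)) for col in zip(*kept)]
    return [tuple(values) for values in zip(*columns)]


# Hypothetical toy rows of (glucose, bmi); None marks a missing measurement.
raw = [
    (148, 33.6),
    (85, 26.6),
    (None, 23.3),
    (89, 28.1),
    (137, 43.1),
    (116, 25.6),
    (900, 31.0),  # implausible glucose value, expected to be dropped
]

cleaned = preprocess(raw)
```

Dropping outliers before imputing keeps an extreme value such as the 900 above from skewing the column mean used as the fill value; the reverse order would be an equally valid reading of the abstract.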


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55b5/9033325/af9a615bab9c/CIN2022-3820360.001.jpg

Similar articles

A Novel Approach for Feature Selection and Classification of Diabetes Mellitus: Machine Learning Methods.

Comput Intell Neurosci. 2022 Apr 15;2022:3820360. doi: 10.1155/2022/3820360. eCollection 2022.

Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective.

Comput Methods Programs Biomed. 2022 Jun;220:106773. doi: 10.1016/j.cmpb.2022.106773. Epub 2022 Mar 31.

Accurate Diabetes Risk Stratification Using Machine Learning: Role of Missing Value and Outliers.

J Med Syst. 2018 Apr 10;42(5):92. doi: 10.1007/s10916-018-0940-7.

Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications.

J Healthc Eng. 2021 Sep 29;2021:9930985. doi: 10.1155/2021/9930985. eCollection 2021.

Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.

BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.

Early Prediction of Diabetes Using an Ensemble of Machine Learning Models.

Int J Environ Res Public Health. 2022 Sep 28;19(19):12378. doi: 10.3390/ijerph191912378.

Evaluation of Machine Learning Techniques for Traffic Flow-Based Intrusion Detection.

Sensors (Basel). 2022 Nov 30;22(23):9326. doi: 10.3390/s22239326.

KFPredict: An ensemble learning prediction framework for diabetes based on fusion of key features.

Comput Methods Programs Biomed. 2023 Apr;231:107378. doi: 10.1016/j.cmpb.2023.107378. Epub 2023 Jan 26.

Effectively Predicting the Presence of Coronary Heart Disease Using Machine Learning Classifiers.

Sensors (Basel). 2022 Sep 23;22(19):7227. doi: 10.3390/s22197227.

Diabetes disease detection and classification on Indian demographic and health survey data using machine learning methods.

Diabetes Metab Syndr. 2023 Jan;17(1):102690. doi: 10.1016/j.dsx.2022.102690. Epub 2022 Dec 5.

Cited by

Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence.

Int J Mol Sci. 2024 May 14;25(10):5331. doi: 10.3390/ijms25105331.

Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach.

Front Endocrinol (Lausanne). 2024 Mar 18;15:1376220. doi: 10.3389/fendo.2024.1376220. eCollection 2024.

Prediction of Diabetes Using Data Mining and Machine Learning Algorithms: A Cross-Sectional Study.

Healthc Inform Res. 2024 Jan;30(1):73-82. doi: 10.4258/hir.2024.30.1.73. Epub 2024 Jan 31.

Optimizing diabetes classification with a machine learning-based framework.

BMC Bioinformatics. 2023 Nov 13;24(1):428. doi: 10.1186/s12859-023-05467-x.

Application of Machine Learning Models for Early Detection and Accurate Classification of Type 2 Diabetes.

Diagnostics (Basel). 2023 Jul 15;13(14):2383. doi: 10.3390/diagnostics13142383.

Integrative analysis of Mendelian randomization and gene expression profiles reveals a null causal relationship between adiponectin and diabetic retinopathy.

Adipocyte. 2023 Dec;12(1):2234522. doi: 10.1080/21623945.2023.2234522.

Glycation-Associated Diabetic Nephropathy and the Role of Long Noncoding RNAs.

Biomedicines. 2022 Oct 19;10(10):2623. doi: 10.3390/biomedicines10102623.

An Ensemble Approach to Predict Early-Stage Diabetes Risk Using Machine Learning: An Empirical Study.

Sensors (Basel). 2022 Jul 13;22(14):5247. doi: 10.3390/s22145247.
