B Jai Kumar, Ranganathan Mohanasundaram
SCHOOL OF COMPUTER SCIENCE AND ENGINEERING, Vellore Institute of Technology, VELLORE INSTITUTE OF TECHNOLOGY, VELLORE, Vellore, Tamil Nadu, 632014, INDIA.
SCHOOL OF COMPUTER SCIENCE AND ENGINEERING, Vellore Institute of Technology, VELLORE INSTITUTE OF TECHNOLOGY, VELLORE, Vellore, 632014, INDIA.
Biomed Phys Eng Express. 2024 Oct 10. doi: 10.1088/2057-1976/ad857b.
Currently, Diabetes Mellitus (DM) can be life-threatening due to the dietary habits and lifestyle choices of individuals. Diabetes is characterised by elevated levels of glucose in the blood and an excess of protein in the blood. Poor eating habits and lifestyles are largely responsible for the rise in overweight, obesity, and various related conditions. This study investigated many diabetes-related risk forecasting techniques and algorithms. The eight machine learning (ML) algorithms used the diabetes dataset to test various prediction techniques, including a Support Vector Classifier, gradient-boosting, multilayer perceptron, random forest, K-nearest neighbors, logistic regression, extreme gradient boosting, and decision tree. To enhance the diabetic prediction ability of the model, we suggested using Feature Engineering (FE) and feature scaling. For our investigation, we utilized the Mendeley dataset on diabetes to assess the capacity of the model to predict diabetes. We developed a model by using Python programming and eight classification techniques. The Random Forest with 99.21%, Gradient Boosting with 99.61%, Extreme Gradient Boosting, and Decision Tree achieved the highest F1 score (99.81%), accuracy rate (99.80%), precision (99.81%), and recall (99.81%) of all classification approaches.
目前,由于个人的饮食习惯和生活方式选择,糖尿病(DM)可能会危及生命。糖尿病的特征是血液中葡萄糖水平升高和血液中蛋白质过量。不良的饮食习惯和生活方式在很大程度上导致了超重、肥胖及各种相关病症的增加。本研究调查了许多与糖尿病相关的风险预测技术和算法。八种机器学习(ML)算法使用糖尿病数据集来测试各种预测技术,包括支持向量分类器、梯度提升、多层感知器、随机森林、K近邻、逻辑回归、极端梯度提升和决策树。为了提高模型的糖尿病预测能力,我们建议使用特征工程(FE)和特征缩放。在我们的调查中,我们利用了门捷列夫糖尿病数据集来评估模型预测糖尿病的能力。我们使用Python编程和八种分类技术开发了一个模型。在所有分类方法中,随机森林的F1分数最高(99.21%),梯度提升为99.61%,极端梯度提升和决策树的F1分数(99.81%)、准确率(99.80%)、精确率(99.81%)和召回率(99.81%)最高。