• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

糖尿病风险的不平衡分类预测分析。

Predictive Analysis of Diabetes-Risk with Class Imbalance.

机构信息

Information Systems Department, Arab Academy for Science and Technology -AASTMT, Cairo, Egypt.

Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia.

出版信息

Comput Intell Neurosci. 2022 Oct 11;2022:3078025. doi: 10.1155/2022/3078025. eCollection 2022.

DOI:10.1155/2022/3078025
PMID:36268149
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9578843/
Abstract

Diabetes type 2 (T2DM) is a common chronic disease, increasingly leading to many complications and affecting vital organs. Hyperglycemia is the main characteristic caused by insufficient insulin secretion and poses a serious risk to human health. The objective is to construct a type-2 diabetes prediction model with high classification accuracy. Advanced machine learning and predictive model techniques are utilized to achieve cutting-edge techniques for the early diagnosis of diabetes. This paper proposes an efficient performance model to predict and classify the minority class of type-2 diabetes. The impact of oversampling and undersampling approaches to reduce the effect of an unbalanced clas has been compared to classification performance algorithms. Synthetic Minority Oversampling (SMOTE) and Tomek-links techniques are applied and examined. The outcomes were then compared to the original unbalanced dataset using an artificial neural network (ANN) predictive model. The model is compared with other state-of-the-art classifiers such as support vector machine (SVM), random forest (RF), and decision tree (DT). The tuned model had the best accuracy of 92.2%. The experimental findings clearly manifest the improvement in accuracy and evaluation metrics in terms of AUC and F1-measure using the SMOTE oversampling strategy rather than the baseline and undersampling schemes. The study recommends adopting dynamic hyperparameter optimization to further improve accuracy.

摘要

2 型糖尿病(T2DM)是一种常见的慢性疾病,越来越多地导致许多并发症,并影响重要器官。胰岛素分泌不足导致的高血糖是其主要特征,对人类健康构成严重威胁。本研究旨在构建具有高分类准确性的 2 型糖尿病预测模型。先进的机器学习和预测模型技术被用于实现糖尿病早期诊断的前沿技术。本文提出了一种高效的性能模型,用于预测和分类 2 型糖尿病的少数类别。比较了过采样和欠采样方法对减少不平衡类影响的效果与分类性能算法。应用并检验了合成少数过采样(SMOTE)和 Tomelinks 技术。然后,使用人工神经网络(ANN)预测模型将结果与原始不平衡数据集进行比较。该模型与支持向量机(SVM)、随机森林(RF)和决策树(DT)等其他最先进的分类器进行了比较。调整后的模型的准确率最高,为 92.2%。实验结果清楚地表明,与基线和欠采样方案相比,使用 SMOTE 过采样策略在 AUC 和 F1 度量方面显著提高了准确性和评估指标。研究建议采用动态超参数优化来进一步提高准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/5fb82bce0732/CIN2022-3078025.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/ee54a0743040/CIN2022-3078025.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/5a45030584da/CIN2022-3078025.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/71f932d4393b/CIN2022-3078025.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/ca4209d0287f/CIN2022-3078025.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/b747caaf9b6e/CIN2022-3078025.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/fa687f105bfe/CIN2022-3078025.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/09fc37882146/CIN2022-3078025.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/3fda33dac9f8/CIN2022-3078025.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/6799dff1de20/CIN2022-3078025.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/73ecaa314c2e/CIN2022-3078025.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/05182f67df04/CIN2022-3078025.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/eba480eff338/CIN2022-3078025.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/1f33466949c8/CIN2022-3078025.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/a4d404c91446/CIN2022-3078025.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/54c07c3d759c/CIN2022-3078025.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/490996d2ec9b/CIN2022-3078025.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/5fb82bce0732/CIN2022-3078025.alg.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/ee54a0743040/CIN2022-3078025.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/5a45030584da/CIN2022-3078025.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/71f932d4393b/CIN2022-3078025.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/ca4209d0287f/CIN2022-3078025.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/b747caaf9b6e/CIN2022-3078025.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/fa687f105bfe/CIN2022-3078025.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/09fc37882146/CIN2022-3078025.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/3fda33dac9f8/CIN2022-3078025.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/6799dff1de20/CIN2022-3078025.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/73ecaa314c2e/CIN2022-3078025.010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/05182f67df04/CIN2022-3078025.011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/eba480eff338/CIN2022-3078025.012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/1f33466949c8/CIN2022-3078025.013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/a4d404c91446/CIN2022-3078025.014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/54c07c3d759c/CIN2022-3078025.015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/490996d2ec9b/CIN2022-3078025.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7558/9578843/5fb82bce0732/CIN2022-3078025.alg.002.jpg

相似文献

1
Predictive Analysis of Diabetes-Risk with Class Imbalance.糖尿病风险的不平衡分类预测分析。
Comput Intell Neurosci. 2022 Oct 11;2022:3078025. doi: 10.1155/2022/3078025. eCollection 2022.
2
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
3
Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective.从数据预处理和机器学习角度看糖尿病的预测与诊断
Comput Methods Programs Biomed. 2022 Jun;220:106773. doi: 10.1016/j.cmpb.2022.106773. Epub 2022 Mar 31.
4
Improvement of P300-Based Brain-Computer Interfaces for Home Appliances Control by Data Balancing Techniques.基于 P300 的脑机接口的数据均衡技术在家用电器控制中的改进。
Sensors (Basel). 2020 Sep 29;20(19):5576. doi: 10.3390/s20195576.
5
The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes.使用SMOTE过采样对三种分类器预测2型糖尿病性能的影响
Med Decis Making. 2016 Jan;36(1):137-44. doi: 10.1177/0272989X14560647. Epub 2014 Dec 1.
6
A novel stacking ensemble for detecting three types of diabetes mellitus using a Saudi Arabian dataset: Pre-diabetes, T1DM, and T2DM.一种利用沙特阿拉伯数据集检测三种类型糖尿病(前驱糖尿病、T1DM 和 T2DM)的新型堆叠集成方法。
Comput Biol Med. 2022 Aug;147:105757. doi: 10.1016/j.compbiomed.2022.105757. Epub 2022 Jun 20.
7
Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.使用健身数据比较机器学习技术预测全因死亡率:亨利福特锻炼测试(FIT)项目。
BMC Med Inform Decis Mak. 2017 Dec 19;17(1):174. doi: 10.1186/s12911-017-0566-6.
8
Application of machine learning algorithms in predicting HIV infection among men who have sex with men: Model development and validation.机器学习算法在预测男男性行为者中 HIV 感染中的应用:模型开发和验证。
Front Public Health. 2022 Aug 25;10:967681. doi: 10.3389/fpubh.2022.967681. eCollection 2022.
9
Enhancing and improving the performance of imbalanced class data using novel GBO and SSG: A comparative analysis.利用新型 GBO 和 SSG 增强和改进不平衡类数据的性能:比较分析。
Neural Netw. 2024 May;173:106157. doi: 10.1016/j.neunet.2024.106157. Epub 2024 Feb 2.
10
Predicting diabetes in adults: identifying important features in unbalanced data over a 5-year cohort study using machine learning algorithm.预测成年人糖尿病:使用机器学习算法在 5 年队列研究中识别不平衡数据中的重要特征。
BMC Med Res Methodol. 2024 Sep 27;24(1):220. doi: 10.1186/s12874-024-02341-z.

引用本文的文献

1
Risk-based evaluation of machine learning-based classification methods used for medical devices.用于医疗设备的基于机器学习的分类方法的基于风险的评估。
BMC Med Inform Decis Mak. 2025 Mar 11;25(1):126. doi: 10.1186/s12911-025-02909-9.

本文引用的文献

1
Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications.基于机器学习的医疗保健应用中的糖尿病分类和预测。
J Healthc Eng. 2021 Sep 29;2021:9930985. doi: 10.1155/2021/9930985. eCollection 2021.
2
Efficient treatment of outliers and class imbalance for diabetes prediction.高效处理糖尿病预测中的异常值和类别不平衡问题。
Artif Intell Med. 2020 Apr;104:101815. doi: 10.1016/j.artmed.2020.101815. Epub 2020 Feb 10.
3
Comparative Analysis of Classification Methods with PCA and LDA for Diabetes.用于糖尿病的主成分分析(PCA)和线性判别分析(LDA)分类方法的比较分析
Curr Diabetes Rev. 2020;16(8):833-850. doi: 10.2174/1573399816666200123124008.
4
A deep learning approach to adherence detection for type 2 diabetics.一种用于检测2型糖尿病患者依从性的深度学习方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:2896-2899. doi: 10.1109/EMBC.2017.8037462.
5
Comparative approaches for classification of diabetes mellitus data: Machine learning paradigm.糖尿病数据分类的比较方法:机器学习范例。
Comput Methods Programs Biomed. 2017 Dec;152:23-34. doi: 10.1016/j.cmpb.2017.09.004. Epub 2017 Sep 8.
6
Real-Time Non-Invasive Detection and Classification of Diabetes Using Modified Convolution Neural Network.使用改进卷积神经网络实时非侵入式检测和分类糖尿病
IEEE J Biomed Health Inform. 2018 Sep;22(5):1630-1636. doi: 10.1109/JBHI.2017.2757510. Epub 2017 Sep 28.