• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习和可解释人工智能技术进行糖尿病预测。

Diabetes prediction using machine learning and explainable AI techniques.

作者信息

Tasin Isfafuzzaman, Nabil Tansin Ullah, Islam Sanjida, Khan Riasat

机构信息

Electrical and Computer Engineering North South University Dhaka Bangladesh.

出版信息

Healthc Technol Lett. 2022 Dec 14;10(1-2):1-10. doi: 10.1049/htl2.12039. eCollection 2023 Feb-Apr.

DOI:10.1049/htl2.12039
PMID:37077883
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10107388/
Abstract

Globally, diabetes affects 537 million people, making it the deadliest and the most common non-communicable disease. Many factors can cause a person to get affected by diabetes, like excessive body weight, abnormal cholesterol level, family history, physical inactivity, bad food habit etc. Increased urination is one of the most common symptoms of this disease. People with diabetes for a long time can get several complications like heart disorder, kidney disease, nerve damage, diabetic retinopathy etc. But its risk can be reduced if it is predicted early. In this paper, an automatic diabetes prediction system has been developed using a private dataset of female patients in Bangladesh and various machine learning techniques. The authors used the Pima Indian diabetes dataset and collected additional samples from 203 individuals from a local textile factory in Bangladesh. Feature selection algorithm mutual information has been applied in this work. A semi-supervised model with extreme gradient boosting has been utilized to predict the insulin features of the private dataset. SMOTE and ADASYN approaches have been employed to manage the class imbalance problem. The authors used machine learning classification methods, that is, decision tree, SVM, Random Forest, Logistic Regression, KNN, and various ensemble techniques, to determine which algorithm produces the best prediction results. After training on and testing all the classification models, the proposed system provided the best result in the XGBoost classifier with the ADASYN approach with 81% accuracy, 0.81 F1 coefficient and AUC of 0.84. Furthermore, the domain adaptation method has been implemented to demonstrate the versatility of the proposed system. The explainable AI approach with LIME and SHAP frameworks is implemented to understand how the model predicts the final results. Finally, a website framework and an Android smartphone application have been developed to input various features and predict diabetes instantaneously. The private dataset of female Bangladeshi patients and programming codes are available at the following link: https://github.com/tansin-nabil/Diabetes-Prediction-Using-Machine-Learning.

摘要

在全球范围内,糖尿病影响着5.37亿人,使其成为最致命且最常见的非传染性疾病。许多因素可导致人们患上糖尿病,如体重超标、胆固醇水平异常、家族病史、缺乏身体活动、不良饮食习惯等。多尿是这种疾病最常见的症状之一。长期患糖尿病的人会出现多种并发症,如心脏病、肾病、神经损伤、糖尿病视网膜病变等。但如果能早期预测,其风险是可以降低的。在本文中,利用孟加拉国女性患者的一个私有数据集和各种机器学习技术,开发了一个自动糖尿病预测系统。作者使用了皮马印第安人糖尿病数据集,并从孟加拉国当地一家纺织厂的203名个体中收集了额外样本。在这项工作中应用了特征选择算法互信息。利用带有极端梯度提升的半监督模型来预测私有数据集的胰岛素特征。采用了SMOTE和ADASYN方法来处理类别不平衡问题。作者使用机器学习分类方法,即决策树、支持向量机、随机森林、逻辑回归、K近邻,以及各种集成技术,来确定哪种算法能产生最佳预测结果。在对所有分类模型进行训练和测试后,所提出的系统在采用ADASYN方法的XGBoost分类器中取得了最佳结果,准确率为81%,F1系数为0.81,AUC为0.84。此外,还实施了域适应方法以证明所提出系统的通用性。采用带有LIME和SHAP框架的可解释人工智能方法来理解模型如何预测最终结果。最后,开发了一个网站框架和一个安卓智能手机应用程序,用于输入各种特征并即时预测糖尿病。孟加拉国女性患者的私有数据集和编程代码可在以下链接获取:https://github.com/tansin-nabil/Diabetes-Prediction-Using-Machine-Learning 。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/231345a87c53/HTL2-10-1-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/745f178628d2/HTL2-10-1-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/c9301f787885/HTL2-10-1-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/7ea443404d11/HTL2-10-1-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/1b2ca5486e56/HTL2-10-1-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/e55ed2bfd426/HTL2-10-1-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/02072cce1300/HTL2-10-1-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/c0d0c37beb08/HTL2-10-1-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/569c6a49de4f/HTL2-10-1-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/1ad6c314d3b1/HTL2-10-1-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/6134ff7a8214/HTL2-10-1-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/17e23d76a239/HTL2-10-1-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/9187c049ddcb/HTL2-10-1-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/231345a87c53/HTL2-10-1-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/745f178628d2/HTL2-10-1-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/c9301f787885/HTL2-10-1-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/7ea443404d11/HTL2-10-1-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/1b2ca5486e56/HTL2-10-1-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/e55ed2bfd426/HTL2-10-1-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/02072cce1300/HTL2-10-1-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/c0d0c37beb08/HTL2-10-1-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/569c6a49de4f/HTL2-10-1-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/1ad6c314d3b1/HTL2-10-1-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/6134ff7a8214/HTL2-10-1-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/17e23d76a239/HTL2-10-1-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/9187c049ddcb/HTL2-10-1-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a37/10107388/231345a87c53/HTL2-10-1-g009.jpg

相似文献

1
Diabetes prediction using machine learning and explainable AI techniques.使用机器学习和可解释人工智能技术进行糖尿病预测。
Healthc Technol Lett. 2022 Dec 14;10(1-2):1-10. doi: 10.1049/htl2.12039. eCollection 2023 Feb-Apr.
2
A proposed technique for predicting heart disease using machine learning algorithms and an explainable AI method.一种使用机器学习算法和可解释人工智能方法预测心脏病的建议技术。
Sci Rep. 2024 Oct 7;14(1):23277. doi: 10.1038/s41598-024-74656-2.
3
Polycystic Ovary Syndrome Detection Machine Learning Model Based on Optimized Feature Selection and Explainable Artificial Intelligence.基于优化特征选择和可解释人工智能的多囊卵巢综合征检测机器学习模型
Diagnostics (Basel). 2023 Apr 21;13(8):1506. doi: 10.3390/diagnostics13081506.
4
Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches.基于生活方式数据,运用机器学习方法对2型糖尿病进行性能分析与预测。
J Diabetes Metab Disord. 2022 Mar 14;21(1):339-352. doi: 10.1007/s40200-022-00981-w. eCollection 2022 Jun.
5
An Ensemble Approach for the Prediction of Diabetes Mellitus Using a Soft Voting Classifier with an Explainable AI.基于软投票分类器和可解释 AI 的糖尿病预测集成方法。
Sensors (Basel). 2022 Sep 25;22(19):7268. doi: 10.3390/s22197268.
6
Application of machine learning techniques for predicting survival in ovarian cancer.机器学习技术在卵巢癌生存预测中的应用。
BMC Med Inform Decis Mak. 2022 Dec 30;22(1):345. doi: 10.1186/s12911-022-02087-y.
7
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
8
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
9
Explainable machine learning models based on multimodal time-series data for the early detection of Parkinson's disease.基于多模态时间序列数据的可解释机器学习模型用于帕金森病的早期检测。
Comput Methods Programs Biomed. 2023 Jun;234:107495. doi: 10.1016/j.cmpb.2023.107495. Epub 2023 Mar 23.
10
A machine learning classifier-based approach for diabetes mellitus risk prediction.一种基于机器学习分类器的糖尿病风险预测方法。
Biomed Phys Eng Express. 2024 Oct 10. doi: 10.1088/2057-1976/ad857b.

引用本文的文献

1
Enhanced diabetes prediction using skip-gated recurrent unit with gradient clipping approach.使用带梯度裁剪方法的跳跃门控循环单元增强糖尿病预测
Front Endocrinol (Lausanne). 2025 Aug 26;16:1601883. doi: 10.3389/fendo.2025.1601883. eCollection 2025.
2
Evaluation of inflammatory bowel disease-related sleep disorders based on an interpretable machine learning approach: a multicenter study in China.基于可解释机器学习方法的炎症性肠病相关睡眠障碍评估:一项中国多中心研究
Therap Adv Gastroenterol. 2025 Aug 15;18:17562848251359141. doi: 10.1177/17562848251359141. eCollection 2025.
3
A Comprehensive Review of Explainable Artificial Intelligence (XAI) in Computer Vision.

本文引用的文献

1
Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective.从数据预处理和机器学习角度看糖尿病的预测与诊断
Comput Methods Programs Biomed. 2022 Jun;220:106773. doi: 10.1016/j.cmpb.2022.106773. Epub 2022 Mar 31.
2
Retrospective validation of a machine learning clinical decision support tool for myocardial infarction risk stratification.用于心肌梗死风险分层的机器学习临床决策支持工具的回顾性验证
Healthc Technol Lett. 2021 Aug 31;8(6):139-147. doi: 10.1049/htl2.12017. eCollection 2021 Dec.
3
A remote healthcare monitoring framework for diabetes prediction using machine learning.
计算机视觉中可解释人工智能(XAI)的全面综述
Sensors (Basel). 2025 Jul 4;25(13):4166. doi: 10.3390/s25134166.
4
Explainable AI in early autism detection: a literature review of interpretable machine learning approaches.早期自闭症检测中的可解释人工智能:可解释机器学习方法的文献综述
Discov Ment Health. 2025 Jul 1;5(1):98. doi: 10.1007/s44192-025-00232-3.
5
The role of explainable artificial intelligence in disease prediction: a systematic literature review and future research directions.可解释人工智能在疾病预测中的作用:系统文献综述与未来研究方向
BMC Med Inform Decis Mak. 2025 Mar 4;25(1):110. doi: 10.1186/s12911-025-02944-6.
6
Robust predictive framework for diabetes classification using optimized machine learning on imbalanced datasets.使用优化的机器学习方法对不平衡数据集进行糖尿病分类的稳健预测框架。
Front Artif Intell. 2025 Jan 7;7:1499530. doi: 10.3389/frai.2024.1499530. eCollection 2024.
7
Construction of a health literacy prediction model for diabetic patients: A multicenter study.糖尿病患者健康素养预测模型的构建:一项多中心研究。
Digit Health. 2025 Jan 9;11:20552076241311735. doi: 10.1177/20552076241311735. eCollection 2025 Jan-Dec.
8
Machine Learning Algorithm-Based Prediction of Diabetes Among Female Population Using PIMA Dataset.基于机器学习算法利用皮马印第安人糖尿病数据集对女性人群糖尿病的预测
Healthcare (Basel). 2024 Dec 29;13(1):37. doi: 10.3390/healthcare13010037.
9
Leveraging Shapley Additive Explanations for Feature Selection in Ensemble Models for Diabetes Prediction.利用夏普利加性解释进行糖尿病预测集成模型中的特征选择。
Bioengineering (Basel). 2024 Nov 30;11(12):1215. doi: 10.3390/bioengineering11121215.
10
An automated approach to predict diabetic patients using KNN imputation and effective data mining techniques.一种使用 KNN 插补和有效数据挖掘技术预测糖尿病患者的自动化方法。
BMC Med Res Methodol. 2024 Sep 27;24(1):221. doi: 10.1186/s12874-024-02324-0.
一种使用机器学习进行糖尿病预测的远程医疗监测框架。
Healthc Technol Lett. 2021 May 2;8(3):45-57. doi: 10.1049/htl2.12010. eCollection 2021 Jun.
4
Prediction of Type 2 Diabetes Based on Machine Learning Algorithm.基于机器学习算法的 2 型糖尿病预测。
Int J Environ Res Public Health. 2021 Mar 23;18(6):3317. doi: 10.3390/ijerph18063317.
5
Prevalence of diabetes and pre-diabetes in Bangladesh: a systematic review and meta-analysis.孟加拉国糖尿病和糖尿病前期的患病率:一项系统评价与荟萃分析。
BMJ Open. 2020 Sep 9;10(9):e036086. doi: 10.1136/bmjopen-2019-036086.
6
Regression multiple imputation for missing data analysis.用于缺失数据分析的回归多重填补
Stat Methods Med Res. 2020 Sep;29(9):2647-2664. doi: 10.1177/0962280220908613. Epub 2020 Mar 4.
7
Complications of Diabetes.糖尿病的并发症
J Diabetes Res. 2015;2015:189525. doi: 10.1155/2015/189525. Epub 2015 Jul 12.
8
Diabetes mellitus: The epidemic of the century.糖尿病:本世纪的流行病。
World J Diabetes. 2015 Jun 25;6(6):850-67. doi: 10.4239/wjd.v6.i6.850.
9
Risk factors contributing to type 2 diabetes and recent advances in the treatment and prevention.2型糖尿病的危险因素及治疗与预防的最新进展。
Int J Med Sci. 2014 Sep 6;11(11):1185-200. doi: 10.7150/ijms.10001. eCollection 2014.