• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用 KNN 插补和有效数据挖掘技术预测糖尿病患者的自动化方法。

An automated approach to predict diabetic patients using KNN imputation and effective data mining techniques.

机构信息

College of Computer Science and Engineering, University of Hafr Al-Batin, Hafr Al-Batin, 39524, Saudi Arabia.

Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, 11671, Saudi Arabia.

出版信息

BMC Med Res Methodol. 2024 Sep 27;24(1):221. doi: 10.1186/s12874-024-02324-0.

DOI:10.1186/s12874-024-02324-0
PMID:39333904
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438170/
Abstract

Diabetes is thought to be the most common illness in underdeveloped nations. Early detection and competent medical care are crucial steps in reducing the effects of diabetes. Examining the signs associated with diabetes is one of the most effective ways to identify the condition. The problem of missing data is not very well investigated in existing works. In addition, existing studies on diabetes detection lack accuracy and robustness. The available datasets frequently contain missing information for the automated detection of diabetes, which might negatively impact machine learning model performance. This work suggests an automated diabetes prediction method that achieves high accuracy and effectively manages missing variables in order to address this problem. The proposed strategy employs a stacked ensemble voting classifier model with three machine learning models. and a KNN Imputer to handle missing values. Using the KNN imputer, the suggested model performs exceptionally well, with accuracy, precision, recall, F1 score, and MCC of 98.59%, 99.26%, 99.75%, 99.45%, and 99.24%, respectively. In two scenarios one with missing values eliminated and the other with KNN imputer, the study thoroughly compared the suggested model with seven other machine learning techniques. The outcomes demonstrate the superiority of the suggested model over current state-of-the-art methods and confirm its efficacy. This work demonstrates the capability of KNN imputer and looks at the problem of missing values for diabetes detection. Medical professionals can utilize the results to improve care for diabetes patients and discover problems early.

摘要

糖尿病被认为是欠发达国家最常见的疾病。早期发现和专业的医疗护理是减少糖尿病影响的关键步骤。检查与糖尿病相关的症状是识别这种疾病的最有效方法之一。在现有工作中,对缺失数据的问题研究得还不够充分。此外,现有的糖尿病检测研究缺乏准确性和稳健性。现有的数据集经常包含用于糖尿病自动检测的缺失信息,这可能会对机器学习模型的性能产生负面影响。这项工作提出了一种自动化的糖尿病预测方法,该方法可以实现高精度,并有效地管理缺失变量,以解决这个问题。所提出的策略采用了具有三个机器学习模型的堆叠集成投票分类器模型和 KNN 插补器来处理缺失值。使用 KNN 插补器,所提出的模型表现出色,准确率、精度、召回率、F1 得分和 MCC 分别为 98.59%、99.26%、99.75%、99.45%和 99.24%。在两种情况下,一种是消除缺失值,另一种是使用 KNN 插补器,研究彻底比较了所提出的模型与其他七种机器学习技术。结果表明,所提出的模型优于当前最先进的方法,并证实了其有效性。这项工作展示了 KNN 插补器的能力,并研究了糖尿病检测中缺失值的问题。医疗专业人员可以利用这些结果来改善对糖尿病患者的护理,并及早发现问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/c41f28605c89/12874_2024_2324_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/d2299ee15903/12874_2024_2324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/dfbbb5c22c24/12874_2024_2324_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/016fd1956001/12874_2024_2324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/e0d9a92ebee6/12874_2024_2324_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/21c9b798cc1c/12874_2024_2324_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/c41f28605c89/12874_2024_2324_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/d2299ee15903/12874_2024_2324_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/dfbbb5c22c24/12874_2024_2324_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/016fd1956001/12874_2024_2324_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/e0d9a92ebee6/12874_2024_2324_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/21c9b798cc1c/12874_2024_2324_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3ea/11438170/c41f28605c89/12874_2024_2324_Fig5_HTML.jpg

相似文献

1
An automated approach to predict diabetic patients using KNN imputation and effective data mining techniques.一种使用 KNN 插补和有效数据挖掘技术预测糖尿病患者的自动化方法。
BMC Med Res Methodol. 2024 Sep 27;24(1):221. doi: 10.1186/s12874-024-02324-0.
2
Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel.使用K近邻插补器和堆叠集成学习模型进行宫颈癌检测。
Digit Health. 2023 Oct 3;9:20552076231203802. doi: 10.1177/20552076231203802. eCollection 2023 Jan-Dec.
3
Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach.使用K近邻插补合成少数过采样技术特征和多模型集成学习方法改善宫颈癌预测
Cancers (Basel). 2023 Sep 4;15(17):4412. doi: 10.3390/cancers15174412.
4
Improving prediction of cervical cancer using KNN imputer and multi-model ensemble learning.使用 KNN 插补器和多模型集成学习提高宫颈癌预测。
PLoS One. 2024 Jan 3;19(1):e0295632. doi: 10.1371/journal.pone.0295632. eCollection 2024.
5
Novel ensemble learning approach with SVM-imputed ADASYN features for enhanced cervical cancer prediction.基于支持向量机插补 ADASYN 特征的新型集成学习方法,用于增强宫颈癌预测。
PLoS One. 2024 Jan 10;19(1):e0296107. doi: 10.1371/journal.pone.0296107. eCollection 2024.
6
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
7
Improving mortality prediction in Acute Pancreatitis by machine learning and data augmentation.通过机器学习和数据增强提高急性胰腺炎的死亡率预测。
Comput Biol Med. 2022 Nov;150:106077. doi: 10.1016/j.compbiomed.2022.106077. Epub 2022 Sep 11.
8
An intelligent warning model for early prediction of cardiac arrest in sepsis patients.脓毒症患者心脏骤停早期预测的智能预警模型。
Comput Methods Programs Biomed. 2019 Sep;178:47-58. doi: 10.1016/j.cmpb.2019.06.010. Epub 2019 Jun 11.
9
Large scale biomedical texts classification: a kNN and an ESA-based approaches.大规模生物医学文本分类:基于k近邻算法和基于词嵌入语义分析的方法。
J Biomed Semantics. 2016 Jun 16;7:40. doi: 10.1186/s13326-016-0073-1.
10
An Ensemble Approach for the Prediction of Diabetes Mellitus Using a Soft Voting Classifier with an Explainable AI.基于软投票分类器和可解释 AI 的糖尿病预测集成方法。
Sensors (Basel). 2022 Sep 25;22(19):7268. doi: 10.3390/s22197268.

引用本文的文献

1
Machine Learning-Driven Insights in Cancer Metabolomics: From Subtyping to Biomarker Discovery and Prognostic Modeling.机器学习驱动的癌症代谢组学见解:从亚型分类到生物标志物发现与预后建模
Metabolites. 2025 Aug 1;15(8):514. doi: 10.3390/metabo15080514.
2
Postpartum depression risk prediction using explainable machine learning algorithms.使用可解释机器学习算法预测产后抑郁风险
Front Med (Lausanne). 2025 Aug 7;12:1565374. doi: 10.3389/fmed.2025.1565374. eCollection 2025.
3
Explainable AI for Chronic Kidney Disease Prediction in Medical IoT: Integrating GANs and Few-Shot Learning.

本文引用的文献

1
An ensemble learning approach for diabetes prediction using boosting techniques.一种使用提升技术进行糖尿病预测的集成学习方法。
Front Genet. 2023 Oct 26;14:1252159. doi: 10.3389/fgene.2023.1252159. eCollection 2023.
2
The imperative for regulatory oversight of large language models (or generative AI) in healthcare.对医疗保健领域的大语言模型(或生成式人工智能)进行监管监督的必要性。
NPJ Digit Med. 2023 Jul 6;6(1):120. doi: 10.1038/s41746-023-00873-0.
3
Diabetes prediction using machine learning and explainable AI techniques.使用机器学习和可解释人工智能技术进行糖尿病预测。
医疗物联网中用于慢性肾脏病预测的可解释人工智能:整合生成对抗网络和少样本学习
Bioengineering (Basel). 2025 Mar 29;12(4):356. doi: 10.3390/bioengineering12040356.
Healthc Technol Lett. 2022 Dec 14;10(1-2):1-10. doi: 10.1049/htl2.12039. eCollection 2023 Feb-Apr.
4
Early Prediction of Diabetes Using an Ensemble of Machine Learning Models.使用机器学习模型集成进行糖尿病早期预测。
Int J Environ Res Public Health. 2022 Sep 28;19(19):12378. doi: 10.3390/ijerph191912378.
5
An Ensemble Approach to Predict Early-Stage Diabetes Risk Using Machine Learning: An Empirical Study.基于机器学习的早期糖尿病风险预测的集成方法:一项实证研究。
Sensors (Basel). 2022 Jul 13;22(14):5247. doi: 10.3390/s22145247.
6
Continuous Glucose Monitoring in Healthy Adults-Possible Applications in Health Care, Wellness, and Sports.健康成年人的连续血糖监测——在医疗保健、健康和运动中的可能应用。
Sensors (Basel). 2022 Mar 5;22(5):2030. doi: 10.3390/s22052030.
7
A CNN based coronavirus disease prediction system for chest X-rays.一种基于卷积神经网络的胸部X光片冠状病毒病预测系统。
J Ambient Intell Humaniz Comput. 2022 Feb 27:1-15. doi: 10.1007/s12652-022-03775-3.
8
Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications.基于机器学习的医疗保健应用中的糖尿病分类和预测。
J Healthc Eng. 2021 Sep 29;2021:9930985. doi: 10.1155/2021/9930985. eCollection 2021.
9
Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients.深度迁移学习和数据增强改善2型糖尿病患者的血糖水平预测。
NPJ Digit Med. 2021 Jul 14;4(1):109. doi: 10.1038/s41746-021-00480-x.
10
Prediction of Type 2 Diabetes Based on Machine Learning Algorithm.基于机器学习算法的 2 型糖尿病预测。
Int J Environ Res Public Health. 2021 Mar 23;18(6):3317. doi: 10.3390/ijerph18063317.