A stacked ensemble machine learning approach for the prediction of diabetes.

Authors

Oliullah Khondokar, Rasel Mahedi Hasan, Islam Md Manzurul, Islam Md Reazul, Wadud Md Anwar Hussen, Whaiduzzaman Md

Affiliations

Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh.

School of Information Systems, Queensland University of Technology, Brisbane, Australia.

Publication information

J Diabetes Metab Disord. 2023 Nov 22;23(1):603-617. doi: 10.1007/s40200-023-01321-2. eCollection 2024 Jun.

Abstract

OBJECTIVES

Diabetes has become a leading cause of mortality in both developed and developing countries, impacting a growing number of individuals worldwide. As the prevalence of the disease continues to rise, researchers have diligently worked towards developing accurate diabetes prediction models. The primary aim of this study is to utilize a diverse set of machine learning algorithms to detect the presence of diabetes, particularly in females, at an early stage. By leveraging these methods, this research seeks to provide physicians with valuable tools to identify the disease early, enabling timely interventions and improving patient outcomes.

METHODS

In this study, several state-of-the-art machine learning techniques were employed: a random forest classifier tuned with GridSearchCV, XGBoost, NGBoost, Bagging, LightGBM, and AdaBoost classifiers. These models were chosen as the base layer of our proposed stacked ensemble model because of their high accuracy. Before the data were fed into the models, the dataset was preprocessed to improve model performance.
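
The stacking idea described above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes scikit-learn is available, uses synthetic data from `make_classification` in place of the study's dataset, keeps only the three base learners that scikit-learn ships (random forest with `GridSearchCV`, AdaBoost, Bagging), and assumes a `StandardScaler` preprocessing step and a logistic-regression meta-learner, neither of which is specified in the abstract. The helper name `build_stacked_model` and all hyperparameter values are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    AdaBoostClassifier,
    BaggingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def build_stacked_model(random_state=0):
    """Return an unfitted pipeline: scaling followed by a stacking classifier."""
    # Random forest tuned over a small illustrative grid (assumed values).
    tuned_forest = GridSearchCV(
        RandomForestClassifier(random_state=random_state),
        param_grid={"n_estimators": [25, 50], "max_depth": [3, None]},
        cv=3,
    )
    base_learners = [
        ("rf", tuned_forest),
        ("ada", AdaBoostClassifier(n_estimators=50, random_state=random_state)),
        ("bag", BaggingClassifier(n_estimators=25, random_state=random_state)),
    ]
    # The meta-learner is trained on out-of-fold predictions of the base layer.
    stack = StackingClassifier(
        estimators=base_learners,
        final_estimator=LogisticRegression(max_iter=1000),  # assumed meta-learner
        cv=3,
    )
    # StandardScaler stands in for the unspecified preprocessing step.
    return make_pipeline(StandardScaler(), stack)


# Synthetic binary-classification data; a placeholder for a real dataset.
X, y = make_classification(
    n_samples=400, n_features=8, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

model = build_stacked_model()
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)
print(f"held-out accuracy on synthetic data: {accuracy:.3f}")
```

Gradient-boosting libraries such as XGBoost, LightGBM, and NGBoost could be appended to `base_learners` in the same way, provided their classifiers follow the scikit-learn estimator interface. The accuracy printed here reflects only the synthetic data and says nothing about the result reported in the study.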

RESULTS

The accuracy achieved in this study was 92.91%, which is competitive with existing approaches. Moreover, the use of SHapley Additive exPlanations (SHAP) facilitated the interpretation of the machine learning models.
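
To make the "additive" part of SHAP concrete, the sketch below computes exact Shapley values for a tiny hand-written scoring function. It uses only the Python standard library rather than the `shap` package, and it is not the analysis performed in the study. The function `toy_risk_score`, its coefficients, the feature names, and the baseline values are all hypothetical. Features outside a coalition are replaced by their baseline value, which is one common convention among several.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict at x, relative to a baseline input.

    Features that are not in a coalition take their baseline value.
    Cost grows as 2**n, so this is only practical for a handful of features.
    """
    n = len(x)
    features = range(n)

    def value(coalition):
        mixed = [x[i] if i in coalition else baseline[i] for i in features]
        return predict(mixed)

    phi = []
    for i in features:
        others = [j for j in features if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                members = set(subset)
                total += weight * (value(members | {i}) - value(members))
        phi.append(total)
    return phi


def toy_risk_score(features):
    """Hypothetical score with one interaction term (glucose x BMI)."""
    glucose, bmi, age = features
    return 0.02 * glucose + 0.03 * bmi + 0.0005 * glucose * bmi + 0.01 * age


x = [150.0, 32.0, 45.0]         # hypothetical patient: glucose, BMI, age
baseline = [100.0, 25.0, 30.0]  # hypothetical reference values

phi = shapley_values(toy_risk_score, x, baseline)
for name, contribution in zip(["glucose", "bmi", "age"], phi):
    print(f"{name}: {contribution:+.4f}")
```

The per-feature contributions sum to the difference between the score at `x` and the score at `baseline`, which is the additivity property that lets SHAP decompose a single prediction. Because age enters the toy score purely additively, its contribution equals its own term's change, while the glucose-BMI interaction is split between those two features.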

CONCLUSION

We anticipate that these findings will be beneficial to healthcare providers, stakeholders, students, and researchers involved in diabetes prediction research and development.
