利用新型集成特征工程方法和机器学习模型增强对糖尿病的检测。

Enhanced detection of diabetes mellitus using novel ensemble feature engineering approach and machine learning model.

机构信息

School of Systems and Technology, Department of Software Engineering, University of Management and Technology, Lahore, 54770, Pakistan.

Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Al Ahliyya Amman University, Amman, 19328, Jordan.

出版信息

Sci Rep. 2024 Oct 7;14(1):23274. doi: 10.1038/s41598-024-74357-w.

DOI:10.1038/s41598-024-74357-w

PMID:39375469

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11458802/

Abstract

Diabetes is a persistent health condition led by insufficient use or inappropriate use of insulin in the body. If left undetected, it can lead to further complications involving organ damage such as heart, lungs, and eyes. Timely detection of diabetes helps obtain the right medication, diet, and exercise plan to lead a healthy life. ML approach has been utilized to obtain rapid and reliable diabetes detection, however, existing approaches suffer from the use of limited datasets, lack of generalizability, and lower accuracy. This study proposes a novel feature extraction approach to overcome these limitations by using an ensemble of convolutional neural network (CNN) and long short-term memory (LSTM) models. Multiple datasets are combined to make a larger dataset for experiments and multiple features are utilized for investigating the efficacy of the proposed approach. Features from the extra tree classifier, CNN, and LSTM are also considered for comparison. Experimental results reveal the superb performance of CNN-LSTM-based features with random forest model obtaining a 0.99 accuracy score. This performance is further validated by comparison with existing approaches and k-fold cross-validation which shows the proposed approach provides robust results.

摘要

糖尿病是一种由体内胰岛素使用不足或使用不当引起的持续健康状况。如果未被发现，它可能导致涉及心脏、肺部和眼睛等器官损伤的进一步并发症。及时发现糖尿病有助于获得正确的药物、饮食和运动计划，从而过上健康的生活。机器学习方法已被用于快速可靠地检测糖尿病，但现有的方法存在数据集有限、缺乏通用性和准确性较低的问题。本研究提出了一种新的特征提取方法，通过使用卷积神经网络 (CNN) 和长短期记忆 (LSTM) 模型的集成来克服这些限制。组合多个数据集以构建更大的数据集进行实验，并利用多种特征来研究所提出方法的效果。还考虑了来自随机森林模型的额外树分类器、CNN 和 LSTM 的特征。实验结果表明，基于 CNN-LSTM 的特征具有出色的性能，随机森林模型获得了 0.99 的准确率。通过与现有方法和 k 折交叉验证的比较进一步验证了该方法的稳健性，结果表明所提出的方法提供了可靠的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a61c/11458802/5bb7cca35c90/41598_2024_74357_Fig1_HTML.jpg

相似文献

Enhanced detection of diabetes mellitus using novel ensemble feature engineering approach and machine learning model.利用新型集成特征工程方法和机器学习模型增强对糖尿病的检测。

Sci Rep. 2024 Oct 7;14(1):23274. doi: 10.1038/s41598-024-74357-w.

ECG-based cardiac arrhythmias detection through ensemble learning and fusion of deep spatial-temporal and long-range dependency features.基于 ECG 的心脏心律失常检测，通过深度时空和长距离依赖特征的集成学习和融合。

Artif Intell Med. 2024 Apr;150:102818. doi: 10.1016/j.artmed.2024.102818. Epub 2024 Feb 24.

Hybrid CNN-LSTM for Predicting Diabetes: A Review.混合 CNN-LSTM 预测糖尿病：综述

Curr Diabetes Rev. 2024;20(7):e201023222410. doi: 10.2174/0115733998261151230925062430.

Breast cancer detection employing stacked ensemble model with convolutional features.采用堆叠集成模型和卷积特征进行乳腺癌检测。

Cancer Biomark. 2024;40(2):155-170. doi: 10.3233/CBM-230294.

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型，对于使用可穿戴设备进行压力预测具有良好的泛化能力。

J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.

A Multilevel Transfer Learning Technique and LSTM Framework for Generating Medical Captions for Limited CT and DBT Images.一种用于为有限的CT和DBT图像生成医学图像说明的多级迁移学习技术和长短期记忆网络框架。

J Digit Imaging. 2022 Jun;35(3):564-580. doi: 10.1007/s10278-021-00567-7. Epub 2022 Feb 25.

Detecting sarcasm in multi-domain datasets using convolutional neural networks and long short term memory network model.使用卷积神经网络和长短期记忆网络模型检测多领域数据集中的讽刺意味。

PeerJ Comput Sci. 2021 Aug 25;7:e645. doi: 10.7717/peerj-cs.645. eCollection 2021.

Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective.从数据预处理和机器学习角度看糖尿病的预测与诊断

Comput Methods Programs Biomed. 2022 Jun;220:106773. doi: 10.1016/j.cmpb.2022.106773. Epub 2022 Mar 31.

Using a novel convolutional neural network for plant pests detection and disease classification.利用新型卷积神经网络进行植物病虫害检测和疾病分类。

J Sci Food Agric. 2023 Sep;103(12):5849-5861. doi: 10.1002/jsfa.12700. Epub 2023 May 24.

A deep learning approach based on convolutional LSTM for detecting diabetes.基于卷积长短期记忆网络的糖尿病检测深度学习方法。

Comput Biol Chem. 2020 Oct;88:107329. doi: 10.1016/j.compbiolchem.2020.107329. Epub 2020 Jul 10.

引用本文的文献

Efficient diagnosis of diabetes mellitus using an improved ensemble method.使用改进的集成方法对糖尿病进行高效诊断。

Sci Rep. 2025 Jan 25;15(1):3235. doi: 10.1038/s41598-025-87767-1.

本文引用的文献

Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine Learning.使用细针穿刺特征和监督式机器学习进行上采样的乳腺癌预测

Cancers (Basel). 2023 Jan 22;15(3):681. doi: 10.3390/cancers15030681.

Machine learning models for classification and identification of significant attributes to detect type 2 diabetes.用于分类和识别重要属性以检测2型糖尿病的机器学习模型。

Health Inf Sci Syst. 2022 Feb 9;10(1):2. doi: 10.1007/s13755-021-00168-2. eCollection 2022 Dec.

Machine Learning Approaches to Predict Risks of Diabetic Complications and Poor Glycemic Control in Nonadherent Type 2 Diabetes.预测非依从性2型糖尿病患者糖尿病并发症风险和血糖控制不佳的机器学习方法

Front Pharmacol. 2021 Jun 22;12:665951. doi: 10.3389/fphar.2021.665951. eCollection 2021.

Prediction of Type 2 Diabetes Based on Machine Learning Algorithm.基于机器学习算法的 2 型糖尿病预测。

Int J Environ Res Public Health. 2021 Mar 23;18(6):3317. doi: 10.3390/ijerph18063317.

Single-cell ATAC-Seq in human pancreatic islets and deep learning upscaling of rare cells reveals cell-specific type 2 diabetes regulatory signatures.单细胞 ATAC-Seq 在人胰腺胰岛中的应用和深度学习扩展稀有细胞揭示了细胞特异性 2 型糖尿病调控特征。

Mol Metab. 2020 Feb;32:109-121. doi: 10.1016/j.molmet.2019.12.006. Epub 2019 Dec 20.

Forecasting stock prices with a feature fusion LSTM-CNN model using different representations of the same data.基于相同数据的不同表示，使用特征融合 LSTM-CNN 模型预测股票价格。

PLoS One. 2019 Feb 15;14(2):e0212320. doi: 10.1371/journal.pone.0212320. eCollection 2019.

Identifying people at risk of developing type 2 diabetes: A comparison of predictive analytics techniques and predictor variables.识别有患 2 型糖尿病风险的人群：预测分析技术和预测变量的比较。

Int J Med Inform. 2018 Nov;119:22-38. doi: 10.1016/j.ijmedinf.2018.08.008. Epub 2018 Aug 28.

Machine Learning and Data Mining Methods in Diabetes Research.糖尿病研究中的机器学习与数据挖掘方法

Comput Struct Biotechnol J. 2017 Jan 8;15:104-116. doi: 10.1016/j.csbj.2016.12.005. eCollection 2017.

A Predictive Metabolic Signature for the Transition From Gestational Diabetes Mellitus to Type 2 Diabetes.从妊娠期糖尿病转变为2型糖尿病的预测性代谢特征

Diabetes. 2016 Sep;65(9):2529-39. doi: 10.2337/db15-1720. Epub 2016 Jun 23.

Acute Complications of Myocardial Infarction in the Current Era: Diagnosis and Management.当代心肌梗死的急性并发症：诊断与管理

J Investig Med. 2015 Oct;63(7):844-55. doi: 10.1097/JIM.0000000000000232.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用新型集成特征工程方法和机器学习模型增强对糖尿病的检测。

Enhanced detection of diabetes mellitus using novel ensemble feature engineering approach and machine learning model.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献