• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于中老年人群糖尿病诊断的集成装袋-随机森林学习模型

Integrated bagging-RF learning model for diabetes diagnosis in middle-aged and elderly population.

作者信息

Shi Yuanwu, Sun Jiuye

机构信息

College of Art and Design, Wuhan Textile University, Wuhan, Hubei, China.

出版信息

PeerJ Comput Sci. 2024 Oct 31;10:e2436. doi: 10.7717/peerj-cs.2436. eCollection 2024.

DOI:10.7717/peerj-cs.2436
PMID:39650520
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11623014/
Abstract

As the population ages, the increase in the number of middle-aged and older adults with diabetes poses new challenges to the allocation of resources in the healthcare system. Developing accurate diabetes prediction models is a critical public health strategy to improve the efficient use of healthcare resources and ensure timely and effective treatment. In order to improve the identification of diabetes in middle-aged and older patients, a Bagging-RF model is proposed. In the study, two diabetes datasets on Kaggle were first preprocessed, including unique heat coding, outlier removal, and age screening, after which the data were categorized into three age groups, 50-60, 60-70, and 70-80, and balanced using the SMOTE technique. Then, the machine learning classifiers were trained using the Bagging-RF integrated model with eight other machine learning classifiers. Finally, the model's performance was evaluated by accuracy, 1 score, and other metrics. The results showed that the Bagging-RF model outperformed the other eight machine learning classifiers, exhibiting 97.35%, 95.55%, 95.14% accuracy and 97.35%, 97.35%, 95.14% 1 Score at the Diabetes Prediction Dataset for diabetes prediction for the three age groups of 50-60, 60-70, and 70-80; and 97.03%, 94.90%, 93.70% accuracy and 97.03%, 94.90%, 93.70% 1 Score at the Diabetes Prediction Dataset. 95.55%, 95.13% 1 Score; and 97.03%, 94.90%, 93.70% accuracy; and 97.03%, 94.89%, 93.70% 1 Score at Diabetes Prediction Dataset. In addition, while other integrated learning models, such as ET, RF, Adaboost, and XGB, fail to outperform Bagging-RF, they also show excellent performance.

摘要

随着人口老龄化,糖尿病中老年患者数量的增加给医疗保健系统的资源分配带来了新挑战。开发准确的糖尿病预测模型是提高医疗资源利用效率并确保及时有效治疗的一项关键公共卫生策略。为了提高中老年患者糖尿病的识别率,提出了一种Bagging-RF模型。在该研究中,首先对Kaggle上的两个糖尿病数据集进行预处理,包括独热编码、异常值去除和年龄筛选,之后将数据分为50 - 60岁、60 - 70岁和70 - 80岁三个年龄组,并使用SMOTE技术进行平衡处理。然后,使用Bagging-RF集成模型与其他八个机器学习分类器对机器学习分类器进行训练。最后,通过准确率、F1分数等指标对模型性能进行评估。结果表明,Bagging-RF模型优于其他八个机器学习分类器,在50 - 60岁、60 - 70岁和70 - 80岁三个年龄组的糖尿病预测数据集上,糖尿病预测的准确率分别为97.35%、95.55%、95.14%,F1分数分别为97.35%、97.35%、95.14%;在糖尿病预测数据集上的准确率分别为97.03%、94.90%、93.70%,F1分数分别为97.03%、94.90%、93.70%,以及95.55%、95.13%的F1分数;在糖尿病预测数据集上的准确率分别为97.03%、94.90%、93.70%,F1分数分别为97.03%、94.89%、93.70%。此外,虽然其他集成学习模型,如ET、RF、Adaboost和XGB,未能超过Bagging-RF,但它们也表现出了出色的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/63ec2ff7669f/peerj-cs-10-2436-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/8597240d563c/peerj-cs-10-2436-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/790a3a6cac67/peerj-cs-10-2436-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/d540bd485f18/peerj-cs-10-2436-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/4e175f2fa071/peerj-cs-10-2436-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/7d9a1f6053a6/peerj-cs-10-2436-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/076cde0c1aba/peerj-cs-10-2436-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/ddc8e66d2e4e/peerj-cs-10-2436-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/549da2e15609/peerj-cs-10-2436-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/63ec2ff7669f/peerj-cs-10-2436-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/8597240d563c/peerj-cs-10-2436-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/790a3a6cac67/peerj-cs-10-2436-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/d540bd485f18/peerj-cs-10-2436-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/4e175f2fa071/peerj-cs-10-2436-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/7d9a1f6053a6/peerj-cs-10-2436-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/076cde0c1aba/peerj-cs-10-2436-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/ddc8e66d2e4e/peerj-cs-10-2436-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/549da2e15609/peerj-cs-10-2436-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/12a3/11623014/63ec2ff7669f/peerj-cs-10-2436-g009.jpg

相似文献

1
Integrated bagging-RF learning model for diabetes diagnosis in middle-aged and elderly population.用于中老年人群糖尿病诊断的集成装袋-随机森林学习模型
PeerJ Comput Sci. 2024 Oct 31;10:e2436. doi: 10.7717/peerj-cs.2436. eCollection 2024.
2
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
3
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
4
The Impact of the SMOTE Method on Machine Learning and Ensemble Learning Performance Results in Addressing Class Imbalance in Data Used for Predicting Total Testosterone Deficiency in Type 2 Diabetes Patients.SMOTE方法对机器学习和集成学习性能结果的影响:用于解决2型糖尿病患者总睾酮缺乏预测数据中的类不平衡问题
Diagnostics (Basel). 2024 Nov 22;14(23):2634. doi: 10.3390/diagnostics14232634.
5
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
6
Performance discrepancy mitigation in heart disease prediction for multisensory inter-datasets.多感官跨数据集心脏病预测中的性能差异缓解
PeerJ Comput Sci. 2024 Mar 18;10:e1917. doi: 10.7717/peerj-cs.1917. eCollection 2024.
7
Data Augmentation and Machine Learning algorithms for multi-class imbalanced morphometrics data of stingless bees.用于无刺蜂多类不平衡形态测量数据的数据增强和机器学习算法
Heliyon. 2025 Jan 23;11(3):e42214. doi: 10.1016/j.heliyon.2025.e42214. eCollection 2025 Feb 15.
8
Explainable artificial intelligence (XAI) for predicting the need for intubation in methanol-poisoned patients: a study comparing deep and machine learning models.可解释人工智能 (XAI) 在预测甲醇中毒患者需要插管中的应用:比较深度学习和机器学习模型的研究。
Sci Rep. 2024 Jul 8;14(1):15751. doi: 10.1038/s41598-024-66481-4.
9
Structure-activity relationship-based chemical classification of highly imbalanced Tox21 datasets.基于结构-活性关系的高度不平衡Tox21数据集的化学分类
J Cheminform. 2020 Oct 27;12(1):66. doi: 10.1186/s13321-020-00468-x.
10
Utilizing SMOTE-TomekLink and machine learning to construct a predictive model for elderly medical and daily care services demand.利用SMOTE-TomekLink和机器学习构建老年医疗和日常护理服务需求预测模型。
Sci Rep. 2025 Mar 11;15(1):8446. doi: 10.1038/s41598-025-92722-1.

本文引用的文献

1
A stacked ensemble machine learning approach for the prediction of diabetes.一种用于预测糖尿病的堆叠集成机器学习方法。
J Diabetes Metab Disord. 2023 Nov 22;23(1):603-617. doi: 10.1007/s40200-023-01321-2. eCollection 2024 Jun.
2
An intelligent diabetes classification and perception framework based on ensemble and deep learning method.基于集成与深度学习方法的智能糖尿病分类与感知框架
PeerJ Comput Sci. 2024 Mar 29;10:e1914. doi: 10.7717/peerj-cs.1914. eCollection 2024.
3
Evaluation metrics and statistical tests for machine learning.
机器学习的评估指标和统计检验。
Sci Rep. 2024 Mar 13;14(1):6086. doi: 10.1038/s41598-024-56706-x.
4
Multi-feature fusion and dandelion optimizer based model for automatically diagnosing the gastrointestinal diseases.基于多特征融合与蒲公英优化器的胃肠道疾病自动诊断模型
PeerJ Comput Sci. 2024 Feb 28;10:e1919. doi: 10.7717/peerj-cs.1919. eCollection 2024.
5
Novel ensemble learning approach with SVM-imputed ADASYN features for enhanced cervical cancer prediction.基于支持向量机插补 ADASYN 特征的新型集成学习方法,用于增强宫颈癌预测。
PLoS One. 2024 Jan 10;19(1):e0296107. doi: 10.1371/journal.pone.0296107. eCollection 2024.
6
Automatic classification of kidney CT images with relief based novel hybrid deep model.基于 Relief 的新型混合深度模型对肾脏 CT 图像进行自动分类
PeerJ Comput Sci. 2023 Nov 30;9:e1717. doi: 10.7717/peerj-cs.1717. eCollection 2023.
7
Revolutionizing Diabetes Diagnosis: Machine Learning Techniques Unleashed.变革糖尿病诊断:机器学习技术大显身手。
Healthcare (Basel). 2023 Oct 31;11(21):2864. doi: 10.3390/healthcare11212864.
8
Diabetes risk prediction model based on community follow-up data using machine learning.基于社区随访数据并运用机器学习的糖尿病风险预测模型
Prev Med Rep. 2023 Aug 20;35:102358. doi: 10.1016/j.pmedr.2023.102358. eCollection 2023 Oct.
9
A hybrid super ensemble learning model for the early-stage prediction of diabetes risk.一种用于糖尿病风险早期预测的混合超级集成学习模型。
Med Biol Eng Comput. 2023 Mar;61(3):785-797. doi: 10.1007/s11517-022-02749-z. Epub 2023 Jan 5.
10
Machine Learning Models for Data-Driven Prediction of Diabetes by Lifestyle Type.基于生活方式类型的数据驱动糖尿病预测的机器学习模型。
Int J Environ Res Public Health. 2022 Nov 15;19(22):15027. doi: 10.3390/ijerph192215027.