An Efficient, Ensemble-Based Classification Framework for Big Medical Data.

Affiliations

Higher Colleges of Technology, Dubai, UAE.

School of Engineering, Malla Reddy University, Hyderabad, India.

Publication information

Big Data. 2022 Apr;10(2):151-160. doi: 10.1089/big.2021.0132. Epub 2021 Sep 23.

DOI:10.1089/big.2021.0132
PMID:34558983
Abstract

Fetching useful information from big medical datasets is a complicated task in the big data age. Various classification algorithms are used in the data mining process to analyze information from the big medical dataset. Nevertheless, these classification algorithms are insufficient to handle big medical data. This work proposes an efficient, ensemble-based classification framework for big medical data to deal with this problem. The proposed work involves initially applying the preprocessing technique to remove noise, missing values, and unwanted features from big medical data. The process selects a subset of classifiers from a pool of classifiers. The selected classifiers are combined to form a hybrid system for efficient classification. The methodology further involves incremental learning from data samples, explaining the predicted outputs, and achieving high classification performance. Java is used for simulation, and the Cleveland Heart Disease big dataset and Diabetes big dataset are used for classification. The experimental result shows that the proposed ensemble algorithm provides an efficient classification compared with existing algorithms based on accuracy, precision, F-measure, recall, and execution time.
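
To make the select-then-combine step in the abstract concrete, here is a minimal Java sketch. The abstract does not state the selection criterion or the combination rule, so ranking by accuracy and unweighted majority voting are assumptions made for illustration. The class `EnsembleSketch`, its methods, the threshold classifiers, and the toy data are all hypothetical and are not taken from the paper.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: all names, data, and rules here are illustrative assumptions.
class EnsembleSketch {
    /** A binary classifier mapping a feature vector to label 0 or 1. */
    interface Classifier {
        int predict(double[] x);
    }

    /** Fraction of rows in (xs, ys) that the classifier labels correctly. */
    static double accuracy(Classifier c, double[][] xs, int[] ys) {
        int correct = 0;
        for (int i = 0; i < xs.length; i++) {
            if (c.predict(xs[i]) == ys[i]) correct++;
        }
        return (double) correct / xs.length;
    }

    /** Keep the k classifiers with the highest accuracy (assumed selection criterion). */
    static List<Classifier> selectTopK(List<Classifier> pool, double[][] xs, int[] ys, int k) {
        List<Classifier> sorted = new ArrayList<>(pool);
        // Negate the score so that the most accurate classifier sorts first.
        sorted.sort(Comparator.comparingDouble((Classifier c) -> -accuracy(c, xs, ys)));
        return new ArrayList<>(sorted.subList(0, Math.min(k, sorted.size())));
    }

    /** Combine classifiers by unweighted majority vote (assumed rule); ties go to label 0. */
    static int majorityVote(List<Classifier> ensemble, double[] x) {
        int votesForOne = 0;
        for (Classifier c : ensemble) votesForOne += c.predict(x);
        return (2 * votesForOne > ensemble.size()) ? 1 : 0;
    }

    /** A toy pool of four hand-written classifiers over two features. */
    static List<Classifier> toyPool() {
        List<Classifier> pool = new ArrayList<>();
        pool.add(x -> x[0] > 0.5 ? 1 : 0); // threshold on feature 0
        pool.add(x -> x[1] > 0.5 ? 1 : 0); // threshold on feature 1
        pool.add(x -> 1);                  // always predicts label 1
        pool.add(x -> x[0] > 0.5 ? 0 : 1); // inverted threshold on feature 0
        return pool;
    }

    public static void main(String[] args) {
        // Made-up data; a real run would select on a held-out validation split.
        double[][] xs = {{0.1, 0.2}, {0.2, 0.9}, {0.8, 0.7}, {0.9, 0.1}, {0.7, 0.8}, {0.3, 0.3}};
        int[] ys = {0, 0, 1, 1, 1, 0};

        List<Classifier> ensemble = selectTopK(toyPool(), xs, ys, 3);
        int correct = 0;
        for (int i = 0; i < xs.length; i++) {
            if (majorityVote(ensemble, xs[i]) == ys[i]) correct++;
        }
        System.out.println("Selected " + ensemble.size() + " of " + toyPool().size() + " classifiers");
        System.out.println("Ensemble accuracy: " + (double) correct / xs.length);
    }
}
```

Selecting on the same rows that are later used for scoring, as this toy `main` does, overstates performance; it is kept only to keep the sketch short.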

Similar articles

1
An Efficient, Ensemble-Based Classification Framework for Big Medical Data.
Big Data. 2022 Apr;10(2):151-160. doi: 10.1089/big.2021.0132. Epub 2021 Sep 23.
2
R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data.
Comput Methods Programs Biomed. 2020 Feb;184:105122. doi: 10.1016/j.cmpb.2019.105122. Epub 2019 Oct 8.
3
EAGA-MLP-An Enhanced and Adaptive Hybrid Classification Model for Diabetes Diagnosis.
Sensors (Basel). 2020 Jul 20;20(14):4036. doi: 10.3390/s20144036.
4
LNTP-MDBN: Big Data Integrated Learning Framework for Heterogeneous Image Set Classification.
Curr Med Imaging Rev. 2019;15(2):227-236. doi: 10.2174/1573405613666170721103949.
5
An Ensemble-Based Scalable Approach for Intrusion Detection Using Big Data Framework.
Big Data. 2021 Aug;9(4):303-321. doi: 10.1089/big.2020.0201. Epub 2021 Jul 16.
6
Development of Rheumatoid Arthritis Classification from Electronic Image Sensor Using Ensemble Method.
Sensors (Basel). 2019 Dec 27;20(1):167. doi: 10.3390/s20010167.
7
A Novel Intelligent Hybrid Optimized Analytics and Streaming Engine for Medical Big Data.
Comput Math Methods Med. 2022 Mar 17;2022:7120983. doi: 10.1155/2022/7120983. eCollection 2022.
8
Effective prediction of heart disease using hybrid ensemble deep learning and tunicate swarm algorithm.
J Biomol Struct Dyn. 2022;40(23):13334-13345. doi: 10.1080/07391102.2021.1987328. Epub 2021 Oct 18.
9
Incremental Ant-Miner Classifier for Online Big Data Analytics.
Sensors (Basel). 2022 Mar 13;22(6):2223. doi: 10.3390/s22062223.
10
Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling.
Comput Math Methods Med. 2018 Jul 22;2018:2497471. doi: 10.1155/2018/2497471. eCollection 2018.

Cited by

1
Machine Learning-Driven Biomarker Discovery for Skeletal Complications in Type 1 Gaucher Disease Patients.
Int J Mol Sci. 2024 Aug 6;25(16):8586. doi: 10.3390/ijms25168586.
2
Clinical Uncertainty Influences Antibiotic Prescribing for Upper Respiratory Tract Infections: A Qualitative Study of Township Hospital Physicians and Village Doctors in Rural Shandong Province, China.
Antibiotics (Basel). 2023 Jun 8;12(6):1027. doi: 10.3390/antibiotics12061027.