• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 Pareto 的特征和实例选择集成学习方法在多类不平衡数据集上的应用。

A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets.

机构信息

1 Department of Computer Science and Artificial Intelligence, University of Granada, Granada 18071, Spain.

2 Department of Civil Engineering, University of Burgos, Burgos 09006, Spain.

出版信息

Int J Neural Syst. 2017 Sep;27(6):1750028. doi: 10.1142/S0129065717500289. Epub 2017 Apr 11.

DOI:10.1142/S0129065717500289
PMID:28633551
Abstract

Imbalanced classification is related to those problems that have an uneven distribution among classes. In addition to the former, when instances are located into the overlapped areas, the correct modeling of the problem becomes harder. Current solutions for both issues are often focused on the binary case study, as multi-class datasets require an additional effort to be addressed. In this research, we overcome these problems by carrying out a combination between feature and instance selections. Feature selection will allow simplifying the overlapping areas easing the generation of rules to distinguish among the classes. Selection of instances from all classes will address the imbalance itself by finding the most appropriate class distribution for the learning task, as well as possibly removing noise and difficult borderline examples. For the sake of obtaining an optimal joint set of features and instances, we embedded the searching for both parameters in a Multi-Objective Evolutionary Algorithm, using the C4.5 decision tree as baseline classifier in this wrapper approach. The multi-objective scheme allows taking a double advantage: the search space becomes broader, and we may provide a set of different solutions in order to build an ensemble of classifiers. This proposal has been contrasted versus several state-of-the-art solutions on imbalanced classification showing excellent results in both binary and multi-class problems.

摘要

不平衡分类与那些在类之间分布不均匀的问题有关。除了前者,当实例位于重叠区域时,问题的正确建模变得更加困难。目前针对这两个问题的解决方案通常集中在二进制案例研究上,因为多类数据集需要额外的努力来解决。在这项研究中,我们通过特征和实例选择的组合来克服这些问题。特征选择将简化重叠区域,从而简化生成规则以区分不同的类别。从所有类别中选择实例可以通过为学习任务找到最合适的类别分布来解决不平衡问题,同时可能还可以去除噪声和困难的边界示例。为了获得最优的特征和实例组合,我们将这两个参数的搜索嵌入到多目标进化算法中,在这个包装器方法中,使用 C4.5 决策树作为基准分类器。多目标方案具有双重优势:搜索空间更加广阔,我们可以提供一组不同的解决方案,以构建分类器集合。该提案与不平衡分类的几种最先进的解决方案进行了对比,在二进制和多类问题上都取得了优异的结果。

相似文献

1
A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets.基于 Pareto 的特征和实例选择集成学习方法在多类不平衡数据集上的应用。
Int J Neural Syst. 2017 Sep;27(6):1750028. doi: 10.1142/S0129065717500289. Epub 2017 Apr 11.
2
A scalable memetic algorithm for simultaneous instance and feature selection.一种用于同时进行实例和特征选择的可扩展Memetic算法。
Evol Comput. 2014 Spring;22(1):1-45. doi: 10.1162/EVCO_a_00102. Epub 2013 Aug 8.
3
Ensemble of Rotation Trees for Imbalanced Medical Datasets.旋转树集成算法在医学数据集不均衡分类问题中的应用。
J Healthc Eng. 2018 Apr 10;2018:8902981. doi: 10.1155/2018/8902981. eCollection 2018.
4
Multi-objective evolutionary algorithms for fuzzy classification in survival prediction.多目标进化算法在生存预测中的模糊分类。
Artif Intell Med. 2014 Mar;60(3):197-219. doi: 10.1016/j.artmed.2013.12.006. Epub 2014 Jan 9.
5
Constructing ensembles of classifiers by means of weighted instance selection.通过加权实例选择构建分类器集成。
IEEE Trans Neural Netw. 2009 Feb;20(2):258-77. doi: 10.1109/TNN.2008.2005496. Epub 2009 Jan 27.
6
A hybrid cost-sensitive ensemble for imbalanced breast thermogram classification.一种用于不平衡乳腺热成像分类的混合成本敏感集成方法。
Artif Intell Med. 2015 Nov;65(3):219-27. doi: 10.1016/j.artmed.2015.07.005. Epub 2015 Jul 31.
7
Feature Selection and Classification of Clinical Datasets Using Bioinspired Algorithms and Super Learner.基于生物启发算法和超级学习者的临床数据集特征选择与分类。
Comput Math Methods Med. 2021 May 17;2021:6662420. doi: 10.1155/2021/6662420. eCollection 2021.
8
Feature Selection for High-Dimensional and Imbalanced Biomedical Data Based on Robust Correlation Based Redundancy and Binary Grasshopper Optimization Algorithm.基于稳健相关冗余和二进制沙蝇优化算法的高维不平衡生物医学数据特征选择。
Genes (Basel). 2020 Jun 27;11(7):717. doi: 10.3390/genes11070717.
9
Heterogeneous Ensemble Combination Search Using Genetic Algorithm for Class Imbalanced Data Classification.使用遗传算法的异构集成组合搜索用于类别不平衡数据分类
PLoS One. 2016 Jan 14;11(1):e0146116. doi: 10.1371/journal.pone.0146116. eCollection 2016.
10
Multi-Objective Particle Swarm Optimization Approach for Cost-Based Feature Selection in Classification.用于分类中基于成本的特征选择的多目标粒子群优化方法
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):64-75. doi: 10.1109/TCBB.2015.2476796. Epub 2015 Sep 4.

引用本文的文献

1
Low-cost and precise traditional Chinese medicinal tree pest and disease monitoring using UAV RGB image only.仅使用无人机 RGB 图像低成本且精确监测传统中药树木的病虫害。
Sci Rep. 2024 Oct 26;14(1):25562. doi: 10.1038/s41598-024-76502-x.
2
Predicting Improved Daily Use of the More Affected Arm Poststroke Following Constraint-Induced Movement Therapy.预测强制性运动疗法后卒中后更瘫痪侧上肢日常使用改善情况。
Phys Ther. 2019 Dec 16;99(12):1667-1678. doi: 10.1093/ptj/pzz121.