• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过聚合进化层次分类器来对抗多类不平衡问题。

To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.

作者信息

Ning Zhihan, Jiang Zhixing, Zhang David

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Apr 8;PP. doi: 10.1109/TNNLS.2024.3383672.

DOI:10.1109/TNNLS.2024.3383672
PMID:38587952
Abstract

Real-world datasets are often imbalanced, posing frequent challenges to canonical machine learning algorithms that assume a balanced class distribution. Moreover, the imbalance problem becomes more complicated when the dataset is multiclass. Although many approaches have been presented for imbalanced learning (IL), research on the multiclass imbalanced problem is relatively limited and deficient. To alleviate these issues, we propose a forest of evolutionary hierarchical classifiers (FEHC) method for multiclass IL (MCIL). FEHC can be seen as a classifier fusion framework with a forest structure, and it aggregates several evolutionary hierarchical multiclassifiers (EHMCs) to reduce generalization error. Specifically, a multichromosome genetic algorithm (MCGA) is designed to simultaneously select (sub)optimal features, classifiers, and hierarchical structures when generating these EHMCs. The MCGA adopts a dynamic weighting module to learn difficult classes and promote the diversity of FEHC. We also present the "stratified underbagging" (SUB) strategy to address class imbalance and the "soft tree traversal" (STT) strategy to make FEHC converge faster and better. We thoroughly evaluate the proposed algorithm using 14 multiclass imbalanced datasets with various properties. Compared with popular and state-of-the-art approaches, FEHC obtains better performance under different evaluation metrics. Codes have been made publicly available on GitHub.https://github.com/CUHKSZ-NING/FEHCClassifier.

摘要

现实世界的数据集往往是不平衡的,这给假设类分布平衡的传统机器学习算法带来了频繁的挑战。此外,当数据集是多类时,不平衡问题会变得更加复杂。尽管已经提出了许多用于不平衡学习(IL)的方法,但对多类不平衡问题的研究相对有限且不足。为了缓解这些问题,我们提出了一种用于多类不平衡学习(MCIL)的进化分层分类器森林(FEHC)方法。FEHC可以看作是一个具有森林结构的分类器融合框架,它聚合了多个进化分层多分类器(EHMC)以减少泛化误差。具体来说,设计了一种多染色体遗传算法(MCGA),在生成这些EHMC时同时选择(子)最优特征、分类器和层次结构。MCGA采用动态加权模块来学习困难类并促进FEHC的多样性。我们还提出了“分层欠采样”(SUB)策略来解决类不平衡问题,以及“软树遍历”(STT)策略以使FEHC更快更好地收敛。我们使用14个具有不同属性的多类不平衡数据集对提出的算法进行了全面评估。与流行的和最新的方法相比,FEHC在不同的评估指标下都获得了更好的性能。代码已在GitHub上公开提供。https://github.com/CUHKSZ-NING/FEHCClassifier。

相似文献

1
To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.通过聚合进化层次分类器来对抗多类不平衡问题。
IEEE Trans Neural Netw Learn Syst. 2024 Apr 8;PP. doi: 10.1109/TNNLS.2024.3383672.
2
Learning With Multiclass AUC: Theory and Algorithms.多类别AUC学习:理论与算法
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7747-7763. doi: 10.1109/TPAMI.2021.3101125. Epub 2022 Oct 4.
3
Radial-Based Oversampling for Multiclass Imbalanced Data Classification.基于径向基的多类不平衡数据分类过采样方法
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2818-2831. doi: 10.1109/TNNLS.2019.2913673. Epub 2019 Jun 21.
4
Inverse free reduced universum twin support vector machine for imbalanced data classification.用于不平衡数据分类的逆自由约简全域孪生支持向量机
Neural Netw. 2023 Jan;157:125-135. doi: 10.1016/j.neunet.2022.10.003. Epub 2022 Oct 15.
5
Multiclass Imbalance Problems: Analysis and Potential Solutions.多类不平衡问题:分析与潜在解决方案
IEEE Trans Syst Man Cybern B Cybern. 2012 Aug;42(4):1119-30. doi: 10.1109/TSMCB.2012.2187280. Epub 2012 Mar 16.
6
A Pareto-based Ensemble with Feature and Instance Selection for Learning from Multi-Class Imbalanced Datasets.基于 Pareto 的特征和实例选择集成学习方法在多类不平衡数据集上的应用。
Int J Neural Syst. 2017 Sep;27(6):1750028. doi: 10.1142/S0129065717500289. Epub 2017 Apr 11.
7
Multiclass feature selection with metaheuristic optimization algorithms: a review.基于元启发式优化算法的多类特征选择:综述
Neural Comput Appl. 2022;34(22):19751-19790. doi: 10.1007/s00521-022-07705-4. Epub 2022 Aug 30.
8
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
9
Conversion of adverse data corpus to shrewd output using sampling metrics.使用抽样指标将不良数据语料库转换为精准输出。
Vis Comput Ind Biomed Art. 2020 Aug 11;3(1):19. doi: 10.1186/s42492-020-00055-9.
10
Decoupling representation learning for imbalanced electroencephalography classification in rapid serial visual presentation task.快速序列视觉呈现任务中用于不平衡脑电图分类的解耦表示学习
J Neural Eng. 2022 May 13;19(3). doi: 10.1088/1741-2552/ac6a7d.