• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

mCRF和mRD:基于新型多类标签噪声过滤学习框架的两种分类方法。

mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework.

作者信息

Xia Shuyin, Chen Baiyun, Wang Guoyin, Zheng Yong, Gao Xinbo, Giem Elisabeth, Chen Zizhong

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2916-2930. doi: 10.1109/TNNLS.2020.3047046. Epub 2022 Jul 6.

DOI:10.1109/TNNLS.2020.3047046
PMID:33428577
Abstract

Mitigating label noise is a crucial problem in classification. Noise filtering is an effective method of dealing with label noise which does not need to estimate the noise rate or rely on any loss function. However, most filtering methods focus mainly on binary classification, leaving the more difficult counterpart problem of multiclass classification relatively unexplored. To remedy this deficit, we present a definition for label noise in a multiclass setting and propose a general framework for a novel label noise filtering learning method for multiclass classification. Two examples of noise filtering methods for multiclass classification, multiclass complete random forest (mCRF) and multiclass relative density, are derived from their binary counterparts using our proposed framework. In addition, to optimize the NI_threshold hyperparameter in mCRF, we propose two new optimization methods: a new voting cross-validation method and an adaptive method that employs a 2-means clustering algorithm. Furthermore, we incorporate SMOTE into our label noise filtering learning framework to handle the ubiquitous problem of imbalanced data in multiclass classification. We report experiments on both synthetic data sets and UCI benchmarks to demonstrate our proposed methods are highly robust to label noise in comparison with state-of-the-art baselines. All code and data results are available at https://github.com/syxiaa/Multiclass-Label-Noise-Filtering-Learning.

摘要

减轻标签噪声是分类中的一个关键问题。噪声过滤是一种处理标签噪声的有效方法,它不需要估计噪声率或依赖任何损失函数。然而,大多数过滤方法主要集中在二分类上,而多分类中更具挑战性的对应问题相对较少被探索。为了弥补这一不足,我们给出了多分类设置下标签噪声的定义,并提出了一种用于多分类的新型标签噪声过滤学习方法的通用框架。基于我们提出的框架,从其二分类对应方法中推导出了两种多分类噪声过滤方法的示例,即多分类完全随机森林(mCRF)和多分类相对密度。此外,为了优化mCRF中的NI_threshold超参数,我们提出了两种新的优化方法:一种新的投票交叉验证方法和一种采用二均值聚类算法的自适应方法。此外,我们将SMOTE纳入我们的标签噪声过滤学习框架,以处理多分类中普遍存在的不平衡数据问题。我们在合成数据集和UCI基准上进行了实验,以证明与现有最先进的基线相比,我们提出的方法对标签噪声具有高度鲁棒性。所有代码和数据结果可在https://github.com/syxiaa/Multiclass-Label-Noise-Filtering-Learning获取。

相似文献

1
mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework.mCRF和mRD:基于新型多类标签噪声过滤学习框架的两种分类方法。
IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):2916-2930. doi: 10.1109/TNNLS.2020.3047046. Epub 2022 Jul 6.
2
Learning With Multiclass AUC: Theory and Algorithms.多类别AUC学习:理论与算法
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7747-7763. doi: 10.1109/TPAMI.2021.3101125. Epub 2022 Oct 4.
3
Robust Point Cloud Segmentation With Noisy Annotations.基于含噪标注的鲁棒点云分割。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7696-7710. doi: 10.1109/TPAMI.2022.3225323. Epub 2023 May 5.
4
To Combat Multiclass Imbalanced Problems by Aggregating Evolutionary Hierarchical Classifiers.通过聚合进化层次分类器来对抗多类不平衡问题。
IEEE Trans Neural Netw Learn Syst. 2024 Apr 8;PP. doi: 10.1109/TNNLS.2024.3383672.
5
Hybrid Multi-Label Classification Model for Medical Applications Based on Adaptive Synthetic Data and Ensemble Learning.基于自适应合成数据和集成学习的医学应用混合多标签分类模型
Sensors (Basel). 2023 Jul 31;23(15):6836. doi: 10.3390/s23156836.
6
Iterative ensemble feature selection for multiclass classification of imbalanced microarray data.用于不平衡微阵列数据多类分类的迭代集成特征选择
J Biol Res (Thessalon). 2016 Jul 4;23(Suppl 1):13. doi: 10.1186/s40709-016-0045-8. eCollection 2016 May.
7
Radial-Based Oversampling for Multiclass Imbalanced Data Classification.基于径向基的多类不平衡数据分类过采样方法
IEEE Trans Neural Netw Learn Syst. 2020 Aug;31(8):2818-2831. doi: 10.1109/TNNLS.2019.2913673. Epub 2019 Jun 21.
8
Multiclass Learning With Partially Corrupted Labels.多类学习中的部分标签损坏问题。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2568-2580. doi: 10.1109/TNNLS.2017.2699783. Epub 2017 May 16.
9
Random Space Division Sampling for Label-Noisy Classification or Imbalanced Classification.随机空间划分采样在标签噪声分类或不平衡分类中的应用。
IEEE Trans Cybern. 2022 Oct;52(10):10444-10457. doi: 10.1109/TCYB.2021.3070005. Epub 2022 Sep 19.
10
A Multiview Learning Framework With a Linear Computational Cost.一种具有线性计算成本的多视图学习框架。
IEEE Trans Cybern. 2018 Aug;48(8):2416-2425. doi: 10.1109/TCYB.2017.2739423. Epub 2017 Aug 22.

引用本文的文献

1
COVID-19 chest X-ray image classification in the presence of noisy labels.存在噪声标签情况下的COVID-19胸部X光图像分类
Displays. 2023 Apr;77:102370. doi: 10.1016/j.displa.2023.102370. Epub 2023 Jan 8.
2
Data Valuation Algorithm for Inertial Measurement Unit-Based Human Activity Recognition.基于惯性测量单元的人类活动识别数据评估算法
Sensors (Basel). 2022 Dec 24;23(1):184. doi: 10.3390/s23010184.