不良标签：关于评估和增强标签噪声学习的稳健视角

BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning.

作者信息

Zhang Jingfeng, Song Bo, Wang Haohan, Han Bo, Liu Tongliang, Liu Lei, Sugiyama Masashi

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Jun;46(6):4398-4409. doi: 10.1109/TPAMI.2024.3355425. Epub 2024 May 7.

DOI:10.1109/TPAMI.2024.3355425

Abstract

Label-noise learning (LNL) aims to increase the model's generalization given training data with noisy labels. To facilitate practical LNL algorithms, researchers have proposed different label noise types, ranging from class-conditional to instance-dependent noises. In this paper, we introduce a novel label noise type called BadLabel, which can significantly degrade the performance of existing LNL algorithms by a large margin. BadLabel is crafted based on the label-flipping attack against standard classification, where specific samples are selected and their labels are flipped to other labels so that the loss values of clean and noisy labels become indistinguishable. To address the challenge posed by BadLabel, we further propose a robust LNL method that perturbs the labels in an adversarial manner at each epoch to make the loss values of clean and noisy labels again distinguishable. Once we select a small set of (mostly) clean labeled data, we can apply the techniques of semi-supervised learning to train the model accurately. Empirically, our experimental results demonstrate that existing LNL algorithms are vulnerable to the newly introduced BadLabel noise type, while our proposed robust LNL method can effectively improve the generalization performance of the model under various types of label noise. The new dataset of noisy labels and the source codes of robust LNL algorithms are available at https://github.com/zjfheart/BadLabels.

摘要

标签噪声学习（LNL）旨在在给定带有噪声标签的训练数据的情况下提高模型的泛化能力。为了促进实用的LNL算法，研究人员提出了不同类型的标签噪声，从类条件噪声到实例依赖噪声。在本文中，我们引入了一种名为BadLabel的新型标签噪声，它可以大幅显著降低现有LNL算法的性能。BadLabel是基于针对标准分类的标签翻转攻击构建的，其中选择特定样本并将其标签翻转到其他标签，以使干净标签和噪声标签的损失值变得难以区分。为了应对BadLabel带来的挑战，我们进一步提出了一种鲁棒的LNL方法，该方法在每个epoch以对抗方式扰动标签，以使干净标签和噪声标签的损失值再次可区分。一旦我们选择了一小部分（大部分）干净的带标签数据，我们就可以应用半监督学习技术来准确训练模型。从经验上看，我们的实验结果表明，现有的LNL算法容易受到新引入的BadLabel噪声类型的影响，而我们提出的鲁棒LNL方法可以在各种类型的标签噪声下有效提高模型的泛化性能。有噪声标签的新数据集和鲁棒LNL算法的源代码可在https://github.com/zjfheart/BadLabels上获取。

相似文献

BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning.

IEEE Trans Pattern Anal Mach Intell. 2024 Jun;46(6):4398-4409. doi: 10.1109/TPAMI.2024.3355425. Epub 2024 May 7.

Learning With Noisy Labels Over Imbalanced Subpopulations.

IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6544-6555. doi: 10.1109/TNNLS.2024.3389676. Epub 2025 Apr 4.

S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.

Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12.

Invariant feature based label correction for DNN when Learning with Noisy Labels.

Neural Netw. 2024 Apr;172:106137. doi: 10.1016/j.neunet.2024.106137. Epub 2024 Jan 29.

A Time-Consistency Curriculum for Learning From Instance-Dependent Noisy Labels.

IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):4830-4842. doi: 10.1109/TPAMI.2024.3360623. Epub 2024 Jun 5.

Knowledge Distillation Meets Label Noise Learning: Ambiguity-Guided Mutual Label Refinery.

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):939-952. doi: 10.1109/TNNLS.2023.3335829. Epub 2025 Jan 7.

Hard Sample Aware Noise Robust Learning for Histopathology Image Classification.

IEEE Trans Med Imaging. 2022 Apr;41(4):881-894. doi: 10.1109/TMI.2021.3125459. Epub 2022 Apr 1.

Generative Reasoning Integrated Label Noise Robust Deep Image Representation Learning.

IEEE Trans Image Process. 2023;32:4529-4542. doi: 10.1109/TIP.2023.3293776. Epub 2023 Aug 10.

A Parametrical Model for Instance-Dependent Label Noise.

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14055-14068. doi: 10.1109/TPAMI.2023.3301876. Epub 2023 Nov 3.

Learning From Pixel-Level Label Noise: A New Perspective for Semi-Supervised Semantic Segmentation.

IEEE Trans Image Process. 2022;31:623-635. doi: 10.1109/TIP.2021.3134142. Epub 2021 Dec 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

不良标签：关于评估和增强标签噪声学习的稳健视角

BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning.

作者信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献