Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation.

Authors

Pan Haolin, Guo Yong, Yu Mianjie, Chen Jian

Publication information

IEEE Trans Image Process. 2024;33:4215-4230. doi: 10.1109/TIP.2024.3425148. Epub 2024 Jul 22.

DOI: 10.1109/TIP.2024.3425148

Abstract

Real-world data often follow a long-tailed distribution, in which a few head classes account for most of the data while a large number of tail classes contain very few samples. In practice, deep models often generalize poorly on tail classes because of this imbalance. Data augmentation addresses the problem effectively by synthesizing new samples for tail classes. One popular approach is CutMix, which explicitly mixes images of tail classes with other images and constructs labels according to the ratio of the areas cropped from the two images. However, area-based labels entirely ignore the inherent semantic information of the augmented samples and often produce misleading training signals. To address this issue, we propose Contrastive CutMix (ConCutMix), which constructs augmented samples with semantically consistent labels to boost long-tailed recognition performance. Specifically, we compute the similarities between samples in the semantic space learned by contrastive learning and use them to rectify the area-based labels. Experiments show that ConCutMix significantly improves accuracy on tail classes as well as overall performance. For example, with ResNeXt-50, we improve the overall accuracy on ImageNet-LT by 3.0%, thanks to a significant improvement of 3.3% on tail classes. The improvement also generalizes well to other benchmarks and models. Our code and pretrained models are available at https://github.com/PanHaulin/ConCutMix.
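The labeling idea in the abstract can be illustrated with a small, dependency-free sketch. The `cutmix` function follows the standard CutMix recipe, pasting a box from one image onto another and weighting the label by area. The abstract does not give the rectification formula, so `semantic_label` and `rectify` are hypothetical stand-ins: the per-class centers, the `temperature`, the mixing weight `alpha`, and the convex combination are all assumptions made for illustration and are not the authors' formulation.

```python
import math
import random


def cutmix(img_a, img_b, label_a, label_b, num_classes, lam, rng):
    """Paste a random box from img_b onto img_a; return the image and an area-based soft label."""
    h, w = len(img_a), len(img_a[0])
    # Box side lengths chosen so the box covers roughly (1 - lam) of the image.
    cut_h = int(h * math.sqrt(1.0 - lam))
    cut_w = int(w * math.sqrt(1.0 - lam))
    cy, cx = rng.randrange(h), rng.randrange(w)
    top, bottom = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, h)
    left, right = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, w)
    mixed = [row[:] for row in img_a]
    for y in range(top, bottom):
        mixed[y][left:right] = img_b[y][left:right]
    # Area-based label: weights follow the actual pixel share of each source image.
    share_b = (bottom - top) * (right - left) / (h * w)
    label = [0.0] * num_classes
    label[label_a] += 1.0 - share_b
    label[label_b] += share_b
    return mixed, label


def semantic_label(embedding, class_centers, temperature=0.1):
    """Hypothetical: softmax over cosine similarities to per-class centers (non-zero vectors assumed)."""
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

    logits = [cosine(embedding, center) / temperature for center in class_centers]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def rectify(area_label, sem_label, alpha=0.5):
    """Hypothetical: convex combination of the area-based and similarity-based labels."""
    return [(1.0 - alpha) * a + alpha * s for a, s in zip(area_label, sem_label)]


if __name__ == "__main__":
    rng = random.Random(0)
    head_img = [[0.0] * 8 for _ in range(8)]  # toy 8x8 "head class" image
    tail_img = [[1.0] * 8 for _ in range(8)]  # toy 8x8 "tail class" image
    mixed, area = cutmix(head_img, tail_img, 0, 2, num_classes=3, lam=0.75, rng=rng)
    # Toy embedding of the mixed image and toy class centers (made up for the demo).
    sem = semantic_label([0.6, 0.8], [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
    print("area-based:", area)
    print("rectified: ", rectify(area, sem))
```

In this sketch the area-based label depends only on how many pixels were pasted, whereas the rectified label also shifts weight toward classes whose centers lie close to the mixed sample in the embedding space. That shift is the kind of semantic correction the abstract describes.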

Similar articles

A dual-branch model with inter- and intra-branch contrastive loss for long-tailed recognition.

Neural Netw. 2023 Nov;168:214-222. doi: 10.1016/j.neunet.2023.09.022. Epub 2023 Sep 21.

A Long-Tailed Image Classification Method Based on Enhanced Contrastive Visual Language.

Sensors (Basel). 2023 Jul 26;23(15):6694. doi: 10.3390/s23156694.

Generalized Parametric Contrastive Learning.

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):7463-7474. doi: 10.1109/TPAMI.2023.3278694. Epub 2024 Nov 6.

ResLT: Residual Learning for Long-Tailed Recognition.

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3695-3706. doi: 10.1109/TPAMI.2022.3174892. Epub 2023 Feb 3.

ChatDiff: A ChatGPT-based diffusion model for long-tailed classification.

Neural Netw. 2025 Jan;181:106794. doi: 10.1016/j.neunet.2024.106794. Epub 2024 Oct 15.

Probabilistic Contrastive Learning for Long-Tailed Visual Recognition.

IEEE Trans Pattern Anal Mach Intell. 2024 Sep;46(9):5890-5904. doi: 10.1109/TPAMI.2024.3369102. Epub 2024 Aug 6.

Open Long-Tailed Recognition in a Dynamic World.

IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1836-1851. doi: 10.1109/TPAMI.2022.3200091. Epub 2024 Feb 6.

Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.

Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.

Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning.

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3753-3767. doi: 10.1109/TPAMI.2022.3176690. Epub 2023 Feb 3.

PMID: 39008385
