Penalizing the Hard Example But Not Too Much: A Strong Baseline for Fine-Grained Visual Classification.

Suppr

超能文献

作者信息

Liang Yuanzhi, Zhu Linchao, Wang Xiaohan, Yang Yi

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):7048-7059. doi: 10.1109/TNNLS.2022.3213563. Epub 2024 May 2.

DOI:10.1109/TNNLS.2022.3213563

PMID:36409807

Abstract

Though significant progress has been achieved on fine-grained visual classification (FGVC), severe overfitting still hinders model generalization. A recent study shows that hard samples in the training set can be easily fit, but most existing FGVC methods fail to classify some hard examples in the test set. The reason is that the model overfits those hard examples in the training set, but does not learn to generalize to unseen examples in the test set. In this article, we propose a moderate hard example modulation (MHEM) strategy to properly modulate the hard examples. MHEM encourages the model to not overfit hard examples and offers better generalization and discrimination. First, we introduce three conditions and formulate a general form of a modulated loss function. Second, we instantiate the loss function and provide a strong baseline for FGVC, where the performance of a naive backbone can be boosted and be comparable with recent methods. Moreover, we demonstrate that our baseline can be readily incorporated into the existing methods and empower these methods to be more discriminative. Equipped with our strong baseline, we achieve consistent improvements on three typical FGVC datasets, i.e., CUB-200-2011, Stanford Cars, and FGVC-Aircraft. We hope the idea of moderate hard example modulation will inspire future research work toward more effective fine-grained visual recognition.

摘要

相似文献

Penalizing the Hard Example But Not Too Much: A Strong Baseline for Fine-Grained Visual Classification.

IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):7048-7059. doi: 10.1109/TNNLS.2022.3213563. Epub 2024 May 2.

Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization.用于细粒度视觉分类的多分辨率判别混合网络

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3488-3500. doi: 10.1109/TNNLS.2021.3112768. Epub 2023 Jul 6.

Fine-Grained Recognition With Learnable Semantic Data Augmentation.基于可学习语义数据增强的细粒度识别

IEEE Trans Image Process. 2024;33:3130-3144. doi: 10.1109/TIP.2024.3364500. Epub 2024 Apr 30.

AP-CNN: Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification.AP-CNN：用于细粒度视觉分类的弱监督注意力金字塔卷积神经网络。

IEEE Trans Image Process. 2021;30:2826-2836. doi: 10.1109/TIP.2021.3055617. Epub 2021 Feb 12.

Image local structure information learning for fine-grained visual classification.细粒度视觉分类中的图像局部结构信息学习。

Sci Rep. 2022 Nov 10;12(1):19205. doi: 10.1038/s41598-022-23835-0.

The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification.问题出在通道上：用于细粒度图像分类的互通道损失

IEEE Trans Image Process. 2020 Feb 20. doi: 10.1109/TIP.2020.2973812.

SIM-OFE: Structure Information Mining and Object-Aware Feature Enhancement for Fine-Grained Visual Categorization.SIM-OFE：用于细粒度视觉分类的结构信息挖掘与目标感知特征增强

IEEE Trans Image Process. 2024;33:5312-5326. doi: 10.1109/TIP.2024.3459788. Epub 2024 Sep 27.

Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification.用于细粒度视觉分类的类别一致多粒度特征的渐进式学习。

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9521-9535. doi: 10.1109/TPAMI.2021.3126668. Epub 2022 Nov 7.

Dual-Dependency Attention Transformer for Fine-Grained Visual Classification.用于细粒度视觉分类的双依赖注意力变换器

Sensors (Basel). 2024 Apr 6;24(7):2337. doi: 10.3390/s24072337.

Semi-Supervised Learning for FGVC With Out-of-Category Data.用于带有类别外数据的细粒度视觉分类的半监督学习

IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):2658-2671. doi: 10.1109/TPAMI.2023.3322463. Epub 2024 Apr 3.

引用本文的文献

Labor linkages and flow paths of industry in China.中国产业的劳动力联系与流动路径。

Heliyon. 2024 Apr 26;10(9):e30118. doi: 10.1016/j.heliyon.2024.e30118. eCollection 2024 May 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验