用于细粒度视觉识别的全局协方差池化的特征值研究

On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition.

作者信息

Song Yue, Sebe Nicu, Wang Wei

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3554-3566. doi: 10.1109/TPAMI.2022.3178802. Epub 2023 Feb 3.

DOI:10.1109/TPAMI.2022.3178802

Abstract

The Fine-Grained Visual Categorization (FGVC) is challenging because the subtle inter-class variations are difficult to be captured. One notable research line uses the Global Covariance Pooling (GCP) layer to learn powerful representations with second-order statistics, which can effectively model inter-class differences. In our previous conference paper, we show that truncating small eigenvalues of the GCP covariance can attain smoother gradient and improve the performance on large-scale benchmarks. However, on fine-grained datasets, truncating the small eigenvalues would make the model fail to converge. This observation contradicts the common assumption that the small eigenvalues merely correspond to the noisy and unimportant information. Consequently, ignoring them should have little influence on the performance. To diagnose this peculiar behavior, we propose two attribution methods whose visualizations demonstrate that the seemingly unimportant small eigenvalues are crucial as they are in charge of extracting the discriminative class-specific features. Inspired by this observation, we propose a network branch dedicated to magnifying the importance of small eigenvalues. Without introducing any additional parameters, this branch simply amplifies the small eigenvalues and achieves state-of-the-art performances of GCP methods on three fine-grained benchmarks. Furthermore, the performance is also competitive against other FGVC approaches on larger datasets. Code is available at https://github.com/KingJamesSong/DifferentiableSVD.

摘要

细粒度视觉分类（FGVC）具有挑战性，因为难以捕捉类间的细微差异。一条值得注意的研究路线使用全局协方差池化（GCP）层，通过二阶统计量来学习强大的表示，这可以有效地对类间差异进行建模。在我们之前的会议论文中，我们表明截断GCP协方差的小特征值可以获得更平滑的梯度，并提高在大规模基准测试中的性能。然而，在细粒度数据集上，截断小特征值会导致模型无法收敛。这一观察结果与通常的假设相矛盾，即小特征值仅对应于噪声和不重要的信息。因此，忽略它们对性能应该影响不大。为了诊断这种特殊行为，我们提出了两种归因方法，其可视化结果表明，看似不重要的小特征值至关重要，因为它们负责提取有区分力的类特定特征。受此观察结果的启发，我们提出了一个专门用于放大小特征值重要性的网络分支。该分支在不引入任何额外参数的情况下，简单地放大小特征值，并在三个细粒度基准测试中取得了GCP方法的最优性能。此外，在更大的数据集上，该性能与其他FGVC方法相比也具有竞争力。代码可在https://github.com/KingJamesSong/DifferentiableSVD获取。

相似文献

On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition.用于细粒度视觉识别的全局协方差池化的特征值研究

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3554-3566. doi: 10.1109/TPAMI.2022.3178802. Epub 2023 Feb 3.

SIM-OFE: Structure Information Mining and Object-Aware Feature Enhancement for Fine-Grained Visual Categorization.SIM-OFE：用于细粒度视觉分类的结构信息挖掘与目标感知特征增强

IEEE Trans Image Process. 2024;33:5312-5326. doi: 10.1109/TIP.2024.3459788. Epub 2024 Sep 27.

Fine-Grained Recognition With Learnable Semantic Data Augmentation.基于可学习语义数据增强的细粒度识别

IEEE Trans Image Process. 2024;33:3130-3144. doi: 10.1109/TIP.2024.3364500. Epub 2024 Apr 30.

Multi-Scale Feature Fusion of Covariance Pooling Networks for Fine-Grained Visual Recognition.协方差池化网络的多尺度特征融合用于细粒度视觉识别。

Sensors (Basel). 2023 Apr 13;23(8):3970. doi: 10.3390/s23083970.

Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification.用于细粒度视觉分类的类别一致多粒度特征的渐进式学习。

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9521-9535. doi: 10.1109/TPAMI.2021.3126668. Epub 2022 Nov 7.

Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization.用于细粒度视觉分类的多分辨率判别混合网络

IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3488-3500. doi: 10.1109/TNNLS.2021.3112768. Epub 2023 Jul 6.

Orthogonal SVD Covariance Conditioning and Latent Disentanglement.正交 SVD 协方差条件化和潜在解缠结。

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8773-8786. doi: 10.1109/TPAMI.2022.3228979. Epub 2023 Jun 5.

Multi-Objective Matrix Normalization for Fine-grained Visual Recognition.用于细粒度视觉识别的多目标矩阵归一化

IEEE Trans Image Process. 2020 Mar 6. doi: 10.1109/TIP.2020.2977457.

SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image Categorization.SR-GNN：用于细粒度图像分类的空间关系感知图神经网络

IEEE Trans Image Process. 2022 Sep 14;PP. doi: 10.1109/TIP.2022.3205215.

Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification.用于细粒度视觉分类的多粒度部分采样注意力机制

IEEE Trans Image Process. 2024;33:4529-4542. doi: 10.1109/TIP.2024.3441813. Epub 2024 Aug 23.

引用本文的文献

Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition.交织洞察：用于细粒度视觉识别的高阶特征交互

Int J Comput Vis. 2025;133(4):1755-1779. doi: 10.1007/s11263-024-02260-y. Epub 2024 Oct 20.

Learning to integrate parts for whole through correlated neural variability.通过相关的神经变异性来学习整合整体部分。

PLoS Comput Biol. 2024 Sep 3;20(9):e1012401. doi: 10.1371/journal.pcbi.1012401. eCollection 2024 Sep.

Truck model recognition for an automatic overload detection system based on the improved MMAL-Net.基于改进型MMAL-Net的自动过载检测系统的卡车型号识别

Front Neurosci. 2023 Aug 10;17:1243847. doi: 10.3389/fnins.2023.1243847. eCollection 2023.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于细粒度视觉识别的全局协方差池化的特征值研究

On the Eigenvalues of Global Covariance Pooling for Fine-Grained Visual Recognition.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献