Li Xiaoxu, Wu Jijie, Sun Zhuo, Ma Zhanyu, Cao Jie, Xue Jing-Hao
IEEE Trans Image Process. 2021;30:1318-1331. doi: 10.1109/TIP.2020.3043128. Epub 2020 Dec 23.
Few-shot learning for fine-grained image classification has gained recent attention in computer vision. Among the approaches for few-shot learning, due to the simplicity and effectiveness, metric-based methods are favorably state-of-the-art on many tasks. Most of the metric-based methods assume a single similarity measure and thus obtain a single feature space. However, if samples can simultaneously be well classified via two distinct similarity measures, the samples within a class can distribute more compactly in a smaller feature space, producing more discriminative feature maps. Motivated by this, we propose a so-called Bi-Similarity Network (BSNet) that consists of a single embedding module and a bi-similarity module of two similarity measures. After the support images and the query images pass through the convolution-based embedding module, the bi-similarity module learns feature maps according to two similarity measures of diverse characteristics. In this way, the model is enabled to learn more discriminative and less similarity-biased features from few shots of fine-grained images, such that the model generalization ability can be significantly improved. Through extensive experiments by slightly modifying established metric/similarity based networks, we show that the proposed approach produces a substantial improvement on several fine-grained image benchmark datasets. Codes are available at: https://github.com/PRIS-CV/BSNet.
少样本学习用于细粒度图像分类最近在计算机视觉领域受到了关注。在少样本学习的方法中,基于度量的方法由于简单有效,在许多任务上处于领先地位。大多数基于度量的方法假设单一的相似性度量,从而获得单一的特征空间。然而,如果样本可以通过两种不同的相似性度量同时得到很好的分类,那么一个类中的样本可以在更小的特征空间中更紧凑地分布,从而产生更具判别力的特征图。受此启发,我们提出了一种所谓的双相似性网络(BSNet),它由一个单一的嵌入模块和一个具有两种相似性度量的双相似性模块组成。支持图像和查询图像通过基于卷积的嵌入模块后,双相似性模块根据两种不同特征的相似性度量学习特征图。通过这种方式,该模型能够从少量细粒度图像中学习到更具判别力且更少相似性偏差的特征,从而显著提高模型的泛化能力。通过对基于度量/相似性的现有网络进行微小修改的大量实验,我们表明所提出的方法在几个细粒度图像基准数据集上取得了显著改进。代码可在以下网址获取:https://github.com/PRIS-CV/BSNet。