通过二元视觉语义嵌入实现可扩展的零样本学习。

Shen Fumin, Zhou Xiang, Yu Jun, Yang Yang, Liu Li, Shen Heng Tao

IEEE Trans Image Process. 2019 Feb 18. doi: 10.1109/TIP.2019.2899987.

Zero-shot learning aims to classify visual instances from unseen classes in the absence of training examples. This is typically achieved by directly mapping visual features to a semantic embedding space of classes (e.g., attributes or word vectors), where the similarity between the two modalities can be readily measured. However, the semantic space may not be reliable for recognition due to the noisy class embeddings or visual bias problem. In this work, we propose a novel Binary embedding based Zero-Shot Learning (BZSL) method, which recognizes visual instances from unseen classes through an intermediate discriminative Hamming space. Specifically, BZSL jointly learns two binary coding functions to encode both visual instances and class embeddings into the Hamming space, which well alleviates the visual-semantic bias problem. As a desiring property, classifying an unseen instance thereby can be efficiently done by retrieving its nearest-class codes with minimal Hamming distance. During training, by introducing two auxiliary variables for the coding functions, we formulate an equivalent correlation maximization problem, which admits an analytical solution. The resulting algorithm thus enjoys both highly efficient training and scalable novel class inferring. Extensive experiments on four benchmark datasets, including the full ImageNet Fall 2011 dataset with over 20K unseen classes, demonstrate the superiority of our method on the zero-shot learning task. Particularly, we show that increasing the binary embedding dimension can inevitably improve the recognition accuracy.

零样本学习旨在在没有训练示例的情况下对来自未见类别的视觉实例进行分类。这通常通过将视觉特征直接映射到类别的语义嵌入空间（例如，属性或词向量）来实现，在该空间中可以很容易地测量两种模态之间的相似性。然而，由于嘈杂的类嵌入或视觉偏差问题，语义空间可能对于识别不可靠。在这项工作中，我们提出了一种新颖的基于二进制嵌入的零样本学习（BZSL）方法，该方法通过中间判别汉明空间识别来自未见类别的视觉实例。具体而言，BZSL联合学习两个二进制编码函数，将视觉实例和类嵌入都编码到汉明空间中，这很好地缓解了视觉语义偏差问题。作为一个理想的特性，通过检索具有最小汉明距离的最近类代码，可以有效地对未见实例进行分类。在训练期间，通过为编码函数引入两个辅助变量，我们制定了一个等效的相关性最大化问题，该问题允许解析解。因此，所得算法兼具高效训练和可扩展的新颖类推断能力。在四个基准数据集上进行的广泛实验，包括具有超过20K未见类别的完整ImageNet 2011秋季数据集，证明了我们的方法在零样本学习任务上的优越性。特别地，我们表明增加二进制嵌入维度可以不可避免地提高识别准确率。

相似文献

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.

IEEE Trans Image Process. 2019 Feb 18. doi: 10.1109/TIP.2019.2899987.

Cross-modal distribution alignment embedding network for generalized zero-shot learning.

Neural Netw. 2022 Apr;148:176-182. doi: 10.1016/j.neunet.2022.01.007. Epub 2022 Jan 29.

Zero-Shot Learning via Category-Specific Visual-Semantic Mapping and Label Refinement.

IEEE Trans Image Process. 2018 Sep 28. doi: 10.1109/TIP.2018.2872916.

Attributes learning network for generalized zero-shot learning.

Neural Netw. 2022 Jun;150:112-118. doi: 10.1016/j.neunet.2022.02.018. Epub 2022 Mar 5.

Visual-guided attentive attributes embedding for zero-shot learning.

Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.

Zero-Shot Learning via Latent Space Encoding.

IEEE Trans Cybern. 2019 Oct;49(10):3755-3766. doi: 10.1109/TCYB.2018.2850750. Epub 2018 Jul 16.

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning.

IEEE Trans Image Process. 2018 Jul 31. doi: 10.1109/TIP.2018.2861573.

Generative Zero-Shot Learning via Low-Rank Embedded Semantic Dictionary.

IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2861-2874. doi: 10.1109/TPAMI.2018.2867870. Epub 2018 Aug 30.

Investigating the Bilateral Connections in Generative Zero-Shot Learning.

IEEE Trans Cybern. 2022 Aug;52(8):8167-8178. doi: 10.1109/TCYB.2021.3050803. Epub 2022 Jul 19.

Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection.

Int J Neural Syst. 2023 Jul;33(7):2350035. doi: 10.1142/S0129065723500351. Epub 2023 Jun 14.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.

IEEE Trans Image Process. 2019 Feb 18. doi: 10.1109/TIP.2019.2899987.

Cross-modal distribution alignment embedding network for generalized zero-shot learning.

Neural Netw. 2022 Apr;148:176-182. doi: 10.1016/j.neunet.2022.01.007. Epub 2022 Jan 29.

Zero-Shot Learning via Category-Specific Visual-Semantic Mapping and Label Refinement.

IEEE Trans Image Process. 2018 Sep 28. doi: 10.1109/TIP.2018.2872916.

Attributes learning network for generalized zero-shot learning.

Neural Netw. 2022 Jun;150:112-118. doi: 10.1016/j.neunet.2022.02.018. Epub 2022 Mar 5.

Visual-guided attentive attributes embedding for zero-shot learning.

Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.

Zero-Shot Learning via Latent Space Encoding.

IEEE Trans Cybern. 2019 Oct;49(10):3755-3766. doi: 10.1109/TCYB.2018.2850750. Epub 2018 Jul 16.

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning.

IEEE Trans Image Process. 2018 Jul 31. doi: 10.1109/TIP.2018.2861573.

Generative Zero-Shot Learning via Low-Rank Embedded Semantic Dictionary.

IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2861-2874. doi: 10.1109/TPAMI.2018.2867870. Epub 2018 Aug 30.

Investigating the Bilateral Connections in Generative Zero-Shot Learning.

IEEE Trans Cybern. 2022 Aug;52(8):8167-8178. doi: 10.1109/TCYB.2021.3050803. Epub 2022 Jul 19.

Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection.

Int J Neural Syst. 2023 Jul;33(7):2350035. doi: 10.1142/S0129065723500351. Epub 2023 Jun 14.

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.

作者信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献