生成式多标签零样本学习

Generative Multi-Label Zero-Shot Learning.

作者信息

Gupta Akshita, Narayan Sanath, Khan Salman, Khan Fahad Shahbaz, Shao Ling, van de Weijer Joost

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14611-14624. doi: 10.1109/TPAMI.2023.3295772. Epub 2023 Nov 3.

DOI:10.1109/TPAMI.2023.3295772

Abstract

Multi-label zero-shot learning strives to classify images into multiple unseen categories for which no data is available during training. The test samples can additionally contain seen categories in the generalized variant. Existing approaches rely on learning either shared or label-specific attention from the seen classes. Nevertheless, computing reliable attention maps for unseen classes during inference in a multi-label setting is still a challenge. In contrast, state-of-the-art single-label generative adversarial network (GAN) based approaches learn to directly synthesize the class-specific visual features from the corresponding class attribute embeddings. However, synthesizing multi-label features from GANs is still unexplored in the context of zero-shot setting. When multiple objects occur jointly in a single image, a critical question is how to effectively fuse multi-class information. In this work, we introduce different fusion approaches at the attribute-level, feature-level and cross-level (across attribute and feature-levels) for synthesizing multi-label features from their corresponding multi-label class embeddings. To the best of our knowledge, our work is the first to tackle the problem of multi-label feature synthesis in the (generalized) zero-shot setting. Our cross-level fusion-based generative approach outperforms the state-of-the-art on three zero-shot benchmarks: NUS-WIDE, Open Images and MS COCO. Furthermore, we show the generalization capabilities of our fusion approach in the zero-shot detection task on MS COCO, achieving favorable performance against existing methods.

摘要

多标签零样本学习致力于将图像分类到多个在训练期间没有可用数据的未见类别中。在广义变体中，测试样本还可以包含已见类别。现有方法依赖于从已见类别中学习共享注意力或特定于标签的注意力。然而，在多标签设置的推理过程中为未见类别计算可靠的注意力图仍然是一个挑战。相比之下，基于生成对抗网络（GAN）的最先进的单标签方法学习从相应的类别属性嵌入中直接合成特定于类别的视觉特征。然而，在零样本设置的背景下，从GAN中合成多标签特征仍未得到探索。当多个物体在单个图像中共同出现时，一个关键问题是如何有效地融合多类信息。在这项工作中，我们在属性级别、特征级别和跨级别（跨属性和特征级别）引入了不同的融合方法，用于从相应的多标签类别嵌入中合成多标签特征。据我们所知，我们的工作是首次解决（广义）零样本设置中的多标签特征合成问题。我们基于跨级别融合的生成方法在三个零样本基准测试（NUS-WIDE、开放图像和MS COCO）上优于现有技术。此外，我们展示了我们的融合方法在MS COCO的零样本检测任务中的泛化能力，相对于现有方法取得了良好的性能。

相似文献

Generative Multi-Label Zero-Shot Learning.生成式多标签零样本学习

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14611-14624. doi: 10.1109/TPAMI.2023.3295772. Epub 2023 Nov 3.

Augmented semantic feature based generative network for generalized zero-shot learning.基于增强语义特征的生成网络用于广义零样本学习。

Neural Netw. 2021 Nov;143:1-11. doi: 10.1016/j.neunet.2021.04.014. Epub 2021 Apr 21.

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning.零变分自编码器生成对抗网络：为广义和转导式零样本学习生成未见特征

IEEE Trans Image Process. 2020 Jan 13. doi: 10.1109/TIP.2020.2964429.

Generative Mixup Networks for Zero-Shot Learning.用于零样本学习的生成式混合网络

IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4054-4065. doi: 10.1109/TNNLS.2022.3142181. Epub 2025 Feb 28.

Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection.基于对比学习的零样本检测的Transformer 方法。

Int J Neural Syst. 2023 Jul;33(7):2350035. doi: 10.1142/S0129065723500351. Epub 2023 Jun 14.

Deep Ranking for Image Zero-Shot Multi-Label Classification.用于图像零样本多标签分类的深度排序

IEEE Trans Image Process. 2020 May 14. doi: 10.1109/TIP.2020.2991527.

Generative Zero-Shot Learning via Low-Rank Embedded Semantic Dictionary.基于低秩嵌入语义字典的生成式零样本学习

IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2861-2874. doi: 10.1109/TPAMI.2018.2867870. Epub 2018 Aug 30.

Multi-label zero-shot learning with graph convolutional networks.基于图卷积网络的多标签零样本学习。

Neural Netw. 2020 Dec;132:333-341. doi: 10.1016/j.neunet.2020.09.010. Epub 2020 Sep 21.

Graph embedding based multi-label Zero-shot Learning.基于图嵌入的多标签零样本学习。

Neural Netw. 2023 Oct;167:129-140. doi: 10.1016/j.neunet.2023.08.023. Epub 2023 Aug 19.

Leveraging Balanced Semantic Embedding for Generative Zero-Shot Learning.利用平衡语义嵌入进行生成式零样本学习。

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9575-9582. doi: 10.1109/TNNLS.2022.3208525. Epub 2023 Oct 27.

引用本文的文献

A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images.一种基于胸部X光图像的用于疾病诊断的跨模态深度度量学习模型。

Multimed Tools Appl. 2023 Mar 15:1-22. doi: 10.1007/s11042-023-14790-7.

生成式多标签零样本学习

Generative Multi-Label Zero-Shot Learning.

作者信息

Gupta Akshita, Narayan Sanath, Khan Salman, Khan Fahad Shahbaz, Shao Ling, van de Weijer Joost

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14611-14624. doi: 10.1109/TPAMI.2023.3295772. Epub 2023 Nov 3.

DOI:10.1109/TPAMI.2023.3295772

PMID:37450360

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

生成式多标签零样本学习

Generative Multi-Label Zero-Shot Learning.

作者信息

出版信息

相似文献

引用本文的文献

生成式多标签零样本学习

Generative Multi-Label Zero-Shot Learning.

作者信息

出版信息

相似文献

引用本文的文献