A Discriminative Cross-Aligned Variational Autoencoder for Zero-Shot Learning.

Authors

Liu Yang, Gao Xinbo, Han Jungong, Shao Ling

Publication information

IEEE Trans Cybern. 2023 Jun;53(6):3794-3805. doi: 10.1109/TCYB.2022.3164142. Epub 2023 May 17.

DOI:10.1109/TCYB.2022.3164142

Abstract

Zero-shot learning (ZSL) aims to classify unseen samples based on the relationship between learned visual features and semantic features. Traditional ZSL methods typically capture the underlying multimodal data structure by learning an embedding function between the visual space and the semantic space under the Euclidean metric. However, these models suffer from the hubness problem and the domain bias problem, which lead to unsatisfactory performance, especially in the generalized ZSL (GZSL) task. To tackle these problems, we formulate a discriminative cross-aligned variational autoencoder (DCA-VAE) for ZSL. The proposed model uses a modified cross-modal-alignment variational autoencoder (VAE) to transform both visual features and semantic features, obtained with a discriminative cosine metric, into latent features. The key to our method is that we collect the principal discriminative information from visual and semantic features to construct latent features that contain the discriminative multimodal information associated with unseen samples. Finally, DCA-VAE is validated on six benchmarks, including the large-scale ImageNet dataset, and the experimental results demonstrate the superiority of DCA-VAE over most existing embedding-based and generative ZSL models on the standard ZSL task and the more realistic GZSL task.
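The contrast the abstract draws between the Euclidean metric and a cosine metric can be illustrated with a small nearest-prototype classifier. This is only a sketch under stated assumptions, not the DCA-VAE model: the class names, the two-dimensional prototypes, and the helper functions `classify_cosine` and `classify_euclidean` are hypothetical and chosen for illustration. It shows that a prototype lying near the centre of the space can win under Euclidean distance for queries that point clearly toward another class, while cosine similarity, which ignores vector length, picks the class with the closest direction.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def euclidean_distance(u, v):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def classify_cosine(feature, prototypes):
    """Label of the prototype with the highest cosine similarity."""
    return max(prototypes, key=lambda c: cosine_similarity(feature, prototypes[c]))


def classify_euclidean(feature, prototypes):
    """Label of the prototype with the smallest Euclidean distance."""
    return min(prototypes, key=lambda c: euclidean_distance(feature, prototypes[c]))


# Hypothetical class prototypes in a shared embedding space (toy values).
prototypes = {
    "zebra": [1.0, 0.0],
    "whale": [0.0, 1.0],
    "horse": [0.4, 0.4],  # sits near the centre, close to many queries
}

# Hypothetical embedded test features (toy values).
queries = [[0.6, 0.2], [0.2, 0.6]]

for q in queries:
    print(q, "cosine:", classify_cosine(q, prototypes),
          "euclidean:", classify_euclidean(q, prototypes))
```

In this toy setting both queries are assigned to the central "horse" prototype under Euclidean distance, whereas the cosine rule assigns them to "zebra" and "whale" respectively. Real ZSL pipelines operate on high-dimensional learned features, so the example only conveys the intuition behind preferring an angular metric.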

Similar articles

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning.

IEEE Trans Image Process. 2020 Jan 13. doi: 10.1109/TIP.2020.2964429.

Zero-Shot Learning With Attentive Region Embedding and Enhanced Semantics.

IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):4220-4231. doi: 10.1109/TNNLS.2022.3202014. Epub 2024 Feb 29.

Zero-Shot Learning via Robust Latent Representation and Manifold Regularization.

IEEE Trans Image Process. 2019 Apr;28(4):1824-1836. doi: 10.1109/TIP.2018.2881926. Epub 2018 Nov 16.

Modality independent adversarial network for generalized zero shot image classification.

Neural Netw. 2021 Feb;134:11-22. doi: 10.1016/j.neunet.2020.11.007. Epub 2020 Nov 21.

Leveraging Balanced Semantic Embedding for Generative Zero-Shot Learning.

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9575-9582. doi: 10.1109/TNNLS.2022.3208525. Epub 2023 Oct 27.

Cross-modal distribution alignment embedding network for generalized zero-shot learning.

Neural Netw. 2022 Apr;148:176-182. doi: 10.1016/j.neunet.2022.01.007. Epub 2022 Jan 29.

Augmented semantic feature based generative network for generalized zero-shot learning.

Neural Netw. 2021 Nov;143:1-11. doi: 10.1016/j.neunet.2021.04.014. Epub 2021 Apr 21.

Zero-Shot Learning on Semantic Class Prototype Graph.

IEEE Trans Pattern Anal Mach Intell. 2018 Aug;40(8):2009-2022. doi: 10.1109/TPAMI.2017.2737007. Epub 2017 Aug 7.

Visual-guided attentive attributes embedding for zero-shot learning.

Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.
