Ding Zhengming, Shao Ming, Fu Yun
IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2861-2874. doi: 10.1109/TPAMI.2018.2867870. Epub 2018 Aug 30.
Zero-shot learning for visual recognition, which approaches identifying unseen categories through a shared visual-semantic function learned on the seen categories and is expected to well adapt to unseen categories, has received considerable research attention most recently. However, the semantic gap between discriminant visual features and their underlying semantics is still the biggest obstacle, because there usually exists domain disparity across the seen and unseen classes. To deal with this challenge, we design two-stage generative adversarial networks to enhance the generalizability of semantic dictionary through low-rank embedding for zero-shot learning. In detail, we formulate a novel framework to simultaneously seek a two-stage generative model and a semantic dictionary to connect visual features with their semantics under a low-rank embedding. Our first-stage generative model is able to augment more semantic features for the unseen classes, which are then used to generate more discriminant visual features in the second stage, to expand the seen visual feature space. Therefore, we will be able to seek a better semantic dictionary to constitute the latent basis for the unseen classes based on the augmented semantic and visual data. Finally, our approach could capture a variety of visual characteristics from seen classes that are "ready-to-use" for new classes. Extensive experiments on four zero-shot benchmarks demonstrate that our proposed algorithm outperforms the state-of-the-art zero-shot algorithms.
用于视觉识别的零样本学习最近受到了相当多的研究关注,它通过在已见类别上学习的共享视觉语义函数来识别未见类别,并有望很好地适应未见类别。然而,判别视觉特征与其潜在语义之间的语义鸿沟仍然是最大的障碍,因为在已见类别和未见类别之间通常存在领域差异。为了应对这一挑战,我们设计了两阶段生成对抗网络,通过低秩嵌入来增强语义字典的通用性,以进行零样本学习。具体来说,我们制定了一个新颖的框架,在低秩嵌入下同时寻找一个两阶段生成模型和一个语义字典,以将视觉特征与其语义联系起来。我们的第一阶段生成模型能够为未见类别增强更多语义特征,然后在第二阶段用于生成更具判别力的视觉特征,以扩展已见视觉特征空间。因此,我们将能够基于增强的语义和视觉数据寻找一个更好的语义字典,为未见类别构成潜在基础。最后,我们的方法可以从已见类别中捕捉各种视觉特征,这些特征可“直接用于”新类别。在四个零样本基准上进行的大量实验表明,我们提出的算法优于当前最先进的零样本算法。