Institute for Infocomm Research (I2R), A*STAR, Singapore.
Indian Institute of Science, Bangalore, India.
Neural Netw. 2022 Nov;155:487-497. doi: 10.1016/j.neunet.2022.08.034. Epub 2022 Sep 6.
Learning continually from a stream of training data or tasks with an ability to learn the unseen classes using a zero-shot learning framework is gaining attention in the literature. It is referred to as continual zero-shot learning (CZSL). Existing CZSL requires clear task-boundary information during training which is not practically feasible. This paper proposes a task-free generalized CZSL (Tf-GCZSL) method with short-term/long-term memory to overcome the requirement of task-boundary in training. A variational autoencoder (VAE) handles the fundamental ZSL tasks. The short-term and long-term memory help to overcome the condition of the task boundary in the CZSL framework. Further, the proposed Tf-GCZSL method combines the concept of experience replay with dark knowledge distillation and regularization to overcome the catastrophic forgetting issues in a continual learning framework. Finally, the Tf-GCZSL uses a fully connected classifier developed using the synthetic features generated at the latent space of the VAE. The performance of the proposed Tf-GCZSL is evaluated in the existing task-agnostic prediction setting and the proposed task-free setting for the generalized CZSL over the five ZSL benchmark datasets. The results clearly indicate that the proposed Tf-GCZSL improves the prediction at least by 12%, 1%, 3%, 4%, and 3% over existing state-of-the-art and baseline methods for CUB, aPY, AWA1, AWA2, and SUN datasets, respectively in both settings (task-agnostic prediction and task-free learning). The source code is available at https://github.com/Chandan-IITI/Tf-GCZSL.
从训练数据或任务流中持续学习,并使用零样本学习框架学习未见类别的能力在文献中受到关注。这被称为持续零样本学习(CZSL)。现有的 CZSL 在训练期间需要明确的任务边界信息,这在实践中是不可行的。本文提出了一种无任务的广义 CZSL(Tf-GCZSL)方法,具有短期/长期记忆,以克服训练中对任务边界的要求。变分自动编码器(VAE)处理基本的 ZSL 任务。短期和长期记忆有助于克服 CZSL 框架中的任务边界条件。此外,所提出的 Tf-GCZSL 方法结合了经验重放的概念与暗知识蒸馏和正则化,以克服连续学习框架中的灾难性遗忘问题。最后,Tf-GCZSL 使用在 VAE 的潜在空间生成的合成特征开发的全连接分类器。在所提出的无任务设置中,在五个 ZSL 基准数据集上对 Tf-GCZSL 在现有的无任务预测设置和广义 CZSL 中的性能进行了评估。结果清楚地表明,在所提出的无任务设置中,与现有最先进的方法和基线方法相比,Tf-GCZSL 在 CUB、aPY、AWA1、AWA2 和 SUN 数据集上的预测分别至少提高了 12%、1%、3%、4%和 3%。代码可在 https://github.com/Chandan-IITI/Tf-GCZSL 上获得。