IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1443-1456. doi: 10.1109/TPAMI.2020.3018506. Epub 2022 Feb 3.
Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which only a few labeled samples are available. As deep neural networks (DNNs) tend to overfit using a few samples only, typical meta-learning models use shallow neural networks, thus limiting its effectiveness. In order to achieve top performance, some recent works tried to use the DNNs pre-trained on large-scale datasets but mostly in straight-forward manners, e.g., (1) taking their weights as a warm start of meta-training, and (2) freezing their convolutional layers as the feature extractor of base-learners. In this paper, we propose a novel approach called meta-transfer learning (MTL), which learns to transfer the weights of a deep NN for few-shot learning tasks. Specifically, meta refers to training multiple tasks, and transfer is achieved by learning scaling and shifting functions of DNN weights (and biases) for each task. To further boost the learning efficiency of MTL, we introduce the hard task (HT) meta-batch scheme as an effective learning curriculum of few-shot classification tasks. We conduct experiments for five-class few-shot classification tasks on three challenging benchmarks, miniImageNet, tieredImageNet, and Fewshot-CIFAR100 (FC100), in both supervised and semi-supervised settings. Extensive comparisons to related works validate that our MTL approach trained with the proposed HT meta-batch scheme achieves top performance. An ablation study also shows that both components contribute to fast convergence and high accuracy.
元学习已被提出作为一种解决具有挑战性的少样本学习设置的框架。其关键思想是利用大量相似的少样本任务,以学习如何将基础学习器适应只有少量标记样本的新任务。由于深度神经网络(DNN)仅使用少量样本就容易过拟合,因此典型的元学习模型使用浅层神经网络,从而限制了其有效性。为了实现最佳性能,一些最近的工作尝试使用在大规模数据集上预训练的 DNN,但大多是直接的方法,例如:(1)将其权重作为元训练的热身起点,(2)将其卷积层冻结为基础学习器的特征提取器。在本文中,我们提出了一种称为元迁移学习(MTL)的新方法,该方法旨在学习将深度神经网络的权重用于少样本学习任务。具体来说,元是指训练多个任务,而转移是通过学习 DNN 权重(和偏差)的缩放和移位函数来实现的。为了进一步提高 MTL 的学习效率,我们引入了硬任务(HT)元批量方案作为少样本分类任务的有效学习课程。我们在三个具有挑战性的基准(miniImageNet、tieredImageNet 和 Fewshot-CIFAR100(FC100))上进行了五类少样本分类任务的实验,包括有监督和半监督设置。与相关工作的广泛比较验证了我们使用所提出的 HT 元批量方案训练的 MTL 方法的最佳性能。消融研究还表明,这两个组件都有助于快速收敛和高精度。