Baik Sungyong, Choi Myungsub, Choi Janghoon, Kim Heewon, Lee Kyoung Mu
IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1441-1454. doi: 10.1109/TPAMI.2023.3261387. Epub 2024 Feb 6.
The objective of few-shot learning is to design a system that can adapt to a given task with only few examples while achieving generalization. Model-agnostic meta-learning (MAML), which has recently gained the popularity for its simplicity and flexibility, learns a good initialization for fast adaptation to a task under few-data regime. However, its performance has been relatively limited especially when novel tasks are different from tasks previously seen during training. In this work, instead of searching for a better initialization, we focus on designing a better fast adaptation process. Consequently, we propose a new task-adaptive weight update rule that greatly enhances the fast adaptation process. Specifically, we introduce a small meta-network that can generate per-step hyperparameters for each given task: learning rate and weight decay coefficients. The experimental results validate that learning a good weight update rule for fast adaptation is the equally important component that has drawn relatively less attention in the recent few-shot learning approaches. Surprisingly, fast adaptation from random initialization with ALFA can already outperform MAML. Furthermore, the proposed weight-update rule is shown to consistently improve the task-adaptation capability of MAML across diverse problem domains: few-shot classification, cross-domain few-shot classification, regression, visual tracking, and video frame interpolation.
少样本学习的目标是设计一个系统,该系统仅通过少量示例就能适应给定任务并实现泛化。模型无关元学习(MAML)因其简单性和灵活性最近受到欢迎,它学习一个良好的初始化,以便在少数据情况下快速适应任务。然而,其性能相对有限,特别是当新任务与训练期间之前见过的任务不同时。在这项工作中,我们不是寻找更好的初始化,而是专注于设计一个更好的快速适应过程。因此,我们提出了一种新的任务自适应权重更新规则,该规则大大增强了快速适应过程。具体来说,我们引入了一个小型元网络,它可以为每个给定任务生成每一步的超参数:学习率和权重衰减系数。实验结果验证了学习一个用于快速适应的良好权重更新规则是一个同样重要的组成部分,而在最近的少样本学习方法中相对较少受到关注。令人惊讶的是,使用ALFA从随机初始化进行快速适应已经可以优于MAML。此外,所提出的权重更新规则被证明可以在不同的问题领域中持续提高MAML的任务适应能力:少样本分类、跨域少样本分类、回归、视觉跟踪和视频帧插值。