用于持续学习的子空间蒸馏

Subspace distillation for continual learning.

作者信息

Roy Kaushik, Simon Christian, Moghadam Peyman, Harandi Mehrtash

机构信息

Monash University, Melbourne, VIC, Australia; CSIRO, Data61, Brisbane, QLD, Australia.

Monash University, Melbourne, VIC, Australia; Australian national University, Canberra, ACT, Australia; CSIRO, Data61, Brisbane, QLD, Australia.

出版信息

Neural Netw. 2023 Oct;167:65-79. doi: 10.1016/j.neunet.2023.07.047. Epub 2023 Aug 6.

DOI:10.1016/j.neunet.2023.07.047

PMID:37625243

Abstract

An ultimate objective in continual learning is to preserve knowledge learned in preceding tasks while learning new tasks. To mitigate forgetting prior knowledge, we propose a novel knowledge distillation technique that takes into the account the manifold structure of the latent/output space of a neural network in learning novel tasks. To achieve this, we propose to approximate the data manifold up-to its first order, hence benefiting from linear subspaces to model the structure and maintain the knowledge of a neural network while learning novel concepts. We demonstrate that the modeling with subspaces provides several intriguing properties, including robustness to noise and therefore effective for mitigating Catastrophic Forgetting in continual learning. We also discuss and show how our proposed method can be adopted to address both classification and segmentation problems. Empirically, we observe that our proposed method outperforms various continual learning methods on several challenging datasets including Pascal VOC, and Tiny-Imagenet. Furthermore, we show how the proposed method can be seamlessly combined with existing learning approaches to improve their performances. The codes of this article will be available at https://github.com/csiro-robotics/SDCL.

摘要

持续学习的一个最终目标是在学习新任务的同时保留在先前任务中学到的知识。为了减轻对先前知识的遗忘，我们提出了一种新颖的知识蒸馏技术，该技术在学习新任务时考虑了神经网络的潜在/输出空间的流形结构。为了实现这一点，我们建议将数据流形近似到其一阶，从而受益于线性子空间来对结构进行建模，并在学习新概念时保持神经网络的知识。我们证明，用子空间进行建模具有几个有趣的特性，包括对噪声的鲁棒性，因此对于减轻持续学习中的灾难性遗忘是有效的。我们还讨论并展示了如何采用我们提出的方法来解决分类和分割问题。从经验上看，我们观察到我们提出的方法在包括Pascal VOC和Tiny-Imagenet在内的几个具有挑战性的数据集上优于各种持续学习方法。此外，我们展示了所提出的方法如何能够无缝地与现有的学习方法相结合以提高它们的性能。本文的代码将在https://github.com/csiro-robotics/SDCL上提供。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

用于持续学习的子空间蒸馏

Subspace distillation for continual learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

用于持续学习的子空间蒸馏

Subspace distillation for continual learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献