Kushawaha Nilay, Fruzzetti Lorenzo, Donato Enrico, Falotico Egidio
IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):13162-13176. doi: 10.1109/TNNLS.2024.3446171.
Catastrophic forgetting is a phenomenon in which a neural network, upon learning a new task, struggles to maintain its performance on previously learned tasks. It is a common challenge in the realm of continual learning (CL) through neural networks. The mammalian brain addresses catastrophic forgetting by consolidating memories in different parts of the brain, involving the hippocampus and the neocortex. Taking inspiration from this brain strategy, we present a CL framework that combines a plastic model simulating the fast learning capabilities of the hippocampus and a stable model representing the slow consolidation nature of the neocortex. To supplement this, we introduce a variational autoencoder (VAE)-based pseudo memory for rehearsal purposes. In addition by applying lateral inhibition masks on the gradients of the convolutional layer, we aim at damping the activity of adjacent neurons and introduce a sleep phase to reorganize the learned representations. Empirical evaluation demonstrates the positive impact of such additions on the performance of our proposed framework; we evaluate the proposed model on several class-incremental and domain-incremental datasets and compare it with the standard benchmark algorithms, showing significant improvements. With the aim to showcase practical applicability, we implement the algorithm in a physical environment for object classification using a soft pneumatic gripper. The algorithm learns new classes incrementally in real time and also exhibits significant backward knowledge transfer (KT).
灾难性遗忘是一种现象,即神经网络在学习新任务时,难以维持其在先前学习任务上的表现。这是通过神经网络进行持续学习(CL)领域中的一个常见挑战。哺乳动物的大脑通过在大脑的不同部位(包括海马体和新皮层)巩固记忆来应对灾难性遗忘。受这种大脑策略的启发,我们提出了一个持续学习框架,该框架结合了一个模拟海马体快速学习能力的可塑性模型和一个代表新皮层缓慢巩固特性的稳定模型。为了补充这一点,我们引入了一个基于变分自编码器(VAE)的伪记忆用于排练。此外,通过在卷积层的梯度上应用侧向抑制掩码,我们旨在抑制相邻神经元的活动,并引入一个睡眠阶段来重新组织学习到的表征。实证评估证明了这些补充对我们提出的框架性能的积极影响;我们在几个类别增量和领域增量数据集上评估了所提出的模型,并将其与标准基准算法进行比较,显示出显著的改进。为了展示实际适用性,我们在一个使用软气动夹具进行物体分类的物理环境中实现了该算法。该算法实时增量式地学习新类别,并且还表现出显著的反向知识迁移(KT)。