利用赫布式上下文门控和指数衰减任务信号对人类进行连续学习建模。

Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals.

机构信息

Department of Experimental Psychology, University of Oxford; Oxford, United Kingdom.

Department of Computational Sciences, Wigner Research Centre for Physics; Budapest, Hungary.

出版信息

PLoS Comput Biol. 2023 Jan 19;19(1):e1010808. doi: 10.1371/journal.pcbi.1010808. eCollection 2023 Jan.

DOI:10.1371/journal.pcbi.1010808

PMID:36656823

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9851563/

Abstract

Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier work on gating in the primate prefrontal cortex, that capture the cost of interleaved training and allow the network to learn two tasks in sequence without forgetting. We augment standard stochastic gradient descent with two algorithmic motifs, so-called "sluggish" task units and a Hebbian training step that strengthens connections between task units and hidden units that encode task-relevant information. We found that the "sluggish" units introduce a switch-cost during training, which biases representations under interleaved training towards a joint representation that ignores the contextual cue, while the Hebbian step promotes the formation of a gating scheme from task units to the hidden layer that produces orthogonal representations which are perfectly guarded against interference. Validating the model on previously published human behavioural data revealed that it matches performance of participants who had been trained on blocked or interleaved curricula, and that these performance differences were driven by misestimation of the true category boundary.

摘要

人类可以在最小的相互干扰下连续学习多个任务，但在同时接受多个任务训练时表现更差。标准的深度神经网络则恰恰相反。在这里，我们受灵长类动物前额叶皮层门控的早期研究启发，为人工神经网络提出了新的计算约束，这些约束捕捉了交错训练的成本，并允许网络在不遗忘的情况下按顺序学习两个任务。我们通过两个算法模式来增强标准随机梯度下降，即所谓的“迟钝”任务单元和赫布学习步骤，该步骤增强了编码任务相关信息的任务单元和隐藏单元之间的连接。我们发现，“迟钝”单元在训练过程中引入了切换成本，该成本使得在交错训练下的表示偏向于忽略上下文提示的联合表示，而赫布学习步骤则促进了从任务单元到隐藏层的门控方案的形成，该方案产生了完全不受干扰的正交表示。在以前发表的人类行为数据上验证该模型表明，它与在分组或交错课程中接受训练的参与者的表现相匹配，并且这些性能差异是由对真实类别边界的错误估计所驱动的。