Department of Bioengineering, Imperial College London, South Kensington Campus, London, United Kingdom.
Department of Physiology, Development and Neuroscience, Physiological Laboratory, Cambridge, United Kingdom.
PLoS Comput Biol. 2021 Jun 10;17(6):e1009017. doi: 10.1371/journal.pcbi.1009017. eCollection 2021 Jun.
To survive, animals have to quickly modify their behaviour when the reward changes. The internal representations responsible for this are updated through synaptic weight changes, mediated by certain neuromodulators conveying feedback from the environment. In previous experiments, we discovered a form of hippocampal Spike-Timing-Dependent-Plasticity (STDP) that is sequentially modulated by acetylcholine and dopamine. Acetylcholine facilitates synaptic depression, while dopamine retroactively converts the depression into potentiation. When these experimental findings were implemented as a learning rule in a computational model, our simulations showed that cholinergic-facilitated depression is important for reversal learning. In the present study, we tested the model's prediction by optogenetically inactivating cholinergic neurons in mice during a hippocampus-dependent spatial learning task with changing rewards. We found that reversal learning, but not initial place learning, was impaired, verifying our computational prediction that acetylcholine-modulated plasticity promotes the unlearning of old reward locations. Further, differences in neuromodulator concentrations in the model captured mouse-by-mouse performance variability in the optogenetic experiments. Our line of work sheds light on how neuromodulators enable the learning of new contingencies.
为了生存,动物必须在奖励发生变化时迅速改变行为。负责这一点的内部表示是通过突触权重变化来更新的,这些变化由某些神经调质介导,从环境中传递反馈。在以前的实验中,我们发现了一种海马体尖峰时间依赖可塑性(STDP)的形式,它被乙酰胆碱和多巴胺依次调节。乙酰胆碱促进突触抑制,而多巴胺则将抑制向后转化为增强。当这些实验结果被作为一个学习规则在一个计算模型中实现时,我们的模拟表明,胆碱能促进的抑制对于反转学习很重要。在本研究中,我们通过在具有变化奖励的海马体依赖空间学习任务期间用光遗传学使小鼠的胆碱能神经元失活,来测试模型的预测。我们发现,反转学习,而不是初始位置学习,受到损害,验证了我们的计算预测,即乙酰胆碱调节的可塑性促进了对旧奖励位置的遗忘。此外,模型中神经调质浓度的差异捕捉到了光遗传实验中鼠标间的性能变异性。我们的研究揭示了神经调质如何使新的关联得以学习。