Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, New York, United States of America.
Neurosciences Program, Stanford University, Stanford, California, United States of America.
PLoS Comput Biol. 2021 Aug 10;17(8):e1009205. doi: 10.1371/journal.pcbi.1009205. eCollection 2021 Aug.
The Drosophila mushroom body exhibits dopamine dependent synaptic plasticity that underlies the acquisition of associative memories. Recordings of dopamine neurons in this system have identified signals related to external reinforcement such as reward and punishment. However, other factors including locomotion, novelty, reward expectation, and internal state have also recently been shown to modulate dopamine neurons. This heterogeneity is at odds with typical modeling approaches in which these neurons are assumed to encode a global, scalar error signal. How is dopamine dependent plasticity coordinated in the presence of such heterogeneity? We develop a modeling approach that infers a pattern of dopamine activity sufficient to solve defined behavioral tasks, given architectural constraints informed by knowledge of mushroom body circuitry. Model dopamine neurons exhibit diverse tuning to task parameters while nonetheless producing coherent learned behaviors. Notably, reward prediction error emerges as a mode of population activity distributed across these neurons. Our results provide a mechanistic framework that accounts for the heterogeneity of dopamine activity during learning and behavior.
果蝇的蘑菇体表现出多巴胺依赖的突触可塑性,这种可塑性是联想记忆获得的基础。对该系统中多巴胺神经元的记录已经确定了与外部强化(如奖励和惩罚)相关的信号。然而,其他因素,包括运动、新奇性、奖励预期和内部状态,最近也被证明可以调节多巴胺神经元。这种异质性与典型的建模方法不一致,在典型的建模方法中,这些神经元被假设编码一个全局的、标量的误差信号。在存在这种异质性的情况下,多巴胺依赖性可塑性是如何协调的?我们开发了一种建模方法,该方法根据蘑菇体电路知识提供的架构约束,推断出足以解决定义的行为任务的多巴胺活动模式。模型多巴胺神经元对任务参数表现出多样化的调谐,但仍能产生一致的学习行为。值得注意的是,奖励预测误差作为一种分布在这些神经元中的群体活动模式出现。我们的研究结果提供了一个机械框架,解释了学习和行为过程中多巴胺活动的异质性。