Department of Psychology, Princeton Neuroscience Institute, Princeton University, New Jersey, United States.
eLife. 2019 Apr 4;8:e42992. doi: 10.7554/eLife.42992.
Although midbrain dopamine (DA) neurons have been thought to primarily encode reward prediction error (RPE), recent studies have also found movement-related DAergic signals. For example, we recently reported that DA neurons in mice projecting to dorsomedial striatum are modulated by choices contralateral to the recording side. Here, we introduce, and ultimately reject, a candidate resolution for the puzzling RPE vs movement dichotomy, by showing how seemingly movement-related activity might be explained by an action-specific RPE. By considering both choice and RPE on a trial-by-trial basis, we find that DA signals are modulated by contralateral choice in a manner that is distinct from RPE, implying that choice encoding is better explained by movement direction. This fundamental separation between RPE and movement encoding may help shed light on the diversity of functions and dysfunctions of the DA system.
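As a rough illustration of what considering both choice and RPE on a trial-by-trial basis can look like, the sketch below computes a per-trial RPE from a simple two-action Q-learner and then averages a per-trial signal separately for each combination of choice and RPE sign. It is a minimal sketch under stated assumptions, not the analysis used in the study: the delta-rule learner, the learning rate `alpha`, the coding of choices as 0 (ipsilateral) and 1 (contralateral), and the function names `trial_rpes` and `mean_by_choice_and_rpe_sign` are all hypothetical choices made for illustration.

```python
def trial_rpes(choices, rewards, alpha=0.5):
    """Return the reward prediction error (RPE) on each trial.

    Assumes a two-action delta-rule (Q-learning) agent; this model and
    the default learning rate are illustrative assumptions.
    choices: sequence of 0 (ipsilateral) or 1 (contralateral)
    rewards: sequence of numeric outcomes, e.g. 0 or 1
    """
    q = [0.0, 0.0]  # action values, initialised at zero
    rpes = []
    for action, reward in zip(choices, rewards):
        delta = reward - q[action]  # RPE for the chosen action
        rpes.append(delta)
        q[action] += alpha * delta  # update only the chosen action's value
    return rpes


def mean_by_choice_and_rpe_sign(signal, choices, rpes):
    """Average a per-trial signal within each (choice, RPE >= 0) cell."""
    cells = {}
    for value, action, delta in zip(signal, choices, rpes):
        cells.setdefault((action, delta >= 0), []).append(value)
    return {key: sum(vals) / len(vals) for key, vals in cells.items()}


# Toy inputs, made up purely to exercise the functions (not real data).
choices = [1, 1, 0, 1]
rewards = [1, 0, 1, 1]
signal = [2.0, 1.0, 0.5, 2.0]

rpes = trial_rpes(choices, rewards)
cell_means = mean_by_choice_and_rpe_sign(signal, choices, rpes)
```

Comparing cells that share an RPE sign but differ in choice gives a crude view of whether a signal varies with choice beyond what RPE accounts for; a regression with both predictors would be the more usual way to do this.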