Histed Mark H, Pasupathy Anitha, Miller Earl K
The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
Neuron. 2009 Jul 30;63(2):244-53. doi: 10.1016/j.neuron.2009.06.019.
Learning from experience requires knowing whether a past action resulted in a desired outcome. The prefrontal cortex and basal ganglia are thought to play key roles in such learning of arbitrary stimulus-response associations. Previous studies have found neural activity in these areas, similar to dopaminergic neurons' signals, that transiently reflects whether a response is correct or incorrect. However, it is unclear how this transient activity, which fades in under a second, influences actions that occur much later. Here, we report that single neurons in both areas show sustained, persistent outcome-related responses. Moreover, single behavioral outcomes influence future neural activity and behavior: behavioral responses are more often correct and single neurons more accurately discriminate between the possible responses when the previous response was correct. These long-lasting signals about trial outcome provide a way to link one action to the next and may allow reward signals to be combined over time to implement successful learning.