Amodeo L R, McMurray M S, Roitman J D
Department of Psychology, University of Illinois at Chicago, 1007 West Harrison Street, Chicago, IL 60607, USA.
Department of Psychology, Miami University, 90 N Patterson Avenue, Oxford, OH 45056, USA.
Neuroscience. 2017 Mar 14;345:27-37. doi: 10.1016/j.neuroscience.2016.03.034. Epub 2016 Mar 17.
In a continuously changing environment, in which behavioral outcomes are rarely certain, animals must be able to learn to integrate feedback from their choices over time and adapt to changing reward contingencies to maintain flexible behavior. The orbitofrontal region of prefrontal cortex (OFC) has been widely implicated as playing a role in the ability to flexibly control behavior. We used a probabilistic reversal learning task to measure rats' behavioral flexibility and its neural basis in the activity of single neurons in OFC. In this task, one lever, designated as 'correct', was rewarded at a high probability (80%) and a second, spatially distinct lever, designated as 'incorrect', was rewarded at a low probability (20%). Once rats reached a learning criterion for reliably selecting the correct lever, reward contingencies of the two levers were switched, and daily sessions were conducted until rats reliably selected the new correct lever. All rats performed the initial Acquisition and subsequent Reversal successfully, with more sessions needed to learn the Reversal. OFC neurons were recorded during five behavioral sessions spanning Acquisition and Reversal learning. The dominant pattern of neural responding in OFC, identified by principal component analysis of the population of neurons recorded, was modulated by reward outcome across behavioral sessions. Generally, activity was higher following rewarded choices than unrewarded. However, there was a correlation between reduced responses to reward following incorrect choices and the establishment of the preference for the correct lever. These results show how signaling by individual OFC neurons may participate in the flexible adaptation of behavior under changing reward contingencies.
在一个不断变化的环境中,行为结果很少是确定的,动物必须能够学会随着时间的推移整合来自其选择的反馈,并适应不断变化的奖励条件以维持灵活的行为。前额叶皮层的眶额区域(OFC)被广泛认为在灵活控制行为的能力中发挥作用。我们使用概率性反转学习任务来测量大鼠的行为灵活性及其在OFC单个神经元活动中的神经基础。在这个任务中,一个被指定为“正确”的杠杆以高概率(80%)得到奖励,另一个在空间上不同的杠杆,被指定为“错误”,以低概率(20%)得到奖励。一旦大鼠达到可靠选择正确杠杆的学习标准,两个杠杆的奖励条件就会切换,并进行每日实验,直到大鼠可靠地选择新的正确杠杆。所有大鼠都成功完成了初始习得和随后的反转,学习反转需要更多的实验次数。在跨越习得和反转学习的五个行为实验过程中记录了OFC神经元。通过对记录的神经元群体进行主成分分析确定的OFC中神经反应的主导模式,在不同行为实验中受到奖励结果的调节。一般来说,奖励选择后的活动高于无奖励选择后的活动。然而,错误选择后对奖励反应的降低与对正确杠杆偏好的建立之间存在相关性。这些结果表明了单个OFC神经元的信号传递可能如何参与在不断变化的奖励条件下行为的灵活适应。