Klanker Marianne, Sandberg Tessa, Joosten Ruud, Willuhn Ingo, Feenstra Matthijs, Denys Damiaan
Netherlands Institute for Neuroscience, Institute of the Royal Netherlands Academy of Arts and Sciences, Meibergdreef 47, 1105 BA Amsterdam, The Netherlands; Department of Psychiatry, Academic Medical Center, University of Amsterdam, Postbus 22660, 1100 DD Amsterdam, The Netherlands.
Netherlands Institute for Neuroscience, Institute of the Royal Netherlands Academy of Arts and Sciences, Meibergdreef 47, 1105 BA Amsterdam, The Netherlands.
Neurobiol Learn Mem. 2015 Nov;125:135-45. doi: 10.1016/j.nlm.2015.08.011. Epub 2015 Sep 5.
Striatal dopamine (DA) is central to reward-based learning. Less is known about the contribution of DA to the ability to adapt previously learned behavior in response to changes in the environment, such as a reversal of response-reward contingencies. We hypothesized that DA is involved in the rapid updating of response-reward information essential for successful reversal learning. We trained rats to discriminate between two levers, where lever availability was signaled by a non-discriminative cue. Pressing one lever was always rewarded, whereas the other lever was never rewarded. After reaching stable discrimination performance, a reversal was presented, so that the previously non-rewarded lever was now rewarded and vice versa. We used fast-scan cyclic voltammetry to monitor DA release in the ventromedial striatum. During discrimination performance (pre-reversal), cue presentation induced phasic DA release, whereas reward delivery did not. The opposite pattern was observed post-reversal: Striatal DA release emerged after reward delivery, while cue-induced release diminished. Trial-by-trial analysis showed rapid reinstatement of cue-induced DA release on trials immediately following initial correct responses. This effect of positive feedback was observed in animals that learned the reversal, but not in 'non-learners'. In contrast, neither pre-reversal responding and DA signaling, nor post-reversal DA signaling in response to negative feedback differed between learners and non-learners. Together, we show that phasic DA dynamics in the ventromedial striatum encoding reward-predicting cues are associated with positive feedback during reversal learning. Furthermore, these signals predict individual differences in learning that are not present prior to reversal, suggesting a distinct role for dopamine in the adaptation of previously learned behavior.
纹状体多巴胺(DA)对于基于奖励的学习至关重要。关于DA在适应先前学习行为以应对环境变化(如反应-奖励偶联的逆转)能力方面的贡献,我们所知较少。我们假设DA参与了成功逆转学习所必需的反应-奖励信息的快速更新。我们训练大鼠区分两个杠杆,其中杠杆可用性由非区分性线索表示。按压一个杠杆总是会得到奖励,而另一个杠杆则从未得到奖励。在达到稳定的辨别性能后,进行逆转,使先前未得到奖励的杠杆现在得到奖励,反之亦然。我们使用快速扫描循环伏安法监测腹内侧纹状体中的DA释放。在辨别性能期间(逆转前),线索呈现诱导了阶段性DA释放,而奖励发放则没有。逆转后观察到相反的模式:奖励发放后腹侧纹状体DA释放出现,而线索诱导的释放减少。逐次试验分析表明,在最初正确反应后的试验中,线索诱导的DA释放迅速恢复。在学会逆转的动物中观察到了这种正反馈效应,但在“未学习者”中没有观察到。相比之下,学习者和未学习者在逆转前的反应和DA信号传导,以及逆转后对负反馈的DA信号传导方面均无差异。我们共同表明,腹内侧纹状体中编码奖励预测线索的阶段性DA动态与逆转学习期间的正反馈相关。此外,这些信号预测了逆转前不存在的学习个体差异,表明多巴胺在先前学习行为的适应中具有独特作用。