University of California Los Angeles, Department of Psychology, Los Angeles, CA 90095, USA.
Biol Psychiatry. 2012 May 15;71(10):846-54. doi: 10.1016/j.biopsych.2011.12.019. Epub 2012 Feb 2.
Sequential reward-seeking actions are readily learned despite the temporal gap between the earliest (distal) action in the sequence and the reward delivery. Fast dopamine signaling is hypothesized to mediate this form of learning by reporting errors in reward prediction. However, such a role for dopamine release in voluntarily initiated action sequences remains to be demonstrated.
Using fast-scan cyclic voltammetry, we monitored phasic mesolimbic dopamine release, in real time, as rats performed a self-initiated sequence of lever presses to earn sucrose rewards. Before testing, rats received either 0 (n = 11), 5 (n = 11), or 10 (n = 8) days of action sequence training.
For rats acquiring the action sequence task at test, dopamine release was strongly elicited by response-contingent (but unexpected) rewards. With learning, a significant elevation in dopamine release preceded performance of the proximal action and subsequently came to precede the distal action. This predistal dopamine release response was also observed in rats previously trained on the action sequence task, and the amplitude of this signal predicted the latency with which rats completed the action sequence. Importantly, the dopamine response to contingent reward delivery was not observed in rats given extensive pretraining. Pharmacological analysis confirmed that task performance was dopamine-dependent.
These data suggest that phasic mesolimbic dopamine release mediates the influence that rewards exert over the performance of self-paced, sequentially-organized behavior and sheds light on how dopamine signaling abnormalities may contribute to disorders of behavioral control.
尽管序列中最早的(远端)动作与奖励传递之间存在时间间隔,但顺序奖励寻求行为很容易被学习。快速多巴胺信号被假设通过报告奖励预测中的错误来介导这种形式的学习。然而,多巴胺释放在自愿发起的动作序列中的这种作用仍有待证明。
使用快速扫描循环伏安法,我们实时监测大鼠执行自我发起的杠杆按压序列以赚取蔗糖奖励时的相位中脑边缘多巴胺释放。在测试之前,大鼠接受 0(n = 11)、5(n = 11)或 10(n = 8)天的动作序列训练。
对于在测试中学习动作序列任务的大鼠,响应相关(但意外)的奖励强烈激发了多巴胺释放。随着学习的进行,近端动作之前会出现明显的多巴胺释放升高,随后会先于远端动作。在之前接受过动作序列任务训练的大鼠中也观察到这种预远端多巴胺释放反应,并且该信号的幅度预测了大鼠完成动作序列的潜伏期。重要的是,在给予广泛的预训练的大鼠中没有观察到对 contingent 奖励传递的多巴胺反应。药理学分析证实,任务表现是多巴胺依赖性的。
这些数据表明,相位中脑边缘多巴胺释放介导了奖励对自我调节、顺序组织行为的影响,并揭示了多巴胺信号异常如何导致行为控制障碍。