Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, 1105 BA Amsterdam, The Netherlands.
Department of Psychiatry, Amsterdam University Medical Centers, University of Amsterdam, 1105 AZ Amsterdam, The Netherlands.
J Neurosci. 2023 May 24;43(21):3922-3932. doi: 10.1523/JNEUROSCI.1527-22.2023. Epub 2023 Apr 25.
The mesolimbic dopamine system is implicated in signaling reward-related information as well as in actions that generate rewarding outcomes. These implications are commonly investigated in either pavlovian or operant reinforcement paradigms, where only the latter requires instrumental action. To parse contributions of reward- and action-related information to dopamine signals, we directly compared the two paradigms: male rats underwent either pavlovian or operant conditioning while dopamine release was measured in the nucleus accumbens, a brain region central for processing this information. Task conditions were identical with the exception of the operant-lever response requirement. Rats in both groups released the same quantity of dopamine at the onset of the reward-predictive cue. However, only the operant-conditioning group showed a subsequent, sustained plateau in dopamine concentration throughout the entire 5 s cue presentation (preceding the required action). This dopamine ramp was unaffected by probabilistic reward delivery, occurred exclusively before operant actions, and was not related to task performance or task acquisition as it persisted throughout the 2 week daily behavioral training. Instead, the ramp flexibly increased in duration with longer cue presentation, seemingly modulating the initial cue-onset-triggered dopamine release, that is, the reward prediction error (RPE) signal, as both signal amplitude and sustainment diminished when reward timing was made more predictable. Thus, our findings suggest that RPE and action components of dopamine release can be differentiated temporally into phasic and ramping/sustained signals, respectively, where the latter depends on the former and presumably reflects the anticipation or incentivization of appetitive action, conceptually akin to motivation. It is unclear whether the components of dopamine signals that are related to reward-associated information and reward-driven approach behavior can be separated. Most studies investigating the dopamine system use either pavlovian or operant conditioning, which both involve the delivery of reward and necessitate appetitive approach behavior. Thus, used exclusively, neither paradigm can disentangle the contributions of these components to dopamine release. However, by combining both paradigms in the same study, we find that anticipation of a reward-driven operant action induces a modulation of reward-prediction-associated dopamine release, producing so-called dopamine ramps. Therefore, our findings provide new insight into dopamine ramps and suggest that dopamine signals integrate reward and appetitive action in a temporally distinguishable, yet dependent, manner.
中脑边缘多巴胺系统参与信号传递与奖励相关的信息,以及产生奖励结果的行为。这些含义通常在巴甫洛夫或操作性强化范式中进行研究,后者仅需要工具性动作。为了解析奖励和动作相关信息对多巴胺信号的贡献,我们直接比较了这两种范式:雄性大鼠接受巴甫洛夫或操作性条件作用,同时测量伏隔核中的多巴胺释放,伏隔核是处理这种信息的大脑区域的中心。任务条件相同,除了操作性杠杆反应的要求。两组大鼠在奖励预测线索开始时释放相同数量的多巴胺。然而,只有操作性条件作用组在整个 5 秒线索呈现期间(在所需动作之前)显示出随后的持续多巴胺浓度平台。这种多巴胺斜坡不受概率奖励传递的影响,仅在操作性动作之前发生,并且与任务表现或任务获取无关,因为它在整个 2 周的日常行为训练期间持续存在。相反,斜坡随着线索呈现时间的延长而灵活地延长,似乎调节了初始线索开始触发的多巴胺释放,即奖励预测误差(RPE)信号,因为当奖励时间变得更可预测时,信号幅度和维持都会减小。因此,我们的发现表明,多巴胺释放的 RPE 和动作成分可以在时间上区分成相位和斜坡/持续信号,后者取决于前者,可能反映了对食欲动作的预期或激励,概念上类似于动机。尚不清楚与奖励相关信息和奖励驱动接近行为相关的多巴胺信号的成分是否可以分开。大多数研究多巴胺系统的研究使用巴甫洛夫或操作性条件作用,这两种方法都涉及奖励的传递,并需要食欲接近行为。因此,单独使用任何一种范式都不能将这些成分对多巴胺释放的贡献分开。然而,通过在同一研究中结合这两种范式,我们发现对驱动奖励的操作性动作的预期会引起与奖励预测相关的多巴胺释放的调制,产生所谓的多巴胺斜坡。因此,我们的发现为多巴胺斜坡提供了新的见解,并表明多巴胺信号以可区分但依赖的方式整合奖励和食欲动作。