Ellwood Ian T, Patel Tosha, Wadia Varun, Lee Anthony T, Liptak Alayna T, Bender Kevin J, Sohal Vikaas S
Department of Psychiatry.
Weill Institute for Neurosciences.
J Neurosci. 2017 Aug 30;37(35):8315-8329. doi: 10.1523/JNEUROSCI.1221-17.2017. Epub 2017 Jul 24.
Dopamine neurons in the ventral tegmental area (VTA) encode reward prediction errors and can drive reinforcement learning through their projections to striatum, but much less is known about their projections to prefrontal cortex (PFC). Here, we studied these projections and observed phasic VTA-PFC fiber photometry signals after the delivery of rewards. Next, we studied how optogenetic stimulation of these projections affects behavior using conditioned place preference and a task in which mice learn associations between cues and food rewards and then use those associations to make choices. Neither phasic nor tonic stimulation of dopaminergic VTA-PFC projections elicited place preference. Furthermore, substituting phasic VTA-PFC stimulation for food rewards was not sufficient to reinforce new cue-reward associations nor maintain previously learned ones. However, the same patterns of stimulation that failed to reinforce place preference or cue-reward associations were able to modify behavior in other ways. First, continuous tonic stimulation maintained previously learned cue-reward associations even after they ceased being valid. Second, delivering phasic stimulation either continuously or after choices not previously associated with reward induced mice to make choices that deviated from previously learned associations. In summary, despite the fact that dopaminergic VTA-PFC projections exhibit phasic increases in activity that are time locked to the delivery of rewards, phasic activation of these projections does not necessarily reinforce specific actions. Rather, dopaminergic VTA-PFC activity can control whether mice maintain or deviate from previously learned cue-reward associations. Dopaminergic inputs from ventral tegmental area (VTA) to striatum encode reward prediction errors and reinforce specific actions; however, it is currently unknown whether dopaminergic inputs to prefrontal cortex (PFC) play similar or distinct roles. Here, we used bulk Ca imaging to show that unexpected rewards or reward-predicting cues elicit phasic increases in the activity of dopaminergic VTA-PFC fibers. However, in multiple behavioral paradigms, we failed to observe reinforcing effects after stimulation of these fibers. In these same experiments, we did find that tonic or phasic patterns of stimulation caused mice to maintain or deviate from previously learned cue-reward associations, respectively. Therefore, although they may exhibit similar patterns of activity, dopaminergic inputs to striatum and PFC can elicit divergent behavioral effects.
腹侧被盖区(VTA)的多巴胺能神经元编码奖励预测误差,并可通过其向纹状体的投射驱动强化学习,但对其向前额叶皮质(PFC)的投射了解较少。在此,我们研究了这些投射,并观察到奖励发放后VTA-PFC纤维光度信号的相位变化。接下来,我们使用条件性位置偏好以及一项任务来研究对这些投射的光遗传学刺激如何影响行为,在该任务中,小鼠学习线索与食物奖励之间的关联,然后利用这些关联做出选择。对多巴胺能VTA-PFC投射的相位或持续性刺激均未引发位置偏好。此外,用VTA-PFC的相位刺激替代食物奖励不足以强化新的线索-奖励关联,也无法维持先前习得的关联。然而,同样未能强化位置偏好或线索-奖励关联的刺激模式能够以其他方式改变行为。首先,持续的持续性刺激即使在先前习得的线索-奖励关联不再有效后仍能维持它们。其次,持续或在先前与奖励无关的选择后给予相位刺激会诱导小鼠做出偏离先前习得关联的选择。总之,尽管多巴胺能VTA-PFC投射的活动在奖励发放时呈现出相位性增加,但这些投射的相位激活不一定会强化特定行为。相反,多巴胺能VTA-PFC活动可以控制小鼠是维持还是偏离先前习得的线索-奖励关联。从腹侧被盖区(VTA)到纹状体的多巴胺能输入编码奖励预测误差并强化特定行为;然而,目前尚不清楚多巴胺能输入到前额叶皮质(PFC)是否发挥类似或不同的作用。在此,我们使用群体钙成像显示,意外奖励或奖励预测线索会引发多巴胺能VTA-PFC纤维活动的相位性增加。然而,在多个行为范式中,我们在刺激这些纤维后未观察到强化作用。在这些相同的实验中,我们确实发现持续性或相位性刺激模式分别导致小鼠维持或偏离先前习得的线索-奖励关联。因此,尽管它们可能表现出相似的活动模式,但多巴胺能输入到纹状体和PFC可引发不同的行为效应。