Lefner Merridee J, Moghaddam Bita
Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, OR.
bioRxiv. 2024 Oct 7:2024.10.07.617060. doi: 10.1101/2024.10.07.617060.
In dynamic environments where stimuli predicting rewarding or aversive outcomes unexpectedly change, it is critical to flexibly update behavior while preserving recollection of previous associations. Dopamine and GABA neurons in the ventral tegmental area (VTA) are implicated in reward and punishment learning, yet little is known about how each population adapts when the predicted outcome valence changes. We measured VTA dopamine and GABA population activity while male and female rats learned to associate three discrete auditory cues to three distinct outcomes: reward, punishment, or no outcome within the same session. After learning, the reward and punishment cue-outcome contingencies were reversed, and subsequently re-reversed. As expected, the dopamine population rapidly adapted to learning and contingency reversals by increasing the response to appetitive stimuli and decreasing the response to aversive stimuli. In contrast, the GABA population increased activity to all sensory events regardless of valence, including the neutral cue. Reversing learned contingencies selectively influenced GABA responses to the reward-predictive cue, prolonging increased activity within and across sessions. The observed valence-specific dissociations in the directionality and temporal progression of VTA dopamine and GABA calcium activity indicates that these populations are independently recruited and serve distinct roles during appetitive and aversive associative learning and contingency reversal.
在动态环境中,预测奖励或厌恶结果的刺激会意外变化,此时在保留对先前关联记忆的同时灵活更新行为至关重要。腹侧被盖区(VTA)中的多巴胺能神经元和γ-氨基丁酸能神经元与奖赏和惩罚学习有关,但对于每种神经元群体在预测结果效价变化时如何适应却知之甚少。我们在雄性和雌性大鼠学习将三种不同的听觉线索与三种不同结果(奖赏、惩罚或同一会话内无结果)建立关联时,测量了VTA多巴胺能神经元群体和γ-氨基丁酸能神经元群体的活动。学习后,奖赏和惩罚线索与结果的偶联关系被颠倒,随后又恢复原状。正如预期的那样,多巴胺能神经元群体通过增加对奖赏性刺激的反应并减少对厌恶刺激的反应,迅速适应学习和偶联关系的颠倒。相比之下,γ-氨基丁酸能神经元群体对所有感觉事件(无论效价如何,包括中性线索)的活动都增加。颠倒已学习的偶联关系选择性地影响了γ-氨基丁酸能神经元对奖赏预测线索的反应,在同一会话内和不同会话之间延长了增加的活动。在VTA多巴胺能神经元和γ-氨基丁酸能神经元钙活动的方向性和时间进程中观察到的效价特异性分离表明,这些神经元群体在奖赏性和厌恶性交联学习以及偶联关系颠倒过程中是独立被募集的,并且发挥着不同的作用。