Bornhoft Kaisa N, Prohofsky Julianna, O'Neal Timothy J, Wolff Amy R, Saunders Benjamin T
bioRxiv. 2024 May 17:2024.05.17.594692. doi: 10.1101/2024.05.17.594692.
Adaptive decision making relies on dynamic updating of learned associations, in which environmental cues come to predict positively and negatively valenced stimuli, such as food or threat. Flexible cue-guided behaviors depend on a network of brain systems, including dopamine signaling in the striatum, which is critical for learning and maintenance of conditioned behaviors. However, it remains unclear how dopamine signaling encodes multi-valent, dynamic learning contexts, where positive and negative associations must be rapidly disambiguated. To address this, we employed a Pavlovian discrimination paradigm in which cues predicting positive and negative outcomes were intermingled during conditioning sessions, and their meaning was serially reversed across training. We found that rats readily distinguished these cues and updated their behavior rapidly upon valence reversal. Using fiber photometry, we recorded dopamine signaling in three major striatal subregions: the dorsolateral striatum (DLS), the nucleus accumbens core, and the nucleus accumbens medial shell. We found heterogeneous responses to positive and negative conditioned cues and their predicted outcomes. Valence ambiguity introduced by cue reversal reshaped striatal dopamine on different timelines: nucleus accumbens core and shell signals updated more readily than those in the DLS. Together, these results suggest that striatal dopamine flexibly encodes multi-valent learning contexts, and that these signals are dynamically modulated by changing contingencies to resolve ambiguity about the meaning of environmental cues.