Otto-von-Guericke University, Institute of Psychology, D-39106, Magdeburg, Germany.
Center for Behavioral Brain Sciences, D-39106, Magdeburg, Germany.
Nat Commun. 2017 Nov 22;8(1):1690. doi: 10.1038/s41467-017-01703-0.
Optimal decision-making employs short-term rewards and abstract long-term information based on which of these is deemed relevant. Employing short- vs. long-term information is associated with different learning mechanisms, yet neural evidence showing that these two are dissociable is lacking. Here we demonstrate that long-term, inference-based beliefs are biased by short-term reward experiences and that dissociable brain regions facilitate both types of learning. Long-term inferences are associated with dorsal striatal and frontopolar cortex activity, while short-term rewards engage the ventral striatum. Stronger concurrent representation of reward signals by mediodorsal striatum and frontopolar cortex correlates with less biased, more optimal individual long-term inference. Moreover, dynamic modulation of activity in a cortical cognitive control network and the medial striatum is associated with trial-by-trial control of biases in belief updating. This suggests that counteracting the processing of optimally to-be-ignored short-term rewards and cortical suppression of associated reward-signals, determines long-term learning success and failure.
最优决策基于相关的短期奖励和抽象的长期信息进行短期和长期信息的运用与不同的学习机制相关联,但缺乏表明这两者可分离的神经证据。在这里,我们证明了长期的基于推理的信念会受到短期奖励经验的影响,并且可分离的大脑区域促进了这两种学习。长期推理与背侧纹状体和额极皮层活动有关,而短期奖励则涉及腹侧纹状体。中脑背侧纹状体和额极皮层对奖励信号的同时表示越强,个体的长期推理就越不受偏见影响,也越优化。此外,皮质认知控制网络和内侧纹状体的活动的动态调节与信念更新中的偏差的逐个试次控制相关。这表明,对抗处理要被忽略的最佳短期奖励的处理以及对相关奖励信号的皮质抑制,决定了长期学习的成功和失败。