Suppr超能文献

解码学习过程中奖励预测的形成。

Decoding the formation of reward predictions across learning.

机构信息

Bernstein Center for Computational Neuroscience and Berlin Center for Advanced Neuroimaging, Charité-Universitätsmedizin Berlin, D-10115 Berlin, Germany.

出版信息

J Neurosci. 2011 Oct 12;31(41):14624-30. doi: 10.1523/JNEUROSCI.3412-11.2011.

Abstract

The predicted reward of different behavioral options plays an important role in guiding decisions. Previous research has identified reward predictions in prefrontal and striatal brain regions. Moreover, it has been shown that the neural representation of a predicted reward is similar to the neural representation of the actual reward outcome. However, it has remained unknown how these representations emerge over the course of learning and how they relate to decision making. Here, we sought to investigate learning of predicted reward representations using functional magnetic resonance imaging and multivariate pattern classification. Using a pavlovian conditioning procedure, human subjects learned multiple novel cue-outcome associations in each scanning run. We demonstrate that across learning activity patterns in the orbitofrontal cortex, the dorsolateral prefrontal cortex (DLPFC), and the dorsal striatum, coding the value of predicted rewards become similar to the patterns coding the value of actual reward outcomes. Furthermore, we provide evidence that predicted reward representations in the striatum precede those in prefrontal regions and that representations in the DLPFC are linked to subsequent value-based choices. Our results show that different brain regions represent outcome predictions by eliciting the neural representation of the actual outcome. Furthermore, they suggest that reward predictions in the DLPFC are directly related to value-based choices.

摘要

不同行为选择的预测奖励在指导决策中起着重要作用。先前的研究已经确定了前额叶和纹状体脑区的奖励预测。此外,已经表明,预测奖励的神经表示与实际奖励结果的神经表示相似。然而,这些表示如何在学习过程中出现以及它们与决策的关系仍然未知。在这里,我们试图使用功能磁共振成像和多元模式分类来研究预测奖励表示的学习。使用巴甫洛夫条件反射程序,人类受试者在每次扫描运行中学习多个新的线索-结果关联。我们证明,在眶额皮层、背外侧前额叶皮层 (DLPFC) 和背侧纹状体中,编码预测奖励价值的活动模式变得与编码实际奖励结果价值的模式相似。此外,我们提供的证据表明,纹状体中的预测奖励表示先于前额叶区域中的表示,并且 DLPFC 中的表示与随后的基于价值的选择有关。我们的结果表明,不同的大脑区域通过引发实际结果的神经表示来表示结果预测。此外,它们表明 DLPFC 中的奖励预测与基于价值的选择直接相关。

相似文献

1
Decoding the formation of reward predictions across learning.解码学习过程中奖励预测的形成。
J Neurosci. 2011 Oct 12;31(41):14624-30. doi: 10.1523/JNEUROSCI.3412-11.2011.
2

引用本文的文献

8
Computing Value from Quality and Quantity in Human Decision-Making.从人类决策中的质量和数量中计算价值。
J Neurosci. 2019 Jan 2;39(1):163-176. doi: 10.1523/JNEUROSCI.0706-18.2018. Epub 2018 Nov 19.
10

本文引用的文献

4
Category learning in the brain.大脑中的类别学习。
Annu Rev Neurosci. 2010;33:203-19. doi: 10.1146/annurev.neuro.051508.135546.
6
7
The neural code of reward anticipation in human orbitofrontal cortex.人类眶额皮质中奖励预期的神经编码。
Proc Natl Acad Sci U S A. 2010 Mar 30;107(13):6010-5. doi: 10.1073/pnas.0912838107. Epub 2010 Mar 15.
8
Adaptation of reward sensitivity in orbitofrontal neurons.眶额皮质神经元奖赏敏感性的适应性。
J Neurosci. 2010 Jan 13;30(2):534-44. doi: 10.1523/JNEUROSCI.4009-09.2010.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验