Suppr超能文献

灵长类动物外侧前额叶皮层和尾状核神经元对正、负奖励预测误差的编码。

Encoding of both positive and negative reward prediction errors by neurons of the primate lateral prefrontal cortex and caudate nucleus.

机构信息

Rhodan Center for Nervous System Repair, Department of Neurosurgery, Massachusetts General Hospital, and Harvard Medical School, Boston, Massachusetts 02114, USA.

出版信息

J Neurosci. 2011 Dec 7;31(49):17772-87. doi: 10.1523/JNEUROSCI.3793-11.2011.

Abstract

Learning can be motivated by unanticipated success or unexpected failure. The former encourages us to repeat an action or activity, whereas the latter leads us to find an alternative strategy. Understanding the neural representation of these unexpected events is therefore critical to elucidate learning-related circuits. We examined the activity of neurons in the lateral prefrontal cortex (PFC) and caudate nucleus of monkeys as they performed a trial-and-error learning task. Unexpected outcomes were widely represented in both structures, and neurons driven by unexpectedly negative outcomes were as frequent as those activated by unexpectedly positive outcomes. Moreover, both positive and negative reward prediction errors (RPEs) were represented primarily by increases in firing rate, unlike the manner in which dopamine neurons have been observed to reflect these values. Interestingly, positive RPEs tended to appear with shorter latency than negative RPEs, perhaps reflecting the mechanism of their generation. Last, in the PFC but not the caudate, trial-by-trial variations in outcome-related activity were linked to the animals' subsequent behavioral decisions. More broadly, the robustness of RPE signaling by these neurons suggests that actor-critic models of reinforcement learning in which the PFC and particularly the caudate are considered primarily to be "actors" rather than "critics," should be reconsidered to include a prominent evaluative role for these structures.

摘要

学习可以受到意外成功或意外失败的激励。前者鼓励我们重复一个动作或活动,而后者则促使我们寻找替代策略。因此,理解这些意外事件的神经表示对于阐明与学习相关的回路至关重要。我们观察了猴子外侧前额叶皮层 (PFC) 和尾状核中神经元在进行试错学习任务时的活动。在这两个结构中,意外结果都得到了广泛的表示,并且由意外负面结果驱动的神经元与由意外正面结果驱动的神经元一样频繁。此外,无论是正的还是负的奖励预测误差 (RPE) 主要都表现为放电率的增加,这与观察到多巴胺神经元反映这些值的方式不同。有趣的是,正 RPE 似乎比负 RPE 出现的潜伏期更短,这可能反映了它们产生的机制。最后,在 PFC 中而不是尾状核中,与结果相关的活动在试验中的变化与动物随后的行为决策有关。更广泛地说,这些神经元的 RPE 信号的稳健性表明,强化学习的行为-评价模型应该重新考虑,其中 PFC 特别是尾状核被认为主要是“行为者”而不是“评价者”,以包括这些结构的突出评价作用。

相似文献

2
Action and outcome encoding in the primate caudate nucleus.灵长类动物尾状核中的动作与结果编码
J Neurosci. 2007 Dec 26;27(52):14502-14. doi: 10.1523/JNEUROSCI.3060-07.2007.

引用本文的文献

1
Basal ganglia activation localized in MEG using a reward task.使用奖励任务在脑磁图中定位基底神经节激活。
Neuroimage Rep. 2021 Jul 28;1(3):100034. doi: 10.1016/j.ynirp.2021.100034. eCollection 2021 Sep.
7
Beta Oscillations in Monkey Striatum Encode Reward Prediction Error Signals.猴子纹状体中的β振荡编码奖励预测误差信号。
J Neurosci. 2023 May 3;43(18):3339-3352. doi: 10.1523/JNEUROSCI.0952-22.2023. Epub 2023 Apr 4.
8
The Neurobase of ambiguity loss aversion about decision making.决策中模糊性损失厌恶的神经基础。
Front Psychol. 2023 Jan 26;14:1055640. doi: 10.3389/fpsyg.2023.1055640. eCollection 2023.
10
Minocycline differentially modulates human spatial memory systems.米诺环素对人类空间记忆系统有不同的调节作用。
Neuropsychopharmacology. 2020 Dec;45(13):2162-2169. doi: 10.1038/s41386-020-00811-8. Epub 2020 Aug 24.

本文引用的文献

2
Reward prediction error coding in dorsal striatal neurons.背侧纹状体神经元中的奖励预测误差编码。
J Neurosci. 2010 Aug 25;30(34):11447-57. doi: 10.1523/JNEUROSCI.1719-10.2010.
3
Role of striatum in updating values of chosen actions.纹状体在更新所选动作价值中的作用。
J Neurosci. 2009 Nov 25;29(47):14701-12. doi: 10.1523/JNEUROSCI.2728-09.2009.
8
Neuronal correlates of instrumental learning in the dorsal striatum.背侧纹状体中工具性学习的神经元关联
J Neurophysiol. 2009 Jul;102(1):475-89. doi: 10.1152/jn.00262.2009. Epub 2009 May 13.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验