前额皮质与迭代竞争游戏中的混合学习。

The prefrontal cortex and hybrid learning during iterative competitive games.

机构信息

Laboratory of Neurobiology, The Rockefeller University, New York, New York, USA.

出版信息

Ann N Y Acad Sci. 2011 Dec;1239:100-8. doi: 10.1111/j.1749-6632.2011.06223.x.

DOI:10.1111/j.1749-6632.2011.06223.x

PMID:22145879

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3302724/

Abstract

Behavioral changes driven by reinforcement and punishment are referred to as simple or model-free reinforcement learning. Animals can also change their behaviors by observing events that are neither appetitive nor aversive when these events provide new information about payoffs available from alternative actions. This is an example of model-based reinforcement learning and can be accomplished by incorporating hypothetical reward signals into the value functions for specific actions. Recent neuroimaging and single-neuron recording studies showed that the prefrontal cortex and the striatum are involved not only in reinforcement and punishment, but also in model-based reinforcement learning. We found evidence for both types of learning, and hence hybrid learning, in monkeys during simulated competitive games. In addition, in both the dorsolateral prefrontal cortex and orbitofrontal cortex, individual neurons heterogeneously encoded signals related to actual and hypothetical outcomes from specific actions, suggesting that both areas might contribute to hybrid learning.

摘要

由强化和惩罚驱动的行为变化被称为简单或无模型的强化学习。当这些事件提供了来自替代行为的可获得回报的新信息时，动物也可以通过观察既不是令人愉快的也不是令人厌恶的事件来改变它们的行为。这是基于模型的强化学习的一个例子，可以通过将假设的奖励信号纳入特定动作的价值函数来实现。最近的神经影像学和单细胞记录研究表明，前额叶皮层和纹状体不仅参与强化和惩罚，还参与基于模型的强化学习。我们在模拟竞争游戏中发现猴子既有这两种类型的学习，也有混合学习的证据。此外，在背外侧前额叶皮层和眶额皮层中，单个神经元不均匀地编码与特定动作的实际和假设结果相关的信号，这表明这两个区域可能有助于混合学习。

相似文献

The prefrontal cortex and hybrid learning during iterative competitive games.

Ann N Y Acad Sci. 2011 Dec;1239:100-8. doi: 10.1111/j.1749-6632.2011.06223.x.

Representations of appetitive and aversive information in the primate orbitofrontal cortex.

Ann N Y Acad Sci. 2011 Dec;1239:59-70. doi: 10.1111/j.1749-6632.2011.06255.x.

Valuation of uncertain and delayed rewards in primate prefrontal cortex.

Neural Netw. 2009 Apr;22(3):294-304. doi: 10.1016/j.neunet.2009.03.010. Epub 2009 Mar 29.

Neural correlates of strategic reasoning during competitive games.

Science. 2014 Oct 17;346(6207):340-3. doi: 10.1126/science.1256254. Epub 2014 Sep 18.

Mechanisms of reinforcement learning and decision making in the primate dorsolateral prefrontal cortex.

Ann N Y Acad Sci. 2007 May;1104:108-22. doi: 10.1196/annals.1390.007. Epub 2007 Mar 8.

Prefrontal cortex and decision making in a mixed-strategy game.

Nat Neurosci. 2004 Apr;7(4):404-10. doi: 10.1038/nn1209. Epub 2004 Mar 7.

Distributed coding of actual and hypothetical outcomes in the orbital and dorsolateral prefrontal cortex.

Neuron. 2011 May 26;70(4):731-41. doi: 10.1016/j.neuron.2011.03.026.

Does the orbitofrontal cortex signal value?

Ann N Y Acad Sci. 2011 Dec;1239:87-99. doi: 10.1111/j.1749-6632.2011.06210.x.

Cortical mechanisms for reinforcement learning in competitive games.

Philos Trans R Soc Lond B Biol Sci. 2008 Dec 12;363(1511):3845-57. doi: 10.1098/rstb.2008.0158.

The convergence of information about rewarding and aversive stimuli in single neurons.

J Neurosci. 2009 Sep 16;29(37):11471-83. doi: 10.1523/JNEUROSCI.1815-09.2009.

引用本文的文献

Inference as a fundamental process in behavior.

Curr Opin Behav Sci. 2021 Apr;38:8-13. doi: 10.1016/j.cobeha.2020.06.005. Epub 2020 Jul 22.

Prefrontal Cortex Predicts State Switches during Reversal Learning.

Neuron. 2020 Jun 17;106(6):1044-1054.e4. doi: 10.1016/j.neuron.2020.03.024. Epub 2020 Apr 20.

Habits without values.

Psychol Rev. 2019 Mar;126(2):292-311. doi: 10.1037/rev0000120. Epub 2019 Jan 24.

The Anterior Insula Tracks Behavioral Entropy during an Interpersonal Competitive Game.

PLoS One. 2015 Jun 3;10(6):e0123329. doi: 10.1371/journal.pone.0123329. eCollection 2015.

The orbitofrontal oracle: cortical mechanisms for the prediction and evaluation of specific behavioral outcomes.

Neuron. 2014 Dec 17;84(6):1143-56. doi: 10.1016/j.neuron.2014.10.049.

Decision making: from neuroscience to psychiatry.

Neuron. 2013 Apr 24;78(2):233-48. doi: 10.1016/j.neuron.2013.04.008.

Insights from the application of computational neuroimaging to social neuroscience.

Curr Opin Neurobiol. 2013 Jun;23(3):387-92. doi: 10.1016/j.conb.2013.02.007. Epub 2013 Mar 18.

本文引用的文献

Role of rodent secondary motor cortex in value-based action selection.

Nat Neurosci. 2011 Aug 14;14(9):1202-8. doi: 10.1038/nn.2881.

Counterfactual choice and learning in a neural network centered on human lateral frontopolar cortex.

PLoS Biol. 2011 Jun;9(6):e1001093. doi: 10.1371/journal.pbio.1001093. Epub 2011 Jun 28.

Distributed coding of actual and hypothetical outcomes in the orbital and dorsolateral prefrontal cortex.

Neuron. 2011 May 26;70(4):731-41. doi: 10.1016/j.neuron.2011.03.026.

Neural correlates of forward planning in a spatial decision task in humans.

J Neurosci. 2011 Apr 6;31(14):5526-39. doi: 10.1523/JNEUROSCI.4647-10.2011.

Neurobiology of economic choice: a good-based model.

Annu Rev Neurosci. 2011;34:333-59. doi: 10.1146/annurev-neuro-061010-113648.

Model-based influences on humans' choices and striatal prediction errors.

Neuron. 2011 Mar 24;69(6):1204-15. doi: 10.1016/j.neuron.2011.02.027.

A reservoir of time constants for memory traces in cortical neurons.

Nat Neurosci. 2011 Mar;14(3):366-72. doi: 10.1038/nn.2752. Epub 2011 Feb 13.

Heterogeneous coding of temporally discounted values in the dorsal and ventral striatum during intertemporal choice.

Neuron. 2011 Jan 13;69(1):170-82. doi: 10.1016/j.neuron.2010.11.041.

States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Neuron. 2010 May 27;66(4):585-95. doi: 10.1016/j.neuron.2010.04.016.

Distinct roles of rodent orbitofrontal and medial prefrontal cortex in decision making.

Neuron. 2010 May 13;66(3):449-60. doi: 10.1016/j.neuron.2010.03.033.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

前额皮质与迭代竞争游戏中的混合学习。

The prefrontal cortex and hybrid learning during iterative competitive games.

机构信息

Laboratory of Neurobiology, The Rockefeller University, New York, New York, USA.

出版信息

Ann N Y Acad Sci. 2011 Dec;1239:100-8. doi: 10.1111/j.1749-6632.2011.06223.x.

DOI:10.1111/j.1749-6632.2011.06223.x

PMID:22145879

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3302724/

Abstract

摘要

前额皮质与迭代竞争游戏中的混合学习。

The prefrontal cortex and hybrid learning during iterative competitive games.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

前额皮质与迭代竞争游戏中的混合学习。

The prefrontal cortex and hybrid learning during iterative competitive games.

机构信息

出版信息