The prefrontal cortex and hybrid learning during iterative competitive games.

Affiliation

Laboratory of Neurobiology, The Rockefeller University, New York, New York, USA.

Publication information

Ann N Y Acad Sci. 2011 Dec;1239:100-8. doi: 10.1111/j.1749-6632.2011.06223.x.

Abstract

Behavioral changes driven by reinforcement and punishment are referred to as simple or model-free reinforcement learning. Animals can also change their behaviors by observing events that are neither appetitive nor aversive when these events provide new information about payoffs available from alternative actions. This is an example of model-based reinforcement learning and can be accomplished by incorporating hypothetical reward signals into the value functions for specific actions. Recent neuroimaging and single-neuron recording studies showed that the prefrontal cortex and the striatum are involved not only in reinforcement and punishment, but also in model-based reinforcement learning. We found evidence for both types of learning, and hence hybrid learning, in monkeys during simulated competitive games. In addition, in both the dorsolateral prefrontal cortex and orbitofrontal cortex, individual neurons heterogeneously encoded signals related to actual and hypothetical outcomes from specific actions, suggesting that both areas might contribute to hybrid learning.
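
The idea of incorporating hypothetical reward signals into action value functions can be illustrated with a minimal sketch. This is not the authors' model: the function name `hybrid_update`, the two learning rates, and the delta-rule form of the update are all assumptions chosen for illustration. The chosen action is updated from the outcome actually received (the model-free part), while unchosen actions are updated from the payoffs they would have yielded (the hypothetical part).

```python
def hybrid_update(values, chosen, actual_reward, hypothetical_rewards,
                  alpha_actual=0.2, alpha_hypothetical=0.1):
    """Return new action values after one trial (illustrative sketch only).

    values               -- current value estimate for each action
    chosen               -- index of the action that was taken
    actual_reward        -- payoff received for the chosen action
    hypothetical_rewards -- payoff each action would have yielded; the entry
                            for the chosen action is ignored
    alpha_actual, alpha_hypothetical -- assumed learning rates, not taken
                            from the paper
    """
    updated = list(values)
    for action, value in enumerate(values):
        if action == chosen:
            # Model-free step: learn from the outcome actually experienced.
            updated[action] = value + alpha_actual * (actual_reward - value)
        else:
            # Hypothetical step: learn from the observed payoff of an
            # action that was not taken.
            target = hypothetical_rewards[action]
            updated[action] = value + alpha_hypothetical * (target - value)
    return updated


if __name__ == "__main__":
    # Three actions; action 0 was chosen and lost, action 1 would have won.
    print(hybrid_update([0.0, 0.0, 0.0], 0, 0.0, [0.0, 1.0, 0.0]))
```

Setting `alpha_hypothetical` to 0 in this sketch reduces it to a purely model-free learner, which is one simple way to think about the two components of hybrid learning.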
