Amiez C, Joseph J P, Procyk E
Department of Cognitive Neurosciences, Inserm U371 Cerveau et Vision, IFR19, UCB-Lyon1, 18 av. Doyen Lépine, 69500 Bron, France.
Cereb Cortex. 2006 Jul;16(7):1040-55. doi: 10.1093/cercor/bhj046. Epub 2005 Oct 5.
The anterior cingulate cortex (ACC) is known to play a crucial role in the fast adaptations of behavior based on immediate reward values. What is less certain is whether the ACC is also involved in long-term adaptations to situations with uncertain outcomes. To study this issue, we placed macaque monkeys in a probabilistic context in which the appropriate strategy to maximize reward was to identify the stimulus with the highest reward value (optimal stimulus). Only knowledge of the theoretical average reward value associated with this stimulus, referred to as 'the task value', was available. Remarkably, in each trial, ACC pre-reward activity correlated with the task value. Importantly, this neuronal activity was observed prior to the discovery of the optimal stimulus. We hypothesize that the received rewards and the task value, constructed a priori through learning, are used to guide behavior and identify the optimal stimulus. We tested this hypothesis by muscimol deactivation of the ACC. As predicted, this inactivation impaired the search for the optimal stimulus. We propose that the ACC participates in long-term adaptation of voluntary reward-based behaviors by encoding general task values and received rewards.