Quilodran René, Rothé Marie, Procyk Emmanuel
Inserm, U846, Stem Cell and Brain Research Institute, 69500 Bron, France.
Neuron. 2008 Jan 24;57(2):314-25. doi: 10.1016/j.neuron.2007.11.031.
Rapid optimization of behavior requires decisions about when to explore and when to exploit discovered resources. The mechanisms that lead to fast adaptations and their interaction with action valuation are a central issue. We show here that the anterior cingulate cortex (ACC) encodes multiple feedbacks devoted to exploration and its immediate termination. In a task that alternates exploration and exploitation periods, the ACC monitored negative and positive outcomes relevant for different adaptations. In particular, it produced signals specific of the first reward, i.e., the end of exploration. Those signals disappeared in exploitation periods but immediately transferred to the initiation of trials-a transfer comparable to learning phenomena observed for dopaminergic neurons. Importantly, these were also observed for high gamma oscillations of local field potentials shown to correlate with brain imaging signal. Thus, mechanisms of action valuation and monitoring of events/actions are combined for rapid behavioral regulation.
行为的快速优化需要决定何时进行探索以及何时利用已发现的资源。导致快速适应的机制及其与行动评估的相互作用是一个核心问题。我们在此表明,前扣带回皮质(ACC)编码了多个用于探索及其立即终止的反馈。在一个交替进行探索和利用阶段的任务中,ACC监测与不同适应相关的负面和正面结果。特别是,它产生了特定于首次奖励(即探索结束)的信号。这些信号在利用阶段消失,但立即转移到试验开始时——这种转移类似于在多巴胺能神经元中观察到的学习现象。重要的是,在与脑成像信号相关的局部场电位的高伽马振荡中也观察到了这些情况。因此,行动评估机制和对事件/行动的监测相结合,以实现快速的行为调节。