Valentin Vivian V, Dickinson Anthony, O'Doherty John P
Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, California 91125, USA.
J Neurosci. 2007 Apr 11;27(15):4019-26. doi: 10.1523/JNEUROSCI.0564-07.2007.
Instrumental conditioning is considered to involve at least two distinct learning systems: a goal-directed system that learns associations between responses and the incentive value of outcomes, and a habit system that learns associations between stimuli and responses without any link to the outcome that the response produced. Lesion studies in rodents suggest that these two components of instrumental conditioning may be mediated by anatomically distinct neural systems. The aim of the present study was to determine the neural substrates of the goal-directed component of instrumental learning in humans. Nineteen human subjects were scanned with functional magnetic resonance imaging while they learned to choose instrumental actions that were associated with the subsequent delivery of different food rewards (tomato juice, chocolate milk, and orange juice). After training, one of these foods was devalued by feeding the subject to satiety on that food. The subjects were then scanned again while being re-exposed to the instrumental choice procedure (in extinction). We hypothesized that regions of the brain involved in goal-directed learning would show changes in their activity as a function of outcome devaluation. Our results indicate that neural activity in one brain region in particular, the orbitofrontal cortex, showed a strong modulation during selection of a devalued compared with a nondevalued action. These results suggest an important contribution of orbitofrontal cortex to guiding goal-directed instrumental choices in humans.