Holroyd Clay B, Krigolson Olave E, Baker Robert, Lee Seung, Gibson Jessica
University of Victoria, Victoria, British Columbia, Canada.
Cogn Affect Behav Neurosci. 2009 Mar;9(1):59-70. doi: 10.3758/CABN.9.1.59.
A recent theory holds that the anterior cingulate cortex (ACC) uses reinforcement learning signals conveyed by the midbrain dopamine system to facilitate flexible action selection. According to this position, the impact of reward prediction error signals on ACC modulates the amplitude of a component of the event-related brain potential called the error-related negativity (ERN). The theory predicts that ERN amplitude is monotonically related to the expectedness of the event: It is larger for unexpected outcomes than for expected outcomes. However, a recent failure to confirm this prediction has called the theory into question. In the present article, we investigated this discrepancy in three trial-and-error learning experiments. All three experiments provided support for the theory, but the effect sizes were largest when an optimal response strategy could actually be learned. This observation suggests that ACC utilizes dopamine reward prediction error signals for adaptive decision making when the optimal behavior is, in fact, learnable.
最近的一种理论认为,前扣带回皮质(ACC)利用中脑多巴胺系统传递的强化学习信号来促进灵活的动作选择。根据这一观点,奖励预测误差信号对ACC的影响会调节一种称为错误相关负波(ERN)的事件相关脑电位成分的幅度。该理论预测,ERN幅度与事件的预期性呈单调相关:意外结果的ERN幅度大于预期结果的ERN幅度。然而,最近未能证实这一预测,使该理论受到质疑。在本文中,我们在三个试错学习实验中研究了这一差异。所有三个实验都为该理论提供了支持,但当实际能够学习到最优反应策略时,效应量最大。这一观察结果表明,当最优行为实际上是可学习的时,ACC利用多巴胺奖励预测误差信号进行适应性决策。