Department of Psychology,
Department of Psychology and.
J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.
Failures to obtain reward can occur from errors in action selection or action execution. Recently, we observed marked differences in choice behavior when the failure to obtain a reward was attributed to errors in action execution compared with errors in action selection (McDougle et al., 2016). Specifically, participants appeared to solve this credit assignment problem by discounting outcomes in which the absence of reward was attributed to errors in action execution. Building on recent evidence indicating relatively direct communication between the cerebellum and basal ganglia, we hypothesized that cerebellar-dependent sensory prediction errors (SPEs), a signal indicating execution failure, could attenuate value updating within a basal ganglia-dependent reinforcement learning system. Here we compared the SPE hypothesis to an alternative, "top-down" hypothesis in which changes in choice behavior reflect participants' sense of agency. In two experiments with male and female human participants, we manipulated the strength of SPEs, along with the participants' sense of agency in the second experiment. The results showed that, whereas the strength of SPE had no effect on choice behavior, participants were much more likely to discount the absence of rewards under conditions in which they believed the reward outcome depended on their ability to produce accurate movements. These results provide strong evidence that SPEs do not directly influence reinforcement learning. Instead, a participant's sense of agency appears to play a significant role in modulating choice behavior when unexpected outcomes can arise from errors in action execution. When learning from the outcome of actions, the brain faces a credit assignment problem: Failures of reward can be attributed to poor choice selection or poor action execution. Here, we test a specific hypothesis that execution errors are implicitly signaled by cerebellar-based sensory prediction errors. We evaluate this hypothesis and compare it with a more "top-down" hypothesis in which the modulation of choice behavior from execution errors reflects participants' sense of agency. We find that sensory prediction errors have no significant effect on reinforcement learning. Instead, instructions influencing participants' belief of causal outcomes appear to be the main factor influencing their choice behavior.
未能获得奖励可能源于动作选择或动作执行中的错误。最近,我们观察到在归因于动作执行错误而不是动作选择错误的情况下,选择行为存在明显差异(McDougle 等人,2016)。具体来说,参与者似乎通过对归因于动作执行错误而导致缺乏奖励的结果进行折扣来解决这种信用分配问题。基于最近的证据表明小脑和基底神经节之间存在相对直接的通信,我们假设小脑依赖的感觉预测误差(SPE),表示执行失败的信号,可以减弱基于基底神经节的强化学习系统中的价值更新。在这里,我们将 SPE 假设与替代的“自上而下”假设进行了比较,其中选择行为的变化反映了参与者的代理感。在两项有男性和女性人类参与者参与的实验中,我们操纵了 SPE 的强度以及参与者在第二项实验中的代理感。结果表明,尽管 SPE 的强度对选择行为没有影响,但当参与者认为奖励结果取决于他们产生准确动作的能力时,他们更有可能对缺乏奖励进行折扣。这些结果提供了强有力的证据表明 SPE 不会直接影响强化学习。相反,当意外的结果可能源于动作执行错误时,参与者的代理感似乎在调节选择行为方面起着重要作用。当从动作的结果中学习时,大脑面临着一个信用分配问题:奖励的失败可能归因于选择不佳或动作执行不佳。在这里,我们测试了一个特定的假设,即基于小脑的感觉预测误差隐含地发出了执行错误的信号。我们评估了这个假设,并将其与一个更“自上而下”的假设进行了比较,即从执行错误中调节选择行为反映了参与者的代理感。我们发现感觉预测误差对强化学习没有显著影响。相反,影响参与者对因果结果的信念的指令似乎是影响其选择行为的主要因素。