Suppr超能文献

在一项运动决策任务中,信用分配受机构影响,而不受感官预测误差影响。

Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.

机构信息

Department of Psychology,

Department of Psychology and.

出版信息

J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.

Abstract

Failures to obtain reward can occur from errors in action selection or action execution. Recently, we observed marked differences in choice behavior when the failure to obtain a reward was attributed to errors in action execution compared with errors in action selection (McDougle et al., 2016). Specifically, participants appeared to solve this credit assignment problem by discounting outcomes in which the absence of reward was attributed to errors in action execution. Building on recent evidence indicating relatively direct communication between the cerebellum and basal ganglia, we hypothesized that cerebellar-dependent sensory prediction errors (SPEs), a signal indicating execution failure, could attenuate value updating within a basal ganglia-dependent reinforcement learning system. Here we compared the SPE hypothesis to an alternative, "top-down" hypothesis in which changes in choice behavior reflect participants' sense of agency. In two experiments with male and female human participants, we manipulated the strength of SPEs, along with the participants' sense of agency in the second experiment. The results showed that, whereas the strength of SPE had no effect on choice behavior, participants were much more likely to discount the absence of rewards under conditions in which they believed the reward outcome depended on their ability to produce accurate movements. These results provide strong evidence that SPEs do not directly influence reinforcement learning. Instead, a participant's sense of agency appears to play a significant role in modulating choice behavior when unexpected outcomes can arise from errors in action execution. When learning from the outcome of actions, the brain faces a credit assignment problem: Failures of reward can be attributed to poor choice selection or poor action execution. Here, we test a specific hypothesis that execution errors are implicitly signaled by cerebellar-based sensory prediction errors. We evaluate this hypothesis and compare it with a more "top-down" hypothesis in which the modulation of choice behavior from execution errors reflects participants' sense of agency. We find that sensory prediction errors have no significant effect on reinforcement learning. Instead, instructions influencing participants' belief of causal outcomes appear to be the main factor influencing their choice behavior.

摘要

未能获得奖励可能源于动作选择或动作执行中的错误。最近,我们观察到在归因于动作执行错误而不是动作选择错误的情况下,选择行为存在明显差异(McDougle 等人,2016)。具体来说,参与者似乎通过对归因于动作执行错误而导致缺乏奖励的结果进行折扣来解决这种信用分配问题。基于最近的证据表明小脑和基底神经节之间存在相对直接的通信,我们假设小脑依赖的感觉预测误差(SPE),表示执行失败的信号,可以减弱基于基底神经节的强化学习系统中的价值更新。在这里,我们将 SPE 假设与替代的“自上而下”假设进行了比较,其中选择行为的变化反映了参与者的代理感。在两项有男性和女性人类参与者参与的实验中,我们操纵了 SPE 的强度以及参与者在第二项实验中的代理感。结果表明,尽管 SPE 的强度对选择行为没有影响,但当参与者认为奖励结果取决于他们产生准确动作的能力时,他们更有可能对缺乏奖励进行折扣。这些结果提供了强有力的证据表明 SPE 不会直接影响强化学习。相反,当意外的结果可能源于动作执行错误时,参与者的代理感似乎在调节选择行为方面起着重要作用。当从动作的结果中学习时,大脑面临着一个信用分配问题:奖励的失败可能归因于选择不佳或动作执行不佳。在这里,我们测试了一个特定的假设,即基于小脑的感觉预测误差隐含地发出了执行错误的信号。我们评估了这个假设,并将其与一个更“自上而下”的假设进行了比较,即从执行错误中调节选择行为反映了参与者的代理感。我们发现感觉预测误差对强化学习没有显著影响。相反,影响参与者对因果结果的信念的指令似乎是影响其选择行为的主要因素。

相似文献

2
Credit assignment in movement-dependent reinforcement learning.运动依赖型强化学习中的信用分配
Proc Natl Acad Sci U S A. 2016 Jun 14;113(24):6797-802. doi: 10.1073/pnas.1523669113. Epub 2016 May 31.
6
Try and try again: Post-error boost of an implicit measure of agency.不断尝试:错误后对能动性内隐测量的促进作用。
Q J Exp Psychol (Hove). 2018 Jul;71(7):1584-1595. doi: 10.1080/17470218.2017.1350871. Epub 2018 Jan 1.
8
Causal Inference Gates Corticostriatal Learning.因果推理门控皮质纹状体学习。
J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.

引用本文的文献

3
Implicit Adaptation Is Modulated by the Relevance of Feedback.内隐适应受反馈相关性的调节。
J Cogn Neurosci. 2024 Jun 1;36(6):1206-1220. doi: 10.1162/jocn_a_02160.

本文引用的文献

2
Working Memory Load Strengthens Reward Prediction Errors.工作记忆负荷增强奖励预测误差。
J Neurosci. 2017 Apr 19;37(16):4332-4342. doi: 10.1523/JNEUROSCI.2700-16.2017. Epub 2017 Mar 20.
5
Credit assignment in movement-dependent reinforcement learning.运动依赖型强化学习中的信用分配
Proc Natl Acad Sci U S A. 2016 Jun 14;113(24):6797-802. doi: 10.1073/pnas.1523669113. Epub 2016 May 31.
6
Dopamine reward prediction error coding.多巴胺奖励预测误差编码。
Dialogues Clin Neurosci. 2016 Mar;18(1):23-32. doi: 10.31887/DCNS.2016.18.1/wschultz.
8
Short latency cerebellar modulation of the basal ganglia.短潜伏期小脑对基底神经节的调制。
Nat Neurosci. 2014 Dec;17(12):1767-75. doi: 10.1038/nn.3868. Epub 2014 Nov 17.
9
Using Bayes to get the most out of non-significant results.贝叶斯推断在不显著结果中的应用。
Front Psychol. 2014 Jul 29;5:781. doi: 10.3389/fpsyg.2014.00781. eCollection 2014.
10
Parabolic discounting of monetary rewards by physical effort.体力付出对货币奖励的抛物线式折扣。
Behav Processes. 2013 Nov;100:192-6. doi: 10.1016/j.beproc.2013.09.014. Epub 2013 Oct 15.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验