Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.

Affiliations

Department of Psychology,

Department of Psychology and.

Publication information

J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.

DOI:10.1523/JNEUROSCI.3601-17.2018

PMID:29650698

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5943979/

Abstract

Failures to obtain reward can occur from errors in action selection or action execution. Recently, we observed marked differences in choice behavior when the failure to obtain a reward was attributed to errors in action execution compared with errors in action selection (McDougle et al., 2016). Specifically, participants appeared to solve this credit assignment problem by discounting outcomes in which the absence of reward was attributed to errors in action execution. Building on recent evidence indicating relatively direct communication between the cerebellum and basal ganglia, we hypothesized that cerebellar-dependent sensory prediction errors (SPEs), a signal indicating execution failure, could attenuate value updating within a basal ganglia-dependent reinforcement learning system. Here we compared the SPE hypothesis to an alternative, "top-down" hypothesis in which changes in choice behavior reflect participants' sense of agency. In two experiments with male and female human participants, we manipulated the strength of SPEs, along with the participants' sense of agency in the second experiment. The results showed that, whereas the strength of SPE had no effect on choice behavior, participants were much more likely to discount the absence of rewards under conditions in which they believed the reward outcome depended on their ability to produce accurate movements. These results provide strong evidence that SPEs do not directly influence reinforcement learning. Instead, a participant's sense of agency appears to play a significant role in modulating choice behavior when unexpected outcomes can arise from errors in action execution.

When learning from the outcome of actions, the brain faces a credit assignment problem: Failures of reward can be attributed to poor choice selection or poor action execution. Here, we test a specific hypothesis that execution errors are implicitly signaled by cerebellar-based sensory prediction errors. We evaluate this hypothesis and compare it with a more "top-down" hypothesis in which the modulation of choice behavior from execution errors reflects participants' sense of agency. We find that sensory prediction errors have no significant effect on reinforcement learning. Instead, instructions influencing participants' belief of causal outcomes appear to be the main factor influencing their choice behavior.
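
The idea of "discounting" unrewarded outcomes that are attributed to execution errors can be illustrated with a toy value update. The following is only a minimal sketch and not the authors' model: the function name `update_value`, the learning rate `alpha`, and the attenuation factor `gate` are all hypothetical, and a simple delta rule is assumed as the reinforcement learning update.

```python
def update_value(value, reward, execution_error, alpha=0.3, gate=0.2):
    """Delta-rule value update with an attenuated step after execution errors.

    Assumptions (illustrative, not taken from the paper):
      alpha -- hypothetical learning rate for ordinary outcomes
      gate  -- hypothetical fraction of the update kept when an unrewarded
               outcome is attributed to an execution error
    """
    # Attenuate learning only when reward is absent AND the miss is
    # attributed to how the action was executed, not which action was chosen.
    if execution_error and reward == 0:
        rate = alpha * gate
    else:
        rate = alpha
    return value + rate * (reward - value)


# Same unrewarded outcome, two different attributions.
after_selection_error = update_value(0.5, reward=0, execution_error=False)
after_execution_error = update_value(0.5, reward=0, execution_error=True)
print(after_selection_error, after_execution_error)
```

In this sketch the estimated value of the chosen option drops less after a miss attributed to execution than after a miss attributed to selection. Under the agency account described in the abstract, the attribution would follow the participant's belief about whether the outcome depended on their movement accuracy, rather than the strength of the sensory prediction error.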

Similar articles

Credit assignment in movement-dependent reinforcement learning.

Proc Natl Acad Sci U S A. 2016 Jun 14;113(24):6797-802. doi: 10.1073/pnas.1523669113. Epub 2016 May 31.

Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.

Curr Biol. 2019 May 20;29(10):1606-1613.e5. doi: 10.1016/j.cub.2019.04.011. Epub 2019 May 2.

The contribution of striatal pseudo-reward prediction errors to value-based decision-making.

Neuroimage. 2019 Jun;193:67-74. doi: 10.1016/j.neuroimage.2019.02.052. Epub 2019 Mar 7.

Navigating complex decision spaces: Problems and paradigms in sequential choice.

Psychol Bull. 2014 Mar;140(2):466-86. doi: 10.1037/a0033455. Epub 2013 Jul 8.

Try and try again: Post-error boost of an implicit measure of agency.

Q J Exp Psychol (Hove). 2018 Jul;71(7):1584-1595. doi: 10.1080/17470218.2017.1350871. Epub 2018 Jan 1.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Causal Inference Gates Corticostriatal Learning.

J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.

[Decision-making and learning by cortico-basal ganglia network].

Brain Nerve. 2008 Jul;60(7):799-813.

Functional disconnection of the orbitofrontal cortex and basolateral amygdala impairs acquisition of a rat gambling task and disrupts animals' ability to alter decision-making behavior after reinforcer devaluation.

J Neurosci. 2013 Apr 10;33(15):6434-43. doi: 10.1523/JNEUROSCI.3971-12.2013.

Cited by

Implicit sensorimotor learning in ballistic movement for transporting an object to a target.

Sci Rep. 2024 Sep 9;14(1):21003. doi: 10.1038/s41598-024-71925-y.

Fundamental processes in sensorimotor learning: Reasoning, refinement, and retrieval.

Elife. 2024 Aug 1;13:e91839. doi: 10.7554/eLife.91839.

Implicit Adaptation Is Modulated by the Relevance of Feedback.

J Cogn Neurosci. 2024 Jun 1;36(6):1206-1220. doi: 10.1162/jocn_a_02160.

Memory, perceptual, and motor costs affect the strength of categorical encoding during motor learning of object properties.

Sci Rep. 2023 May 27;13(1):8619. doi: 10.1038/s41598-023-33515-2.

Decision heuristics in contexts integrating action selection and execution.

Sci Rep. 2023 Apr 20;13(1):6486. doi: 10.1038/s41598-023-33008-2.

Sensorimotor feedback loops are selectively sensitive to reward.

Elife. 2023 Jan 13;12:e81325. doi: 10.7554/eLife.81325.

Motor Plans under Uncertainty Reflect a Trade-Off between Maximizing Reward and Success.

eNeuro. 2022 Apr 12;9(2). doi: 10.1523/ENEURO.0503-21.2022. Print 2022 Mar-Apr.

Decision neuroscience and neuroeconomics: Recent progress and ongoing challenges.

Wiley Interdiscip Rev Cogn Sci. 2022 May;13(3):e1589. doi: 10.1002/wcs.1589. Epub 2022 Feb 8.

Distinct Neural Signatures of Outcome Monitoring After Selection and Execution Errors.

J Cogn Neurosci. 2022 Mar 31;34(5):748-765. doi: 10.1162/jocn_a_01824.

Modulation of neural activity in frontopolar cortex drives reward-based motor learning.

Sci Rep. 2021 Oct 13;11(1):20303. doi: 10.1038/s41598-021-98571-y.

References

Invariant errors reveal limitations in motor correction rather than constraints on error sensitivity.

Commun Biol. 2018 Mar 22;1:19. doi: 10.1038/s42003-018-0021-y. eCollection 2018.

Working Memory Load Strengthens Reward Prediction Errors.

J Neurosci. 2017 Apr 19;37(16):4332-4342. doi: 10.1523/JNEUROSCI.2700-16.2017. Epub 2017 Mar 20.

Feedback delay attenuates implicit but facilitates explicit adjustments to a visuomotor rotation.

Neurobiol Learn Mem. 2017 Apr;140:124-133. doi: 10.1016/j.nlm.2017.02.015. Epub 2017 Feb 28.

Characteristics of Implicit Sensorimotor Adaptation Revealed by Task-irrelevant Clamped Feedback.

J Cogn Neurosci. 2017 Jun;29(6):1061-1074. doi: 10.1162/jocn_a_01108. Epub 2017 Feb 14.

Credit assignment in movement-dependent reinforcement learning.

Proc Natl Acad Sci U S A. 2016 Jun 14;113(24):6797-802. doi: 10.1073/pnas.1523669113. Epub 2016 May 31.

Dopamine reward prediction error coding.

Dialogues Clin Neurosci. 2016 Mar;18(1):23-32. doi: 10.31887/DCNS.2016.18.1/wschultz.

Delayed feedback during sensorimotor learning selectively disrupts adaptation but not strategy use.

J Neurophysiol. 2016 Mar;115(3):1499-511. doi: 10.1152/jn.00066.2015. Epub 2016 Jan 20.

Short latency cerebellar modulation of the basal ganglia.

Nat Neurosci. 2014 Dec;17(12):1767-75. doi: 10.1038/nn.3868. Epub 2014 Nov 17.

Using Bayes to get the most out of non-significant results.

Front Psychol. 2014 Jul 29;5:781. doi: 10.3389/fpsyg.2014.00781. eCollection 2014.

Parabolic discounting of monetary rewards by physical effort.

Behav Processes. 2013 Nov;100:192-6. doi: 10.1016/j.beproc.2013.09.014. Epub 2013 Oct 15.
