Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.

Affiliations

Department of Psychology, University of California, Berkeley, 2121 Berkeley Way, Berkeley, CA 94704, USA.

Department of Psychology, Princeton University, South Drive, Princeton, NJ 08540, USA.

Publication information

Curr Biol. 2019 May 20;29(10):1606-1613.e5. doi: 10.1016/j.cub.2019.04.011. Epub 2019 May 2.

DOI: 10.1016/j.cub.2019.04.011

PMID: 31056386

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6535105/

Abstract

Decisions must be implemented through actions, and actions are prone to error. As such, when an expected outcome is not obtained, an individual should be sensitive to not only whether the choice itself was suboptimal but also whether the action required to indicate that choice was executed successfully. The intelligent assignment of credit to action execution versus action selection has clear ecological utility for the learner. To explore this, we used a modified version of a classic reinforcement learning task in which feedback indicated whether negative prediction errors were, or were not, associated with execution errors. Using fMRI, we asked if prediction error computations in the human striatum, a key substrate in reinforcement learning and decision making, are modulated when a failure in action execution results in the negative outcome. Participants were more tolerant of non-rewarded outcomes when these resulted from execution errors versus when execution was successful, but reward was withheld. Consistent with this behavior, a model-driven analysis of neural activity revealed an attenuation of the signal associated with negative reward prediction errors in the striatum following execution failures. These results converge with other lines of evidence suggesting that prediction errors in the mesostriatal dopamine system integrate high-level information during the evaluation of instantaneous reward outcomes.
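
The credit-assignment idea described in the abstract can be illustrated with a toy value update. The sketch below is not the model fitted in the paper. It assumes a simple delta-rule learner in which a negative reward prediction error is scaled down by a hypothetical gain (`failure_gain`) when the non-rewarded outcome follows an execution failure. The function name, the parameter values, and the gating mechanism itself are assumptions chosen only for illustration.

```python
def update_value(value, reward, execution_failed, alpha=0.3, failure_gain=0.25):
    """Return the value estimate after one trial of a delta-rule learner.

    value            -- current expected reward for the chosen option
    reward           -- obtained reward (0.0 when no reward is delivered)
    execution_failed -- True if the non-reward followed an execution error
    alpha            -- learning rate (illustrative value, not from the paper)
    failure_gain     -- hypothetical attenuation of negative prediction
                        errors after execution failures (0 = ignore, 1 = none)
    """
    delta = reward - value  # reward prediction error
    if execution_failed and delta < 0:
        # Assumption: the negative prediction error is attenuated when the
        # learner attributes the outcome to the action, not to the choice.
        delta *= failure_gain
    return value + alpha * delta


start = 0.8
# Same non-rewarded outcome, attributed to two different causes.
after_withheld = update_value(start, reward=0.0, execution_failed=False)
after_failure = update_value(start, reward=0.0, execution_failed=True)

print(f"reward withheld:   {after_withheld:.2f}")
print(f"execution failure: {after_failure:.2f}")
```

With these illustrative settings the value estimate falls from 0.80 to about 0.56 when reward is withheld after a successful action, but only to about 0.74 after an execution failure. The learner in this sketch therefore stays more willing to choose the same option again after an execution failure, which is the qualitative pattern the abstract reports.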

Similar articles

Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.

J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.

Credit assignment in movement-dependent reinforcement learning.

Proc Natl Acad Sci U S A. 2016 Jun 14;113(24):6797-802. doi: 10.1073/pnas.1523669113. Epub 2016 May 31.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making.

J Neurosci. 2007 Nov 21;27(47):12860-7. doi: 10.1523/JNEUROSCI.2496-07.2007.

The contribution of striatal pseudo-reward prediction errors to value-based decision-making.

Neuroimage. 2019 Jun;193:67-74. doi: 10.1016/j.neuroimage.2019.02.052. Epub 2019 Mar 7.

Signals in human striatum are appropriate for policy update rather than value prediction.

J Neurosci. 2011 Apr 6;31(14):5504-11. doi: 10.1523/JNEUROSCI.6316-10.2011.

Causal Inference Gates Corticostriatal Learning.

J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.

Neuronal basis for evaluating selected action in the primate striatum.

Eur J Neurosci. 2011 Aug;34(3):489-506. doi: 10.1111/j.1460-9568.2011.07771.x. Epub 2011 Jul 22.

Processing of action- but not stimulus-related prediction errors differs between active and observational feedback learning.

Neuropsychologia. 2015 Jan;66:75-87. doi: 10.1016/j.neuropsychologia.2014.10.036. Epub 2014 Nov 7.

Cited by

Impaired reinforcement learning and coding of prediction errors in patients with cerebellar degeneration - a study with EEG and voxel-based morphometry.

Cogn Affect Behav Neurosci. 2025 May 28. doi: 10.3758/s13415-025-01303-2.

Reaching vigor tracks learned prediction error.

bioRxiv. 2025 Mar 25:2025.03.24.645035. doi: 10.1101/2025.03.24.645035.

The cerebellum contributes to prediction error coding in reinforcement learning in humans.

J Neurosci. 2025 Mar 26;45(19). doi: 10.1523/JNEUROSCI.1972-24.2025.

Reward signals in the motor cortex: from biology to neurotechnology.

Nat Commun. 2025 Feb 3;16(1):1307. doi: 10.1038/s41467-024-55016-0.

Reconfigurations of cortical manifold structure during reward-based motor learning.

Elife. 2024 Jun 25;12:RP91928. doi: 10.7554/eLife.91928.

Functional neuroimaging as a catalyst for integrated neuroscience.

Nature. 2023 Nov;623(7986):263-273. doi: 10.1038/s41586-023-06670-9. Epub 2023 Nov 8.

Motor Plans under Uncertainty Reflect a Trade-Off between Maximizing Reward and Success.

eNeuro. 2022 Apr 12;9(2). doi: 10.1523/ENEURO.0503-21.2022. Print 2022 Mar-Apr.

The Role of Executive Function in Shaping Reinforcement Learning.

Curr Opin Behav Sci. 2021 Apr;38:66-73. doi: 10.1016/j.cobeha.2020.10.003. Epub 2020 Nov 14.

Distinct Neural Signatures of Outcome Monitoring After Selection and Execution Errors.

J Cogn Neurosci. 2022 Mar 31;34(5):748-765. doi: 10.1162/jocn_a_01824.

Executive Function Assigns Value to Novel Goal-Congruent Outcomes.

Cereb Cortex. 2021 Nov 23;32(1):231-247. doi: 10.1093/cercor/bhab205.
