When theory and biology differ: The relationship between reward prediction errors and expectancy.

Author information

Williams Chad C, Hassall Cameron D, Trska Robert, Holroyd Clay B, Krigolson Olave E

Affiliation

Centre for Biomedical Research, University of Victoria, Victoria, British Columbia, V8W 2Y2, Canada.

Publication information

Biol Psychol. 2017 Oct;129:265-272. doi: 10.1016/j.biopsycho.2017.09.007. Epub 2017 Sep 18.

DOI:10.1016/j.biopsycho.2017.09.007

PMID:28923360

Abstract

Comparisons between expectations and outcomes are critical for learning. Termed prediction errors, the violations of expectancy that occur when outcomes differ from expectations are used to modify value and shape behaviour. In the present study, we examined how a wide range of expectancy violations impacted neural signals associated with feedback processing. Participants performed a time estimation task in which they had to guess the duration of one second while their electroencephalogram was recorded. In a key manipulation, we varied task difficulty across the experiment to create a range of different feedback expectancies - reward feedback was either very expected, expected, 50/50, unexpected, or very unexpected. As predicted, the amplitude of the reward positivity, a component of the human event-related brain potential associated with feedback processing, scaled inversely with expectancy (e.g., unexpected feedback yielded a larger reward positivity than expected feedback). Interestingly, the scaling of the reward positivity to outcome expectancy was not linear as would be predicted by some theoretical models. Specifically, we found that the amplitude of the reward positivity was about equivalent for very expected and expected feedback, and for very unexpected and unexpected feedback. As such, our results demonstrate a sigmoidal relationship between reward expectancy and the amplitude of the reward positivity, with interesting implications for theories of reinforcement learning.
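The contrast the abstract draws between linear and sigmoidal scaling can be sketched numerically. The snippet below is only an illustration. The reward probabilities assigned to the five expectancy conditions, the logistic form, and the `steepness` and `midpoint` parameters are all assumptions chosen for the example; none of them are reported in the abstract or fitted to the study's data.

```python
import math

# Hypothetical reward probabilities for the five expectancy conditions.
# The abstract does not report these values; they are placeholders.
REWARD_PROBABILITY = {
    "very expected": 0.9,
    "expected": 0.7,
    "50/50": 0.5,
    "unexpected": 0.3,
    "very unexpected": 0.1,
}


def linear_prediction_error(p_reward, reward=1.0):
    """Textbook reward prediction error: received reward minus expected value."""
    return reward - p_reward * reward


def sigmoidal_response(p_reward, steepness=12.0, midpoint=0.5):
    """Illustrative logistic transform of the linear prediction error.

    steepness and midpoint are arbitrary illustration values, not estimates.
    """
    delta = linear_prediction_error(p_reward)
    return 1.0 / (1.0 + math.exp(-steepness * (delta - midpoint)))


for label, p in REWARD_PROBABILITY.items():
    linear = linear_prediction_error(p)
    sigmoid = sigmoidal_response(p)
    print(f"{label:>16}: linear={linear:.2f}  sigmoidal={sigmoid:.2f}")
```

Under these assumed values the linear signal changes by the same amount between every pair of adjacent conditions, whereas the logistic signal changes little between "very expected" and "expected" (and between "unexpected" and "very unexpected") and changes most around the 50/50 condition, which is the qualitative pattern the abstract describes.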

Similar articles

When theory and biology differ: The relationship between reward prediction errors and expectancy.

Biol Psychol. 2017 Oct;129:265-272. doi: 10.1016/j.biopsycho.2017.09.007. Epub 2017 Sep 18.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Feedback information and the reward positivity.

Int J Psychophysiol. 2018 Oct;132(Pt B):243-251. doi: 10.1016/j.ijpsycho.2017.11.017. Epub 2017 Dec 6.

Beta-gamma oscillation reveals learning from unexpected reward in learners versus non-learners.

Neuropsychologia. 2019 Aug;131:266-274. doi: 10.1016/j.neuropsychologia.2019.06.002. Epub 2019 Jun 4.

The better, the bigger: The effect of graded positive performance feedback on the reward positivity.

Biol Psychol. 2016 Feb;114:61-8. doi: 10.1016/j.biopsycho.2015.12.011. Epub 2016 Jan 3.

Manipulation of feedback expectancy and valence induces negative and positive reward prediction error signals manifest in event-related brain potentials.

Psychophysiology. 2011 May;48(5):656-64. doi: 10.1111/j.1469-8986.2010.01136.x. Epub 2010 Oct 5.

The application of reward learning in the real world: Changes in the reward positivity amplitude reflect learning in a medical education context.

Int J Psychophysiol. 2018 Oct;132(Pt B):236-242. doi: 10.1016/j.ijpsycho.2017.10.010. Epub 2017 Oct 27.

Perceptual Salience and Reward Both Influence Feedback-Related Neural Activity Arising from Choice.

J Neurosci. 2015 Sep 23;35(38):13064-75. doi: 10.1523/JNEUROSCI.1601-15.2015.

Acute stress impairs reward positivity effect in probabilistic learning.

Psychophysiology. 2020 Apr;57(4):e13531. doi: 10.1111/psyp.13531. Epub 2020 Jan 17.

Feedback delay impaired reinforcement learning: Principal components analysis of Reward Positivity.

Neurosci Lett. 2018 Oct 15;685:179-184. doi: 10.1016/j.neulet.2018.08.039. Epub 2018 Aug 28.

Cited by

Exploring when to exploit: the cognitive underpinnings of foraging-type decisions in relation to psychopathy.

Transl Psychiatry. 2025 Jan 28;15(1):31. doi: 10.1038/s41398-025-03245-2.

Differential neural processing of reward and self-relevance in a social gambling paradigm.

Cogn Affect Behav Neurosci. 2025 Apr;25(2):377-386. doi: 10.3758/s13415-024-01247-z. Epub 2024 Dec 16.

Reward processes in extinction learning and applications to exposure therapy.

J Anxiety Disord. 2024 Aug;106:102911. doi: 10.1016/j.janxdis.2024.102911. Epub 2024 Jul 29.

The neural correlates of continuous feedback processing.

Psychophysiology. 2023 Dec;60(12):e14399. doi: 10.1111/psyp.14399. Epub 2023 Jul 24.

Oscillatory brain activity links experience to expectancy during associative learning.

Psychophysiology. 2022 May;59(5):e13946. doi: 10.1111/psyp.13946. Epub 2021 Oct 7.

Single-trial modeling separates multiple overlapping prediction errors during reward processing in human EEG.

Commun Biol. 2021 Jul 23;4(1):910. doi: 10.1038/s42003-021-02426-1.

Processing of performance errors predicts memory formation: Enhanced feedback-related negativities for corrected versus repeated errors in an associative learning paradigm.

Eur J Neurosci. 2020 Feb;51(3):881-890. doi: 10.1111/ejn.14566. Epub 2019 Oct 3.
