• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

区分奖励预测误差对试验水平适应和长期学习的贡献。

Dissociating the contributions of reward-prediction errors to trial-level adaptation and long-term learning.

机构信息

University of Utah, Department of Health, Kinesiology, and Recreation, United States; University of Utah, Department of Physical Therapy and Athletic Training, United States.

Auburn University, School of Kinesiology, United States; Auburn University, Center for Neuroscience, United States.

出版信息

Biol Psychol. 2020 Jan;149:107775. doi: 10.1016/j.biopsycho.2019.107775. Epub 2019 Sep 26.

DOI:10.1016/j.biopsycho.2019.107775
PMID:31563586
Abstract

Reward positivity (RewP) is an EEG component reflecting reward-prediction errors. Using multilevel models, we measured single-trial RewP amplitude from trial-to-trial, while reward and prediction varied during learning. Sixty participants completed a category-learning task in either engaging or sterile conditions with the RewP time-locked to feedback. Sequential analysis of single-trial RewP showed its relationship to current and previous accuracy, and the probability of changing one's response to subsequent stimuli. Simulations show these effects can be explained in detail by the dynamics of participants' expectations according to principles of reinforcement learning. The single-trial RewP findings were consistent with previous literature linking RewP to reward-prediction error under reinforcement-learning theory. In contrast, the aggregate RewP was unrelated to the engagement manipulation or to delayed retention performance. Thus the present results provide a detailed computational account how RewP relates to acute adaptation, but suggest RewP plays little role in long-term learning.

摘要

奖励正波(RewP)是反映奖励预测误差的一种 EEG 成分。我们使用多层模型,在学习过程中随着奖励和预测的变化,从一次试验到另一次试验测量单次试验 RewP 幅度。60 名参与者在参与或无菌条件下完成了一项类别学习任务,RewP 与反馈时间锁定。对单次试验 RewP 的序列分析表明,它与当前和以前的准确性以及对后续刺激改变反应的可能性有关。模拟表明,根据强化学习的原则,参与者的期望动态可以详细解释这些影响。单次试验 RewP 的发现与强化学习理论下 RewP 与奖励预测误差相关的先前文献一致。相比之下,总体 RewP 与参与操作或延迟保留表现无关。因此,目前的结果提供了一个详细的计算说明,说明 RewP 如何与急性适应相关,但表明 RewP 在长期学习中作用不大。

相似文献

1
Dissociating the contributions of reward-prediction errors to trial-level adaptation and long-term learning.区分奖励预测误差对试验水平适应和长期学习的贡献。
Biol Psychol. 2020 Jan;149:107775. doi: 10.1016/j.biopsycho.2019.107775. Epub 2019 Sep 26.
2
The reward positivity is sensitive to affective liking.正性奖励敏感于情感喜好。
Cogn Affect Behav Neurosci. 2022 Apr;22(2):258-267. doi: 10.3758/s13415-021-00950-5. Epub 2021 Oct 1.
3
Prediction-error-dependent processing of immediate and delayed positive feedback.即时和延迟正反馈的预测误差依赖性处理。
Sci Rep. 2024 Apr 27;14(1):9674. doi: 10.1038/s41598-024-60328-8.
4
Reduced positive affect alters reward learning via reduced information encoding in the Reward Positivity.积极情绪减少通过降低奖励正性的信息编码改变奖励学习。
Psychophysiology. 2023 Aug;60(8):e14276. doi: 10.1111/psyp.14276. Epub 2023 Feb 19.
5
Reinforcement learning and the reward positivity with aversive outcomes.强化学习与令人厌恶的结果的奖励正性。
Psychophysiology. 2024 Apr;61(4):e14460. doi: 10.1111/psyp.14460. Epub 2023 Nov 22.
6
Acute stress impairs reward positivity effect in probabilistic learning.急性应激会损害概率学习中的奖赏积极效应。
Psychophysiology. 2020 Apr;57(4):e13531. doi: 10.1111/psyp.13531. Epub 2020 Jan 17.
7
Outcome valence and stimulus frequency affect neural responses to rewards and punishments.结果效价和刺激频率影响对奖励和惩罚的神经反应。
Psychophysiology. 2022 Mar;59(3):e13981. doi: 10.1111/psyp.13981. Epub 2021 Nov 30.
8
The aversion positivity: Mediofrontal cortical potentials reflect parametric aversive prediction errors and drive behavioral modification following negative reinforcement.厌恶正性化:额眶部皮质电势反映了参数性厌恶预测误差,并在负强化后驱动行为修正。
Cortex. 2021 Jul;140:26-39. doi: 10.1016/j.cortex.2021.03.012. Epub 2021 Mar 27.
9
Dissociating the effect of reward uncertainty and timing uncertainty on neural indices of reward prediction errors: A reward positivity (RewP) event-related potential (ERP) study.区分奖赏不确定性和时间不确定性对奖赏预测误差神经指标的影响:奖赏正波(RewP)事件相关电位(ERP)研究。
Biol Psychol. 2021 Jul;163:108121. doi: 10.1016/j.biopsycho.2021.108121. Epub 2021 May 29.
10
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

引用本文的文献

1
Reinforcement learning in motor skill acquisition: using the reward positivity to understand the mechanisms underlying short- and long-term behavior adaptation.运动技能习得中的强化学习:利用奖励积极性来理解短期和长期行为适应背后的机制。
Front Behav Neurosci. 2024 Oct 30;18:1466970. doi: 10.3389/fnbeh.2024.1466970. eCollection 2024.
2
Learning when effort matters: neural dynamics underlying updating and adaptation to changes in performance efficacy.学习何时需要努力:表现效能变化时更新和适应的神经动力学基础。
Cereb Cortex. 2023 Feb 20;33(5):2395-2411. doi: 10.1093/cercor/bhac215.
3
Examining Social Cognition with Embodied Robots: Does Prior Experience with a Robot Impact Feedback-associated Learning in a Gambling Task?
使用具身机器人研究社会认知:与机器人的先前经验是否会影响赌博任务中与反馈相关的学习?
J Cogn. 2021 May 31;4(1):28. doi: 10.5334/joc.167.
4
Response-based outcome predictions and confidence regulate feedback processing and learning.基于反应的结果预测和置信度调节反馈处理和学习。
Elife. 2021 Apr 30;10:e62825. doi: 10.7554/eLife.62825.
5
Modeling the influence of working memory, reinforcement, and action uncertainty on reaction time and choice during instrumental learning.建立模型,分析工作记忆、强化、动作不确定性对工具性学习过程中反应时和选择的影响。
Psychon Bull Rev. 2021 Feb;28(1):20-39. doi: 10.3758/s13423-020-01774-z.