• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

指令和经验对反馈相关负波的调制。

Modulation of the feedback-related negativity by instruction and experience.

机构信息

Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA.

出版信息

Proc Natl Acad Sci U S A. 2011 Nov 22;108(47):19048-53. doi: 10.1073/pnas.1117189108. Epub 2011 Nov 7.

DOI:10.1073/pnas.1117189108
PMID:22065792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3223452/
Abstract

A great deal of research focuses on how humans and animals learn from trial-and-error interactions with the environment. This research has established the viability of reinforcement learning as a model of behavioral adaptation and neural reward valuation. Error-driven learning is inefficient and dangerous, however. Fortunately, humans learn from nonexperiential sources of information as well. In the present study, we focused on one such form of information, instruction. We recorded event-related potentials as participants performed a probabilistic learning task. In one experiment condition, participants received feedback only about whether their responses were rewarded. In the other condition, they also received instruction about reward probabilities before performing the task. We found that instruction eliminated participants' reliance on feedback as evidenced by their immediate asymptotic performance in the instruction condition. In striking contrast, the feedback-related negativity, an event-related potential component thought to reflect neural reward prediction error, continued to adapt with experience in both conditions. These results show that, whereas instruction may immediately control behavior, certain neural responses must be learned from experience.

摘要

大量研究关注人类和动物如何通过与环境的反复互动来学习。这项研究证实了强化学习作为行为适应和神经奖励估值模型的可行性。然而,错误驱动的学习效率低下且危险。幸运的是,人类也可以从非经验信息来源中学习。在本研究中,我们专注于一种这样的信息,即指导。我们记录了参与者在执行概率学习任务时的事件相关电位。在一个实验条件下,参与者仅收到关于他们的反应是否得到奖励的反馈。在另一个条件下,他们在执行任务之前还收到关于奖励概率的指导。我们发现,指导消除了参与者对反馈的依赖,这从他们在指导条件下立即达到渐近表现就可以看出。相比之下,反馈相关负波,一种被认为反映神经奖励预测误差的事件相关电位成分,在两种条件下都继续随着经验而适应。这些结果表明,虽然指导可以立即控制行为,但某些神经反应必须从经验中学习。

相似文献

1
Modulation of the feedback-related negativity by instruction and experience.指令和经验对反馈相关负波的调制。
Proc Natl Acad Sci U S A. 2011 Nov 22;108(47):19048-53. doi: 10.1073/pnas.1117189108. Epub 2011 Nov 7.
2
Feedback-related negativity codes prediction error but not behavioral adjustment during probabilistic reversal learning.反馈相关负性波编码预测误差,但不编码概率反转学习中的行为调整。
J Cogn Neurosci. 2011 Apr;23(4):936-46. doi: 10.1162/jocn.2010.21456. Epub 2010 Feb 10.
3
Electrophysiological responses to feedback during the application of abstract rules.在应用抽象规则时对反馈的电生理反应。
J Cogn Neurosci. 2013 Nov;25(11):1986-2002. doi: 10.1162/jocn_a_00454. Epub 2013 Aug 5.
4
Disrupted reinforcement learning and maladaptive behavior in women with a history of childhood sexual abuse: a high-density event-related potential study.有童年性虐待史的女性中强化学习中断和适应不良行为:一项高密度事件相关电位研究。
JAMA Psychiatry. 2013 May;70(5):499-507. doi: 10.1001/jamapsychiatry.2013.728.
5
EEG correlates of physical effort and reward processing during reinforcement learning.脑电图对强化学习过程中体力消耗和奖励处理的相关性研究。
J Neurophysiol. 2020 Aug 1;124(2):610-622. doi: 10.1152/jn.00370.2020. Epub 2020 Jul 29.
6
Feedback information and the reward positivity.反馈信息与正性奖励。
Int J Psychophysiol. 2018 Oct;132(Pt B):243-251. doi: 10.1016/j.ijpsycho.2017.11.017. Epub 2017 Dec 6.
7
Acting in Temporal Contexts: On the Behavioral and Neurophysiological Consequences of Feedback Delays.在时间语境中行动:反馈延迟的行为和神经生理学后果。
Neuroscience. 2022 Mar 15;486:91-102. doi: 10.1016/j.neuroscience.2021.06.028. Epub 2021 Jun 24.
8
Dissociating the contributions of reward-prediction errors to trial-level adaptation and long-term learning.区分奖励预测误差对试验水平适应和长期学习的贡献。
Biol Psychol. 2020 Jan;149:107775. doi: 10.1016/j.biopsycho.2019.107775. Epub 2019 Sep 26.
9
The impact of stress on feedback and error processing during behavioral adaptation.压力对行为适应过程中反馈与错误处理的影响。
Neuropsychologia. 2015 May;71:181-90. doi: 10.1016/j.neuropsychologia.2015.04.004. Epub 2015 Apr 7.
10
The feedback-related negativity indexes prediction error in active but not observational learning.反馈相关负性波指标预测主动学习而非观察学习中的预测误差。
Psychophysiology. 2019 Sep;56(9):e13389. doi: 10.1111/psyp.13389. Epub 2019 May 3.

引用本文的文献

1
Neural dissociation between reward and salience prediction errors through the lens of optimistic bias.通过乐观偏差的视角来看,神经在奖励和突显预测误差之间的分离。
Hum Brain Mapp. 2023 Aug 15;44(12):4545-4560. doi: 10.1002/hbm.26398. Epub 2023 Jun 19.
2
Effects of subjective and objective task difficulties for feedback- related brain potentials in social situations: An electroencephalogram study.社会情境下反馈相关脑电位的主观和客观任务难度的影响:一项脑电图研究。
PLoS One. 2022 Dec 1;17(12):e0277663. doi: 10.1371/journal.pone.0277663. eCollection 2022.
3
Confirmation Bias in the Course of Instructed Reinforcement Learning in Schizophrenia-Spectrum Disorders.精神分裂症谱系障碍中指导性强化学习过程中的确认偏差。
Brain Sci. 2022 Jan 11;12(1):90. doi: 10.3390/brainsci12010090.
4
To organise or not to organise? Understanding search strategy preferences using Lego building blocks.组织还是不组织?使用乐高积木理解搜索策略偏好。
Q J Exp Psychol (Hove). 2022 May;75(5):869-891. doi: 10.1177/17470218211040724. Epub 2021 Sep 2.
5
Temporal Fluctuation of Mood in Gaming Task Modulates Feedback Negativity: EEG Study With Virtual Reality.游戏任务中情绪的时间波动对反馈负波的调制:虚拟现实脑电图研究
Front Hum Neurosci. 2021 Jun 3;15:536288. doi: 10.3389/fnhum.2021.536288. eCollection 2021.
6
Selective Devaluation Affects the Processing of Preferred Rewards.选择性贬值影响偏好奖励的加工。
Cogn Affect Behav Neurosci. 2021 Oct;21(5):1010-1025. doi: 10.3758/s13415-021-00904-x. Epub 2021 Apr 30.
7
The influence of internal models on feedback-related brain activity.内部模型对反馈相关脑活动的影响。
Cogn Affect Behav Neurosci. 2020 Oct;20(5):1070-1089. doi: 10.3758/s13415-020-00820-6.
8
An Event-Related Potential Study of Decision-Making and Feedback Utilization in Female College Students Who Binge Drink.对酗酒女大学生决策与反馈利用的事件相关电位研究。
Front Psychol. 2019 Nov 22;10:2606. doi: 10.3389/fpsyg.2019.02606. eCollection 2019.
9
Aberrant reward prediction error during Pavlovian appetitive learning in alexithymia.在述情障碍的条件性味觉学习中,异常的奖赏预测错误。
Soc Cogn Affect Neurosci. 2019 Oct 1;14(10):1119-1129. doi: 10.1093/scan/nsz089.
10
Electrophysiological measures reveal the role of anterior cingulate cortex in learning from unreliable feedback.电生理测量揭示了前扣带皮层在从不可靠反馈中学习的作用。
Cogn Affect Behav Neurosci. 2018 Oct;18(5):949-963. doi: 10.3758/s13415-018-0615-3.

本文引用的文献

1
Event-related brain potentials following incorrect feedback in a time-estimation task: evidence for a "generic" neural system for error detection.在时间估计任务中出现错误反馈后的事件相关脑电位:错误检测的“通用”神经系统证据。
J Cogn Neurosci. 1997 Nov;9(6):788-98. doi: 10.1162/jocn.1997.9.6.788.
2
Computational models for the combination of advice and individual learning.用于建议和个体学习相结合的计算模型。
Cogn Sci. 2009 Mar;33(2):206-42. doi: 10.1111/j.1551-6709.2009.01010.x.
3
Dopaminergic genes predict individual differences in susceptibility to confirmation bias.多巴胺能基因预测个体对确认偏误易感性的差异。
J Neurosci. 2011 Apr 20;31(16):6188-98. doi: 10.1523/JNEUROSCI.6486-10.2011.
4
Model-based influences on humans' choices and striatal prediction errors.基于模型的影响对人类选择和纹状体预测误差的影响。
Neuron. 2011 Mar 24;69(6):1204-15. doi: 10.1016/j.neuron.2011.02.027.
5
Learning from delayed feedback: neural responses in temporal credit assignment.从延迟反馈中学习:时间信用分配中的神经反应。
Cogn Affect Behav Neurosci. 2011 Jun;11(2):131-43. doi: 10.3758/s13415-011-0027-0.
6
How instructed knowledge modulates the neural systems of reward learning.指导知识如何调节奖励学习的神经系统。
Proc Natl Acad Sci U S A. 2011 Jan 4;108(1):55-60. doi: 10.1073/pnas.1014938108. Epub 2010 Dec 20.
7
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.状态与奖励:基于模型和无模型强化学习的分离神经预测误差信号。
Neuron. 2010 May 27;66(4):585-95. doi: 10.1016/j.neuron.2010.04.016.
8
Model-based analyses: Promises, pitfalls, and example applications to the study of cognitive control.基于模型的分析:认知控制研究中的前景、陷阱及示例应用
Q J Exp Psychol (Hove). 2012;65(2):252-67. doi: 10.1080/17470211003668272. Epub 2011 Jun 24.
9
Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action.人类和啮齿动物在动作控制中的同源性:皮质纹状体对目标导向和习惯性动作的决定因素。
Neuropsychopharmacology. 2010 Jan;35(1):48-69. doi: 10.1038/npp.2009.131.
10
Instructional control of reinforcement learning: a behavioral and neurocomputational investigation.强化学习的指令控制:一项行为与神经计算研究。
Brain Res. 2009 Nov 24;1299:74-94. doi: 10.1016/j.brainres.2009.07.007. Epub 2009 Aug 3.