Adaptive behaviour and feedback processing integrate experience and instruction in reinforcement learning.

Authors

Schiffer Anne-Marike, Siletti Kayla, Waszak Florian, Yeung Nick

Affiliations

Department of Experimental Psychology, University of Oxford, OX1 3UD Oxford, UK; Université Paris Descartes, Sorbonne Paris Cité, Paris, France; CNRS (Laboratoire Psychologie de la Perception, UMR 8158), Paris, France.

Department of Experimental Psychology, University of Oxford, OX1 3UD Oxford, UK.

Publication information

Neuroimage. 2017 Feb 1;146:626-641. doi: 10.1016/j.neuroimage.2016.08.057. Epub 2016 Aug 27.

DOI:10.1016/j.neuroimage.2016.08.057
PMID:27577720
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5312784/
Abstract

In any non-deterministic environment, unexpected events can indicate true changes in the world (and require behavioural adaptation) or reflect chance occurrence (and must be discounted). Adaptive behaviour requires distinguishing these possibilities. We investigated how humans achieve this by integrating high-level information from instruction and experience. In a series of EEG experiments, instructions modulated the perceived informativeness of feedback: Participants performed a novel probabilistic reinforcement learning task, receiving instructions about reliability of feedback or volatility of the environment. Importantly, our designs de-confound informativeness from surprise, which typically co-vary. Behavioural results indicate that participants used instructions to adapt their behaviour faster to changes in the environment when instructions indicated that negative feedback was more informative, even if it was simultaneously less surprising. This study is the first to show that neural markers of feedback anticipation (stimulus-preceding negativity) and of feedback processing (feedback-related negativity; FRN) reflect informativeness of unexpected feedback. Meanwhile, changes in P3 amplitude indicated imminent adjustments in behaviour. Collectively, our findings provide new evidence that high-level information interacts with experience-driven learning in a flexible manner, enabling human learners to make informed decisions about whether to persevere or explore new options, a pivotal ability in our complex environment.
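
The abstract does not specify a computational model, so the following is only an illustrative sketch and not the authors' method. It assumes a standard delta-rule learner in which a hypothetical `negative_weight` parameter stands in for instructed informativeness: it scales the learning rate applied to negative feedback. All names, the starting value, and the switching threshold are assumptions chosen for illustration. The sketch counts how many consecutive negative outcomes it takes before the value of the current option falls below the threshold at which the learner would switch.

```python
def update_value(value, outcome, base_rate, negative_weight):
    """Delta-rule update of a value estimate.

    outcome is 1.0 for positive and 0.0 for negative feedback.
    negative_weight (hypothetical) scales the learning rate for negative
    feedback, standing in for how informative instructions say it is.
    """
    rate = base_rate * negative_weight if outcome == 0.0 else base_rate
    rate = min(rate, 1.0)  # keep the learning rate in a valid range
    return value + rate * (outcome - value)


def trials_until_switch(negative_weight, start_value=0.8, threshold=0.5,
                        base_rate=0.2, max_trials=1000):
    """Count consecutive negative outcomes until value drops below threshold."""
    value = start_value
    trials = 0
    while value >= threshold and trials < max_trials:
        value = update_value(value, 0.0, base_rate, negative_weight)
        trials += 1
    return trials


if __name__ == "__main__":
    for weight in (0.5, 1.0, 2.0):
        print(weight, trials_until_switch(weight))
```

Under these assumptions, a larger `negative_weight` leads to a switch after fewer negative outcomes, which loosely mirrors the behavioural result that participants adapted faster when instructions marked negative feedback as more informative.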

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/e5ae22c0c5d2/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/b46bc6915043/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/39052887430c/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/2c207ede2433/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/abd61b02ec98/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/bfc07e15fffe/gr6.jpg
