Adaptive behaviour and feedback processing integrate experience and instruction in reinforcement learning.

Authors

Schiffer Anne-Marike, Siletti Kayla, Waszak Florian, Yeung Nick

Affiliations

Department of Experimental Psychology, University of Oxford, Oxford OX1 3UD, UK; Université Paris Descartes, Sorbonne Paris Cité, Paris, France; CNRS (Laboratoire Psychologie de la Perception, UMR 8158), Paris, France.

Department of Experimental Psychology, University of Oxford, Oxford OX1 3UD, UK.

Publication information

Neuroimage. 2017 Feb 1;146:626-641. doi: 10.1016/j.neuroimage.2016.08.057. Epub 2016 Aug 27.

Abstract

In any non-deterministic environment, unexpected events can indicate true changes in the world (and require behavioural adaptation) or reflect chance occurrence (and must be discounted). Adaptive behaviour requires distinguishing these possibilities. We investigated how humans achieve this by integrating high-level information from instruction and experience. In a series of EEG experiments, instructions modulated the perceived informativeness of feedback: Participants performed a novel probabilistic reinforcement learning task, receiving instructions about reliability of feedback or volatility of the environment. Importantly, our designs de-confound informativeness from surprise, which typically co-vary. Behavioural results indicate that participants used instructions to adapt their behaviour faster to changes in the environment when instructions indicated that negative feedback was more informative, even if it was simultaneously less surprising. This study is the first to show that neural markers of feedback anticipation (stimulus-preceding negativity) and of feedback processing (feedback-related negativity; FRN) reflect informativeness of unexpected feedback. Meanwhile, changes in P3 amplitude indicated imminent adjustments in behaviour. Collectively, our findings provide new evidence that high-level information interacts with experience-driven learning in a flexible manner, enabling human learners to make informed decisions about whether to persevere or explore new options, a pivotal ability in our complex environment.
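
The distinction between surprise and informativeness can be illustrated with a toy learner. The sketch below is not the authors' model (the abstract does not specify one); it assumes a simple delta-rule update in which a hypothetical `informativeness` weight, standing in for the instructed reliability of feedback, scales the learning rate. All function names, parameters, and numeric values are illustrative assumptions.

```python
def update_value(value, reward, base_rate, informativeness):
    """One delta-rule step; the learning rate is scaled by instructed informativeness.

    Assumption: informativeness is a weight in [0, 1] supplied by instruction,
    not something the learner estimates from experience.
    """
    prediction_error = reward - value  # "surprise": mismatch between outcome and expectation
    learning_rate = base_rate * informativeness
    return value + learning_rate * prediction_error, prediction_error


def trials_to_switch(informativeness, base_rate=0.5, start_value=0.9,
                     threshold=0.5, max_trials=1000):
    """Count consecutive negative outcomes (reward 0) needed before the
    option's value falls below the threshold, i.e. before the learner
    would abandon it. Returns None if that never happens within max_trials.
    """
    value = start_value
    for trial in range(1, max_trials + 1):
        value, _ = update_value(value, 0.0, base_rate, informativeness)
        if value < threshold:
            return trial
    return None


# The first negative outcome is equally surprising in both conditions...
_, pe_informative = update_value(0.9, 0.0, 0.5, 1.0)
_, pe_uninformative = update_value(0.9, 0.0, 0.5, 0.2)

# ...but the learner adapts faster when feedback is instructed to be informative.
fast = trials_to_switch(informativeness=1.0)
slow = trials_to_switch(informativeness=0.2)
```

In this sketch the prediction error on the first negative outcome is identical across conditions, while the number of negative outcomes needed before switching differs. That mirrors, in a very simplified way, the idea of separating how surprising feedback is from how much it should change behaviour.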

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6012/5312784/e5ae22c0c5d2/gr1.jpg
