The aversion positivity: Mediofrontal cortical potentials reflect parametric aversive prediction errors and drive behavioral modification following negative reinforcement.

Affiliations

Department of Psychiatry and Behavioral Sciences, University of Minnesota Health, USA.

Department of Psychological Sciences, University of Arkansas, USA.

Publication information

Cortex. 2021 Jul;140:26-39. doi: 10.1016/j.cortex.2021.03.012. Epub 2021 Mar 27.

Abstract

Reinforcement learning capitalizes on prediction errors (PEs), representing the deviation of received outcomes from expected outcomes. Mediofrontal event-related potentials (ERPs), in particular the feedback-related negativity (FRN)/reward positivity (RewP), are related to PE signaling, but there is disagreement as to whether the FRN/RewP encode signed or unsigned PEs. PE encoding can potentially be dissected by time-frequency analysis, as frontal theta [4-8 Hz] might represent poor outcomes, while central delta [1-3 Hz] might instead represent rewarding outcomes. However, cortical PE signaling in negative reinforcement is still poorly understood, and the role of cortical PE representations in behavioral reinforcement learning following negative reinforcement is relatively unexplored. We recorded EEG while participants completed a task with matched positive and negative reinforcement outcome modalities, with parametrically manipulated single-trial outcomes producing positive and negative PEs. We first demonstrated that PEs systematically influence future behavior in both positive and negative reinforcement conditions. In negative reinforcement conditions, mediofrontal ERPs positively signaled unsigned PEs in a time window encompassing the P2 potential, and negatively signaled signed PEs for a time window encompassing the FRN/RewP and frontal P3 (an "aversion positivity"). Central delta power increased parametrically with increasingly aversive outcomes, contributing to the "aversion positivity". Finally, negative reinforcement ERPs correlated with RTs on the following trial, suggesting cortical PEs guide behavioral adaptations. Positive reinforcement PEs did not influence ERP or time-frequency activity, despite significant behavioral effects. These results demonstrate that mediofrontal PE signals are a mechanism underlying negative reinforcement learning, and that delta power increases for aversive outcomes might contribute to the "aversion positivity."
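
The distinction between signed and unsigned PEs that the abstract turns on can be illustrated with a minimal sketch. This is not the authors' analysis code: the function name `prediction_errors` and the trial values are hypothetical, and outcomes are given in arbitrary integer units. A signed PE keeps the direction of the deviation (better or worse than expected), while an unsigned PE keeps only its magnitude.

```python
def prediction_errors(expected, received):
    """Return (signed, unsigned) prediction errors for a single trial.

    signed   = received - expected   (direction and size of the surprise)
    unsigned = |received - expected| (size of the surprise only)
    """
    signed = received - expected
    return signed, abs(signed)


# Hypothetical trials as (expected outcome, received outcome), arbitrary units.
trials = [(5, 8), (5, 2), (5, 5)]

for expected, received in trials:
    signed, unsigned = prediction_errors(expected, received)
    print(f"expected={expected} received={received} "
          f"signed PE={signed} unsigned PE={unsigned}")
```

In this sketch the first two trials have the same unsigned PE but opposite signed PEs, which is the kind of contrast a parametric manipulation of single-trial outcomes makes it possible to separate.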
