在强化学习任务中对奖励预测误差的主成分分析。

Principal components analysis of reward prediction errors in a reinforcement learning task.

机构信息

Cognition Institute, Department of Psychology, Plymouth University, Plymouth PL4 8AA, UK.

出版信息

Neuroimage. 2016 Jan 1;124(Pt A):276-286. doi: 10.1016/j.neuroimage.2015.07.032. Epub 2015 Jul 18.

DOI:10.1016/j.neuroimage.2015.07.032

PMID:26196667

Abstract

Models of reinforcement learning represent reward and punishment in terms of reward prediction errors (RPEs), quantitative signed terms describing the degree to which outcomes are better than expected (positive RPEs) or worse (negative RPEs). An electrophysiological component known as feedback related negativity (FRN) occurs at frontocentral sites 240-340ms after feedback on whether a reward or punishment is obtained, and has been claimed to neurally encode an RPE. An outstanding question however, is whether the FRN is sensitive to the size of both positive RPEs and negative RPEs. Previous attempts to answer this question have examined the simple effects of RPE size for positive RPEs and negative RPEs separately. However, this methodology can be compromised by overlap from components coding for unsigned prediction error size, or "salience", which are sensitive to the absolute size of a prediction error but not its valence. In our study, positive and negative RPEs were parametrically modulated using both reward likelihood and magnitude, with principal components analysis used to separate out overlying components. This revealed a single RPE encoding component responsive to the size of positive RPEs, peaking at ~330ms, and occupying the delta frequency band. Other components responsive to unsigned prediction error size were shown, but no component sensitive to negative RPE size was found.

摘要

强化学习模型用奖励预测误差（RPE）来表示奖励和惩罚，这是一个定量的有符号术语，用于描述结果比预期好（正 RPE）或差（负 RPE）的程度。一种被称为反馈相关负波（FRN）的电生理成分在前额中央部位出现，时间在反馈是否获得奖励或惩罚之后 240-340 毫秒，据称它可以对 RPE 进行神经编码。然而，一个悬而未决的问题是，FRN 是否对正 RPE 和负 RPE 的大小都敏感。以前试图回答这个问题的尝试分别检查了正 RPE 和负 RPE 的 RPE 大小的简单效应。然而，这种方法可能会受到编码无符号预测误差大小（或“显着性”）的组件的重叠影响，这些组件对预测误差的绝对大小敏感，但对其效价不敏感。在我们的研究中，使用奖励可能性和幅度对正 RPE 和负 RPE 进行参数调制，使用主成分分析来分离重叠的组件。这揭示了一个对正 RPE 大小敏感的单一 RPE 编码组件，峰值约为 330ms，并占据了 delta 频带。还显示了对无符号预测误差大小敏感的其他组件，但没有发现对负 RPE 大小敏感的组件。

相似文献

Principal components analysis of reward prediction errors in a reinforcement learning task.

Neuroimage. 2016 Jan 1;124(Pt A):276-286. doi: 10.1016/j.neuroimage.2015.07.032. Epub 2015 Jul 18.

Mediofrontal event-related potentials in response to positive, negative and unsigned prediction errors.

Neuropsychologia. 2014 Aug;61:1-10. doi: 10.1016/j.neuropsychologia.2014.06.004. Epub 2014 Jun 16.

Valence-separated representation of reward prediction error in feedback-related negativity and positivity.

Neuroreport. 2015 Feb 11;26(3):157-62. doi: 10.1097/WNR.0000000000000318.

When the outcome is different than expected: Subjective expectancy shapes reward prediction error at the FRN level.

Psychophysiology. 2019 Dec;56(12):e13456. doi: 10.1111/psyp.13456. Epub 2019 Aug 12.

J Cogn Neurosci. 2016 Aug;28(8):1127-38. doi: 10.1162/jocn_a_00957. Epub 2016 Mar 31.

Neuroimage. 2014 Jan 1;84:159-68. doi: 10.1016/j.neuroimage.2013.08.028. Epub 2013 Aug 23.

Oscillatory signatures of reward prediction errors in declarative learning.

Neuroimage. 2019 Feb 1;186:137-145. doi: 10.1016/j.neuroimage.2018.10.083. Epub 2018 Nov 2.

Manipulation of feedback expectancy and valence induces negative and positive reward prediction error signals manifest in event-related brain potentials.

Psychophysiology. 2011 May;48(5):656-64. doi: 10.1111/j.1469-8986.2010.01136.x. Epub 2010 Oct 5.

Aberrant reward prediction errors in young adult at-risk alcohol users.

Addict Biol. 2021 Jan;26(1):e12873. doi: 10.1111/adb.12873. Epub 2020 Jan 23.

The aversion positivity: Mediofrontal cortical potentials reflect parametric aversive prediction errors and drive behavioral modification following negative reinforcement.

Cortex. 2021 Jul;140:26-39. doi: 10.1016/j.cortex.2021.03.012. Epub 2021 Mar 27.

引用本文的文献

Perceptual load modulates the delta oscillation and the contribution of delta oscillation to reward positivity during feedback valence encoding.

Front Psychol. 2025 Aug 15;16:1658756. doi: 10.3389/fpsyg.2025.1658756. eCollection 2025.

Sensation seeking and risk adjustment: the role of reward sensitivity in dynamic risky decisions.

Front Behav Neurosci. 2025 Feb 7;19:1492312. doi: 10.3389/fnbeh.2025.1492312. eCollection 2025.

Emotion regulation strategies explain associations of theta and Beta with positive affect.

Psychophysiology. 2025 Jan;62(1):e14745. doi: 10.1111/psyp.14745.

Prediction-error-dependent processing of immediate and delayed positive feedback.

Sci Rep. 2024 Apr 27;14(1):9674. doi: 10.1038/s41598-024-60328-8.

Neural Correlates of Social Decision-Making.

Iran J Psychiatry. 2024 Jan;19(1):148-154. doi: 10.18502/ijps.v19i1.14350.

Lack of effect of methamphetamine on reward-related brain activity in healthy adults.

Psychopharmacology (Berl). 2024 Jan;241(1):181-193. doi: 10.1007/s00213-023-06475-8. Epub 2023 Dec 23.

Neural dissociation between reward and salience prediction errors through the lens of optimistic bias.

Hum Brain Mapp. 2023 Aug 15;44(12):4545-4560. doi: 10.1002/hbm.26398. Epub 2023 Jun 19.

Exploring Neural Mechanisms of Reward Processing Using Coupled Matrix Tensor Factorization: A Simultaneous EEG-fMRI Investigation.

Brain Sci. 2023 Mar 13;13(3):485. doi: 10.3390/brainsci13030485.

Effects of subjective and objective task difficulties for feedback- related brain potentials in social situations: An electroencephalogram study.

PLoS One. 2022 Dec 1;17(12):e0277663. doi: 10.1371/journal.pone.0277663. eCollection 2022.

TSMG: A Deep Learning Framework for Recognizing Human Learning Style Using EEG Signals.

Brain Sci. 2021 Oct 24;11(11):1397. doi: 10.3390/brainsci11111397.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在强化学习任务中对奖励预测误差的主成分分析。

Principal components analysis of reward prediction errors in a reinforcement learning task.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献