额部θ节律将预测误差与强化学习中的行为适应联系起来。

Frontal theta links prediction errors to behavioral adaptation in reinforcement learning.

机构信息

Department of Psychology, University of Arizona, Tucson, AZ, USA.

出版信息

Neuroimage. 2010 Feb 15;49(4):3198-209. doi: 10.1016/j.neuroimage.2009.11.080. Epub 2009 Dec 5.

DOI:10.1016/j.neuroimage.2009.11.080

PMID:19969093

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2818688/

Abstract

Investigations into action monitoring have consistently detailed a frontocentral voltage deflection in the event-related potential (ERP) following the presentation of negatively valenced feedback, sometimes termed the feedback-related negativity (FRN). The FRN has been proposed to reflect a neural response to prediction errors during reinforcement learning, yet the single-trial relationship between neural activity and the quanta of expectation violation remains untested. Although ERP methods are not well suited to single-trial analyses, the FRN has been associated with theta band oscillatory perturbations in the medial prefrontal cortex. Mediofrontal theta oscillations have been previously associated with expectation violation and behavioral adaptation and are well suited to single-trial analysis. Here, we recorded EEG activity during a probabilistic reinforcement learning task and fit the performance data to an abstract computational model (Q-learning) for calculation of single-trial reward prediction errors. Single-trial theta oscillatory activities following feedback were investigated within the context of expectation (prediction error) and adaptation (subsequent reaction time change). Results indicate that interactive medial and lateral frontal theta activities reflect the degree of negative and positive reward prediction error in the service of behavioral adaptation. These different brain areas use prediction error calculations for different behavioral adaptations, with medial frontal theta reflecting the utilization of prediction errors for reaction time slowing (specifically following errors), but lateral frontal theta reflecting prediction errors leading to working memory-related reaction time speeding for the correct choice.

摘要

对动作监控的研究一直详细描述了事件相关电位（ERP）中呈现负效价反馈后的额前电压偏转，有时称为反馈相关负性（FRN）。FRN 被提出反映了强化学习过程中对预测误差的神经反应，但神经活动与预期违反的量子之间的单次试验关系仍未得到检验。尽管 ERP 方法不适合单次试验分析，但 FRN 与内侧前额叶皮层中的θ波段振荡干扰有关。中额前θ 振荡先前与预期违反和行为适应有关，非常适合单次试验分析。在这里，我们在概率强化学习任务期间记录 EEG 活动，并将性能数据拟合到抽象计算模型（Q-learning）中，以计算单次试验奖励预测误差。在预期（预测误差）和适应（随后的反应时间变化）的背景下研究了反馈后的单次试验θ振荡活动。结果表明，交互性的内侧和外侧额前θ 活动反映了负性和正性奖励预测误差的程度，以适应行为。这些不同的大脑区域使用预测误差进行不同的行为适应，内侧额前θ 反映了预测误差用于反应时间减慢（特别是在错误之后）的利用，但外侧额前θ 反映了预测误差导致与工作记忆相关的正确选择的反应时间加快。

相似文献

Frontal theta links prediction errors to behavioral adaptation in reinforcement learning.

Neuroimage. 2010 Feb 15;49(4):3198-209. doi: 10.1016/j.neuroimage.2009.11.080. Epub 2009 Dec 5.

The aversion positivity: Mediofrontal cortical potentials reflect parametric aversive prediction errors and drive behavioral modification following negative reinforcement.

Cortex. 2021 Jul;140:26-39. doi: 10.1016/j.cortex.2021.03.012. Epub 2021 Mar 27.

Mood congruent tuning of reward expectation in positive mood: evidence from FRN and theta modulations.

Soc Cogn Affect Neurosci. 2017 May 1;12(5):765-774. doi: 10.1093/scan/nsx010.

Right frontal cortex generates reward-related theta-band oscillatory activity.

Neuroimage. 2009 Nov 1;48(2):415-22. doi: 10.1016/j.neuroimage.2009.06.076. Epub 2009 Jul 8.

Frontal theta oscillatory activity is a common mechanism for the computation of unexpected outcomes and learning rate.

J Cogn Neurosci. 2014 Mar;26(3):447-58. doi: 10.1162/jocn_a_00516. Epub 2013 Nov 4.

Theta lingua franca: a common mid-frontal substrate for action monitoring processes.

Psychophysiology. 2012 Feb;49(2):220-38. doi: 10.1111/j.1469-8986.2011.01293.x. Epub 2011 Sep 26.

Neuroimage. 2011 Apr 1;55(3):1373-83. doi: 10.1016/j.neuroimage.2010.12.072. Epub 2010 Dec 31.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Frontal theta reflects uncertainty and unexpectedness during exploration and exploitation.

Cereb Cortex. 2012 Nov;22(11):2575-86. doi: 10.1093/cercor/bhr332. Epub 2011 Nov 25.

Electrophysiological correlates of feedback processing in subarachnoid hemorrhage patients.

Neuroimage Clin. 2019;24:102075. doi: 10.1016/j.nicl.2019.102075. Epub 2019 Nov 5.

引用本文的文献

How working memory and reinforcement learning interact when avoiding punishment and pursuing reward concurrently.

J Exp Psychol Gen. 2025 Sep 1. doi: 10.1037/xge0001817.

Neural Correlates of Burnout Syndrome Based on Electroencephalography (EEG)-A Mechanistic Review and Discussion of Burnout Syndrome Cognitive Bias Theory.

J Clin Med. 2025 Jul 29;14(15):5357. doi: 10.3390/jcm14155357.

Oscillatory signatures of monitoring and anticipatory strategies for probabilistic vs deterministic cues.

Imaging Neurosci (Camb). 2025 Mar 7;3. doi: 10.1162/imag_a_00496. eCollection 2025.

Dissecting how psychopathic traits are linked to learning in different contexts: a multilevel computational and electrophysiological approach.

Cogn Affect Behav Neurosci. 2025 Jul 23. doi: 10.3758/s13415-025-01295-z.

The effect of targeting and interceptive timing tasks on the brain waves of elite and educated athletes.

PLoS One. 2025 Jul 23;20(7):e0321539. doi: 10.1371/journal.pone.0321539. eCollection 2025.

A Negative Reputation Reduces Trust Despite Trustworthy Behavior.

Psychophysiology. 2025 Jul;62(7):e70102. doi: 10.1111/psyp.70102.

Behavioral and electrocortical effects of transcranial alternating current stimulation during advice-guided decision-making.

Neuroimage Rep. 2021 Sep 10;1(4):100052. doi: 10.1016/j.ynirp.2021.100052. eCollection 2021 Dec.

Midfrontal mechanisms of performance monitoring continuously adapt to incoming information during outcome anticipation.

Neuroimage Rep. 2023 Sep 4;3(3):100182. doi: 10.1016/j.ynirp.2023.100182. eCollection 2023 Sep.

Response-locked theta dissociations reveal potential feedback signal following successful retrieval.

Imaging Neurosci (Camb). 2024 Jun 27;2:1-16. doi: 10.1162/imag_a_00207. eCollection 2024 Jun 1.

Sensation seeking and risk adjustment: the role of reward sensitivity in dynamic risky decisions.

Front Behav Neurosci. 2025 Feb 7;19:1492312. doi: 10.3389/fnbeh.2025.1492312. eCollection 2025.

本文引用的文献

J Cogn Neurosci. 1997 Nov;9(6):788-98. doi: 10.1162/jocn.1997.9.6.788.

Detecting alpha rhythm phase reset by phase sorting: caveats to consider.

Neuroimage. 2009 Aug 1;47(1):1-4. doi: 10.1016/j.neuroimage.2009.04.031. Epub 2009 Apr 16.

When is an error not a prediction error? An electrophysiological investigation.

Cogn Affect Behav Neurosci. 2009 Mar;9(1):59-70. doi: 10.3758/CABN.9.1.59.

Prelude to and resolution of an error: EEG phase synchrony reveals cognitive control dynamics during action monitoring.

J Neurosci. 2009 Jan 7;29(1):98-105. doi: 10.1523/JNEUROSCI.4137-08.2009.

Medial frontal cortex and response conflict: evidence from human intracranial EEG and medial frontal cortex lesion.

Brain Res. 2008 Oct 31;1238:127-42. doi: 10.1016/j.brainres.2008.07.114. Epub 2008 Aug 7.

Axiomatic methods, dopamine and reward prediction error.

Curr Opin Neurobiol. 2008 Apr;18(2):197-202. doi: 10.1016/j.conb.2008.07.007. Epub 2008 Aug 12.

Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making.

Cereb Cortex. 2009 Feb;19(2):483-95. doi: 10.1093/cercor/bhn098. Epub 2008 Jun 11.

The feedback correct-related positivity: sensitivity of the event-related brain potential to unexpected positive feedback.

Psychophysiology. 2008 Sep;45(5):688-97. doi: 10.1111/j.1469-8986.2008.00668.x. Epub 2008 May 30.

The electrophysiological dynamics of interference during the Stroop task.

J Cogn Neurosci. 2008 Feb;20(2):215-25. doi: 10.1162/jocn.2008.20020.

Cross-task individual differences in error processing: neural, electrophysiological, and genetic components.

Cogn Affect Behav Neurosci. 2007 Dec;7(4):297-308. doi: 10.3758/cabn.7.4.297.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

额部θ节律将预测误差与强化学习中的行为适应联系起来。

Frontal theta links prediction errors to behavioral adaptation in reinforcement learning.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献