Department of Psychology, University of Arizona, Tucson, AZ, USA.
Neuroimage. 2010 Feb 15;49(4):3198-209. doi: 10.1016/j.neuroimage.2009.11.080. Epub 2009 Dec 5.
Investigations into action monitoring have consistently detailed a frontocentral voltage deflection in the event-related potential (ERP) following the presentation of negatively valenced feedback, sometimes termed the feedback-related negativity (FRN). The FRN has been proposed to reflect a neural response to prediction errors during reinforcement learning, yet the single-trial relationship between neural activity and the quanta of expectation violation remains untested. Although ERP methods are not well suited to single-trial analyses, the FRN has been associated with theta band oscillatory perturbations in the medial prefrontal cortex. Mediofrontal theta oscillations have been previously associated with expectation violation and behavioral adaptation and are well suited to single-trial analysis. Here, we recorded EEG activity during a probabilistic reinforcement learning task and fit the performance data to an abstract computational model (Q-learning) for calculation of single-trial reward prediction errors. Single-trial theta oscillatory activities following feedback were investigated within the context of expectation (prediction error) and adaptation (subsequent reaction time change). Results indicate that interactive medial and lateral frontal theta activities reflect the degree of negative and positive reward prediction error in the service of behavioral adaptation. These different brain areas use prediction error calculations for different behavioral adaptations, with medial frontal theta reflecting the utilization of prediction errors for reaction time slowing (specifically following errors), but lateral frontal theta reflecting prediction errors leading to working memory-related reaction time speeding for the correct choice.
对动作监控的研究一直详细描述了事件相关电位(ERP)中呈现负效价反馈后的额前电压偏转,有时称为反馈相关负性(FRN)。FRN 被提出反映了强化学习过程中对预测误差的神经反应,但神经活动与预期违反的量子之间的单次试验关系仍未得到检验。尽管 ERP 方法不适合单次试验分析,但 FRN 与内侧前额叶皮层中的θ波段振荡干扰有关。中额前θ 振荡先前与预期违反和行为适应有关,非常适合单次试验分析。在这里,我们在概率强化学习任务期间记录 EEG 活动,并将性能数据拟合到抽象计算模型(Q-learning)中,以计算单次试验奖励预测误差。在预期(预测误差)和适应(随后的反应时间变化)的背景下研究了反馈后的单次试验θ振荡活动。结果表明,交互性的内侧和外侧额前θ 活动反映了负性和正性奖励预测误差的程度,以适应行为。这些不同的大脑区域使用预测误差进行不同的行为适应,内侧额前θ 反映了预测误差用于反应时间减慢(特别是在错误之后)的利用,但外侧额前θ 反映了预测误差导致与工作记忆相关的正确选择的反应时间加快。