Catholic University of Eichstätt-Ingolstadt, Ostenstraße 27, 85072, Eichstätt, Germany.
Cogn Affect Behav Neurosci. 2020 Oct;20(5):1070-1089. doi: 10.3758/s13415-020-00820-6.
Decision making relies on the interplay between two distinct learning mechanisms, namely habitual model-free learning and goal-directed model-based learning. Recent literature suggests that this interplay is significantly shaped by the environmental structure as represented by an internal model. We employed a modified two-stage but one-decision Markov decision task to investigate how two internal models differing in the predictability of stage transitions influence the neural correlates of feedback processing. Our results demonstrate that fronto-central theta and the feedback-related negativity (FRN), two correlates of reward prediction errors in the medial frontal cortex, are independent of the internal representations of the environmental structure. In contrast, centro-parietal delta and the P3, two correlates possibly reflecting feedback evaluation in working memory, were highly susceptible to the underlying internal model. Model-based analyses of single-trial activity showed a comparable pattern, indicating that while the computation of unsigned reward prediction errors is represented by theta and the FRN irrespective of the internal models, the P3 adapts to the internal representation of an environment. Our findings further substantiate the assumption that the feedback-locked components under investigation reflect distinct mechanisms of feedback processing and that different internal models selectively influence these mechanisms.
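To make the notion of a signed versus unsigned reward prediction error concrete, here is a minimal sketch based on a simple delta-rule (Rescorla-Wagner style) value update. The abstract does not specify which computational model the authors fitted, so the update rule, the function name `prediction_errors`, and the `learning_rate` and `initial_value` parameters are illustrative assumptions rather than the study's method.

```python
def prediction_errors(rewards, learning_rate=0.1, initial_value=0.0):
    """Return per-trial signed and unsigned reward prediction errors.

    Assumption: a single expected value is updated with a delta rule.
    This is a generic illustration, not the model used in the study.
    """
    value = initial_value
    signed, unsigned = [], []
    for reward in rewards:
        delta = reward - value          # signed prediction error
        signed.append(delta)
        unsigned.append(abs(delta))     # unsigned error: magnitude of surprise only
        value += learning_rate * delta  # move the expectation toward the outcome
    return signed, unsigned


# Hypothetical outcome sequence: one rewarded trial, then one unrewarded trial.
signed, unsigned = prediction_errors([1, 0], learning_rate=0.5)
print(signed, unsigned)
```

In a single-trial analysis of this kind, per-trial values such as `unsigned` would typically serve as regressors for trial-wise EEG measures (for example, theta power or FRN amplitude); how that regression was set up in the study is not stated in the abstract.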