Texas A&M University, United States.
Cognition. 2018 Sep;178:26-36. doi: 10.1016/j.cognition.2018.05.005. Epub 2018 May 11.
The role of associative reward learning in the guidance of feature-based attention is well established. The extent to which reward learning can modulate spatial attention has been much more controversial. At least one demonstration of a persistent spatial attention bias following space-based associative reward learning has been reported. At the same time, multiple published experiments have failed to demonstrate enduring attentional biases towards locations at which a target, if found, yields high reward. These null results stand in spite of evidence that participants use reward structures to inform their decisions about where to search, and they have led some to suggest that, unlike feature-based attention, spatial attention may be impervious to the influence of learning from reward structures. Here, we demonstrate a robust bias towards regions of a scene that participants were previously rewarded for selecting. This spatial bias relies on representations that are anchored to the configuration of objects within a scene. The observed bias appears to be driven specifically by reinforcement learning, and can be observed with equal strength following non-reward corrective feedback. The time course of the bias is consistent with a transient shift of attention, rather than a strategic search pattern, and is evident in eye movement patterns during free viewing. Taken together, our findings reconcile previously conflicting reports and offer an integrative account of how learning from feedback shapes the spatial attention system.