Vullings Cécile, Madelain Laurent
Université de Lille, CNRS, CHU Lille, UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Lille, France.
Aix Marseille Université, CNRS, Institut de Neurosciences de la Timone, UMR 7289, Marseille, France.
J Vis. 2019 Mar 1;19(3):16. doi: 10.1167/19.3.16.
Recent studies have demonstrated that saccadic reaction times (SRTs) are influenced by the temporal regularities of dynamic environments (Vullings & Madelain, 2018). Here, we ask whether discriminative control (i.e., the possibility to use external stimuli signaling the future state of the environment) of latencies in a search task might be established using reinforcement contingencies. Eight participants made saccades within 80-750 ms toward a target displayed among distractors. We constructed two latency classes, "short" and "long," using the first and last quartiles of the individual baseline distributions. We then used a latency-contingent display paradigm in which finding the visual target among other items was made contingent upon specific SRTs. For a first group, the postsaccadic target was displayed only following short latencies with leftward saccades, and following long latencies with rightward saccades. The opposite was true for a second group. When short- and long-latency saccades were reinforced (i.e., the target was displayed) depending on the saccade direction, median latencies differed by 74 ms on average (all outside the 98% null hypothesis confidence intervals). Posttraining, in the absence of reinforcement, we still observed strong differences in latency distributions, averaging 64 ms for leftward versus rightward saccades. Our results demonstrate the discriminative control of SRTs, further supporting the effects of reinforcement learning for saccade. This study reveals that saccade triggering is finely controlled by learned temporal and spatial properties of the environment using predictive mechanisms.
最近的研究表明,扫视反应时间(SRTs)受动态环境的时间规律影响(武林斯和马德莱恩,2018年)。在此,我们探讨在搜索任务中,是否可以利用强化意外情况来建立对潜伏期的辨别控制(即利用外部刺激信号指示环境未来状态的可能性)。八名参与者在80 - 750毫秒内对干扰物中显示的目标进行扫视。我们使用个体基线分布的第一个和最后一个四分位数构建了两个潜伏期类别,即“短”和“长”。然后,我们采用了一种潜伏期依赖显示范式,其中在其他项目中找到视觉目标取决于特定的SRTs。对于第一组,仅在向左扫视的短潜伏期之后以及向右扫视的长潜伏期之后显示扫视后的目标。第二组情况相反。当根据扫视方向对短潜伏期和长潜伏期扫视进行强化(即显示目标)时,平均中位数潜伏期相差74毫秒(均在98%的零假设置信区间之外)。训练后,在没有强化的情况下,我们仍然观察到潜伏期分布存在显著差异,向左扫视和向右扫视的平均差异为64毫秒。我们的结果证明了对SRTs的辨别控制,进一步支持了强化学习对扫视的影响。这项研究表明,扫视触发是通过预测机制,由学习到的环境时间和空间属性精细控制的。