Zelinsky Gregory J, Chen Yupei, Ahn Seoyoung, Adeli Hossein, Yang Zhibo, Huang Lihan, Samaras Dimitrios, Hoai Minh
Department of Psychology, Stony Brook University, Stony Brook, NY, 11794, USA.
Department of Computer Science, Stony Brook University, Stony Brook, NY, 11794, USA.
Neuron Behav Data Anal Theory. 2021;2021. doi: 10.51628/001c.22322. Epub 2021 Apr 20.
Understanding how goals control behavior is a question ripe for interrogation by new methods from machine learning. These methods require large, labeled datasets to train models. To annotate a large-scale image dataset with observed search fixations, we collected 16,184 fixations from people searching for either microwaves or clocks in a dataset of 4,366 images (MS-COCO). We then used this behaviorally annotated dataset and the machine learning method of inverse reinforcement learning (IRL) to learn target-specific reward functions and policies for these two target goals. Finally, we used these learned policies to predict the fixations of 60 new behavioral searchers (clock = 30, microwave = 30) in a disjoint test dataset of kitchen scenes depicting both a microwave and a clock (thus controlling for differences in low-level image contrast). We found that the IRL model predicted behavioral search efficiency and fixation-density maps, as assessed by multiple metrics. Moreover, reward maps from the IRL model revealed target-specific patterns that suggest guidance of attention not just by target features but also by scene context (e.g., fixations along walls in the search for clocks). Using machine learning and the psychologically meaningful principle of reward, it is possible to learn the visual features used in goal-directed attention control.
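As a rough illustration of what comparing fixation-density maps can involve, the sketch below builds a density map from a list of fixation coordinates and scores the agreement between two maps with a Pearson correlation. It is not the authors' pipeline: the abstract does not name the metrics used, so the correlation score, the Gaussian width `sigma`, the function names, and the example coordinates are all assumptions chosen for illustration.

```python
import numpy as np


def fixation_density_map(fixations, height, width, sigma=25.0):
    """Sum an isotropic Gaussian at each (x, y) fixation; normalize to sum to 1.

    `sigma` (in pixels) is an assumed smoothing width, not a value from the paper.
    """
    ys, xs = np.mgrid[0:height, 0:width]
    density = np.zeros((height, width), dtype=float)
    for x, y in fixations:
        density += np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2.0 * sigma ** 2))
    total = density.sum()
    return density / total if total > 0 else density


def pearson_cc(map_a, map_b):
    """Pearson correlation between two maps of equal shape (an assumed metric)."""
    a = map_a.ravel() - map_a.mean()
    b = map_b.ravel() - map_b.mean()
    denom = np.sqrt((a ** 2).sum() * (b ** 2).sum())
    return float((a * b).sum() / denom) if denom > 0 else 0.0


# Hypothetical fixations in (x, y) pixel coordinates for one 160 x 120 image.
human_fixations = [(40, 30), (42, 28), (100, 60)]
model_fixations = [(41, 31), (98, 62)]

human_map = fixation_density_map(human_fixations, height=120, width=160, sigma=10.0)
model_map = fixation_density_map(model_fixations, height=120, width=160, sigma=10.0)
score = pearson_cc(human_map, model_map)
```

A score near 1 would indicate that the two maps place their density in similar image regions. In practice, several complementary metrics are usually reported, since any single score is sensitive to the smoothing width.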