Department of Psychology, Stony Brook University, New York, USA.
Department of Computer Science, Stony Brook University, New York, USA.
Sci Rep. 2021 Apr 22;11(1):8776. doi: 10.1038/s41598-021-87715-9.
Attention control is a basic behavioral process that has been studied for decades. The currently best models of attention control are deep networks trained on free-viewing behavior to predict bottom-up attention control - saliency. We introduce COCO-Search18, the first dataset of laboratory-quality goal-directed behavior large enough to train deep-network models. We collected eye-movement behavior from 10 people searching for each of 18 target-object categories in 6202 natural-scene images, yielding [Formula: see text] 300,000 search fixations. We thoroughly characterize COCO-Search18, and benchmark it using three machine-learning methods: a ResNet50 object detector, a ResNet50 trained on fixation-density maps, and an inverse-reinforcement-learning model trained on behavioral search scanpaths. Models were also trained/tested on images transformed to approximate a foveated retina, a fundamental biological constraint. These models, each having a different reliance on behavioral training, collectively comprise the new state-of-the-art in predicting goal-directed search fixations. Our expectation is that future work using COCO-Search18 will far surpass these initial efforts, finding applications in domains ranging from human-computer interactive systems that can anticipate a person's intent and render assistance to the potentially early identification of attention-related clinical disorders (ADHD, PTSD, phobia) based on deviation from neurotypical fixation behavior.
注意控制是一个基本的行为过程,已经研究了几十年。目前,用于预测自下而上注意控制——显著性的最佳注意控制模型是基于自由观看行为训练的深度网络。我们引入了 COCO-Search18,这是第一个用于训练深度网络模型的实验室质量目标导向行为的大型数据集。我们从 10 个人的眼动行为中收集了信息,他们在 6202 张自然场景图像中搜索了 18 个目标对象类别中的每一个,产生了[公式:见文本]30 万个搜索固定点。我们对 COCO-Search18 进行了全面的描述,并使用三种机器学习方法对其进行了基准测试:ResNet50 目标检测器、基于注视密度图训练的 ResNet50 和基于行为搜索扫描路径训练的逆强化学习模型。这些模型还在转换后的图像上进行了训练/测试,以近似注视点视网膜,这是一种基本的生物约束。这些模型,每个模型对行为训练的依赖程度不同,共同构成了预测目标导向搜索固定点的最新技术。我们期望未来使用 COCO-Search18 的工作将远远超过这些初步努力,在从能够预测人的意图并提供帮助的人机交互系统到基于偏离神经典型注视行为潜在早期识别与注意力相关的临床障碍(ADHD、PTSD、恐惧症)的应用等领域找到应用。