Suppr超能文献

权衡自由观看和视觉搜索过程中影响注意力引导的因素:物体识别不确定性的意外作用。

Weighting the factors affecting attention guidance during free viewing and visual search: The unexpected role of object recognition uncertainty.

作者信息

Chakraborty Souradeep, Samaras Dimitris, Zelinsky Gregory J

机构信息

Department of Computer Science, Stony Brook University, Stony Brook, NY, USA.

Department of Psychology, Stony Brook University, Stony Brook, NY, USA.

出版信息

J Vis. 2022 Mar 2;22(4):13. doi: 10.1167/jov.22.4.13.

Abstract

The factors determining how attention is allocated during visual tasks have been studied for decades, but few studies have attempted to model the weighting of several of these factors within and across tasks to better understand their relative contributions. Here we consider the roles of saliency, center bias, target features, and object recognition uncertainty in predicting the first nine changes in fixation made during free viewing and visual search tasks in the OSIE and COCO-Search18 datasets, respectively. We focus on the latter-most and least familiar of these factors by proposing a new method of quantifying uncertainty in an image, one based on object recognition. We hypothesize that the greater the number of object categories competing for an object proposal, the greater the uncertainty of how that object should be recognized and, hence, the greater the need for attention to resolve this uncertainty. As expected, we found that target features best predicted target-present search, with their dominance obscuring the use of other features. Unexpectedly, we found that target features were only weakly used during target-absent search. We also found that object recognition uncertainty outperformed an unsupervised saliency model in predicting free-viewing fixations, although saliency was slightly more predictive of search. We conclude that uncertainty in object recognition, a measure that is image computable and highly interpretable, is better than bottom-up saliency in predicting attention during free viewing.

摘要

几十年来,人们一直在研究视觉任务中注意力分配方式的决定因素,但很少有研究尝试对这些因素在任务内和任务间的权重进行建模,以更好地理解它们的相对贡献。在这里,我们分别考虑显著性、中心偏差、目标特征和物体识别不确定性在预测OSIE和COCO-Search18数据集中自由观看和视觉搜索任务期间首次出现的九次注视变化中的作用。我们通过提出一种基于物体识别的量化图像不确定性的新方法,关注这些因素中最不熟悉的后者。我们假设,竞争物体提议的物体类别数量越多,该物体应如何被识别的不确定性就越大,因此,解决这种不确定性就越需要注意力。不出所料,我们发现目标特征最能预测目标存在的搜索,其主导地位掩盖了其他特征的使用。出乎意料的是,我们发现在目标不存在的搜索中,目标特征的使用程度很低。我们还发现,在预测自由观看注视时,物体识别不确定性优于无监督显著性模型,尽管显著性在搜索预测中略胜一筹。我们得出结论,物体识别中的不确定性,一种可通过图像计算且高度可解释的度量,在预测自由观看期间的注意力方面比自下而上的显著性更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7beb/8963662/b744fa5c11b1/jovi-22-4-13-f001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验