Weighting the factors affecting attention guidance during free viewing and visual search: The unexpected role of object recognition uncertainty.

Authors

Chakraborty Souradeep, Samaras Dimitris, Zelinsky Gregory J

Affiliations

Department of Computer Science, Stony Brook University, Stony Brook, NY, USA.

Department of Psychology, Stony Brook University, Stony Brook, NY, USA.

Publication information

J Vis. 2022 Mar 2;22(4):13. doi: 10.1167/jov.22.4.13.

DOI:10.1167/jov.22.4.13

PMID:35323870

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8963662/

Abstract

The factors determining how attention is allocated during visual tasks have been studied for decades, but few studies have attempted to model the weighting of several of these factors within and across tasks to better understand their relative contributions. Here we consider the roles of saliency, center bias, target features, and object recognition uncertainty in predicting the first nine changes in fixation made during free viewing and visual search tasks in the OSIE and COCO-Search18 datasets, respectively. We focus on the latter-most and least familiar of these factors by proposing a new method of quantifying uncertainty in an image, one based on object recognition. We hypothesize that the greater the number of object categories competing for an object proposal, the greater the uncertainty of how that object should be recognized and, hence, the greater the need for attention to resolve this uncertainty. As expected, we found that target features best predicted target-present search, with their dominance obscuring the use of other features. Unexpectedly, we found that target features were only weakly used during target-absent search. We also found that object recognition uncertainty outperformed an unsupervised saliency model in predicting free-viewing fixations, although saliency was slightly more predictive of search. We conclude that uncertainty in object recognition, a measure that is image computable and highly interpretable, is better than bottom-up saliency in predicting attention during free viewing.
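The two ideas in the abstract, scoring recognition uncertainty for an object proposal and weighting several factors into one prediction, can be illustrated with a minimal sketch. The abstract does not specify the computation, so everything here is an assumption for illustration: uncertainty is taken to be the Shannon entropy of the normalized category scores for a proposal, the factor maps are combined by a plain weighted sum, and the function names `recognition_uncertainty` and `combine_priority_maps` are hypothetical. This is not the authors' implementation.

```python
import math


def recognition_uncertainty(class_scores):
    """Shannon entropy (in bits) of the category scores for one object proposal.

    Assumption: more categories competing with similar scores means higher
    entropy, which is used here as a stand-in for recognition uncertainty.
    """
    if any(s < 0 for s in class_scores):
        raise ValueError("class scores must be non-negative")
    total = sum(class_scores)
    if total <= 0:
        raise ValueError("at least one class score must be positive")
    # Normalize to a probability distribution; zero scores contribute nothing.
    probs = [s / total for s in class_scores if s > 0]
    return -sum(p * math.log2(p) for p in probs)


def combine_priority_maps(maps, weights):
    """Weighted sum of equally sized 2D maps given as lists of lists.

    Assumption: each map (e.g. saliency, center bias, target features,
    uncertainty) has the same shape, and the weights are hypothetical values
    that would normally be fit to fixation data.
    """
    if len(maps) != len(weights):
        raise ValueError("need exactly one weight per map")
    rows, cols = len(maps[0]), len(maps[0][0])
    combined = [[0.0] * cols for _ in range(rows)]
    for factor_map, weight in zip(maps, weights):
        for r in range(rows):
            for c in range(cols):
                combined[r][c] += weight * factor_map[r][c]
    return combined


# One confident category versus four equally competing categories.
confident = recognition_uncertainty([1.0, 0.0, 0.0, 0.0])
ambiguous = recognition_uncertainty([0.25, 0.25, 0.25, 0.25])
```

Under these assumptions a proposal with a single dominant category scores zero bits, while four equally likely categories score two bits, which matches the intuition that more competing categories call for more attention.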

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7beb/8963662/b744fa5c11b1/jovi-22-4-13-f001.jpg
