Target-absent Human Attention.

Authors

Yang Zhibo, Mondal Sounak, Ahn Seoyoung, Zelinsky Gregory, Hoai Minh, Samaras Dimitris

Affiliation

Stony Brook University, Stony Brook, NY 11794, USA.

Publication

Comput Vis ECCV. 2022 Oct;13664:52-68. doi: 10.1007/978-3-031-19772-7_4. Epub 2022 Oct 23.

DOI:10.1007/978-3-031-19772-7_4
PMID:38144433
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10745181/
Abstract

The prediction of human gaze behavior is important for building human-computer interaction systems that can anticipate the user's attention. Computer vision models have been developed to predict the fixations made by people as they search for target objects. But what about when the target is not in the image? Equally important is to know how people search when they cannot find a target, and when they would stop searching. In this paper, we propose a data-driven computational model that addresses the search-termination problem and predicts the scanpath of search fixations made by people searching for targets that do not appear in images. We model visual search as an imitation learning problem and represent the internal knowledge that the viewer acquires through fixations using a novel state representation that we call Foveated Feature Maps (FFMs). FFMs integrate a simulated foveated retina into a pretrained ConvNet that produces an in-network feature pyramid, all with minimal computational overhead. Our method integrates FFMs as the state representation in inverse reinforcement learning. Experimentally, we improve the state of the art in predicting human target-absent search behavior on the COCO-Search18 dataset. Code is available at: https://github.com/cvlab-stonybrook/Target-absent-Human-Attention.
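
To make the idea of a foveated state representation more concrete, here is a minimal sketch of one way a feature pyramid could be blended by distance from past fixations: fine levels near a fixation, coarse levels far from it. This is an illustrative assumption, not the authors' implementation. The function name `foveated_blend`, the `sigma` falloff parameter, the linear interpolation between levels, and the use of single-channel maps already resized to a common grid are all hypothetical choices made for the example.

```python
import math


def foveated_blend(pyramid, fixations, sigma=2.0):
    """Blend same-sized 2D maps, ordered fine -> coarse, by eccentricity.

    pyramid:   list of H x W maps (lists of lists), already resized to one grid
    fixations: list of (row, col) fixation locations on that grid
    sigma:     hypothetical scale for how fast resolution falls off (assumption)
    """
    levels = len(pyramid)
    h, w = len(pyramid[0]), len(pyramid[0][0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            # Eccentricity: distance to the nearest fixation (infinite if none).
            ecc = min(
                (math.hypot(r - fr, c - fc) for fr, fc in fixations),
                default=math.inf,
            )
            # Map eccentricity to a fractional pyramid level, capped at coarsest.
            level = min(ecc / sigma, levels - 1)
            lo = int(math.floor(level))
            hi = min(lo + 1, levels - 1)
            t = level - lo
            # Linear interpolation between the two neighbouring levels.
            out[r][c] = (1 - t) * pyramid[lo][r][c] + t * pyramid[hi][r][c]
    return out


# Toy pyramid: level k is a constant map with value k, so the output
# directly shows which (fractional) level each cell was drawn from.
toy_pyramid = [[[float(k)] * 4 for _ in range(4)] for k in range(3)]
state = foveated_blend(toy_pyramid, fixations=[(0, 0)], sigma=1.0)
```

In this toy setup, each new fixation would sharpen the region around it, so the blended map acts as a running record of what the viewer has seen at high resolution. In the abstract's framing, a representation of this kind serves as the state for inverse reinforcement learning.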

Similar articles

1. Target-absent Human Attention.
   Comput Vis ECCV. 2022 Oct;13664:52-68. doi: 10.1007/978-3-031-19772-7_4. Epub 2022 Oct 23.
2. Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning.
   Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2020 Jun;2020:190-199. doi: 10.1109/cvpr42600.2020.00027. Epub 2020 Aug 5.
3. COCO-Search18 fixation dataset for predicting goal-directed attention control.
   Sci Rep. 2021 Apr 22;11(1):8776. doi: 10.1038/s41598-021-87715-9.
4. Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning.
   Neuron Behav Data Anal Theory. 2021;2021. doi: 10.51628/001c.22322. Epub 2021 Apr 20.
5. Visual Scanpath Prediction Using IOR-ROI Recurrent Mixture Density Network.
   IEEE Trans Pattern Anal Mach Intell. 2021 Jun;43(6):2101-2118. doi: 10.1109/TPAMI.2019.2956930. Epub 2021 May 11.
6. Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning.
   IEEE Trans Image Process. 2021;30:2087-2102. doi: 10.1109/TIP.2021.3050861. Epub 2021 Jan 21.
7. There's Waldo! A Normalization Model of Visual Search Predicts Single-Trial Human Fixations in an Object Search Task.
   Cereb Cortex. 2016 Jul;26(7):3064-82. doi: 10.1093/cercor/bhv129. Epub 2015 Jun 19.
8. Predicting Human Saccadic Scanpaths Based on Iterative Representation Learning.
   IEEE Trans Image Process. 2019 Jul;28(7):3502-3515. doi: 10.1109/TIP.2019.2897966. Epub 2019 Feb 7.
9. A Model of the Superior Colliculus Predicts Fixation Locations during Scene Viewing and Visual Search.
   J Neurosci. 2017 Feb 8;37(6):1453-1467. doi: 10.1523/JNEUROSCI.0825-16.2016. Epub 2016 Dec 30.
10. Scanpath Prediction on Information Visualisations.
    IEEE Trans Vis Comput Graph. 2024 Jul;30(7):3902-3914. doi: 10.1109/TVCG.2023.3242293. Epub 2024 Jun 27.