Modeling the influence of task on attention.

Author information

Navalpakkam Vidhya, Itti Laurent

Affiliation

Department of Computer Science, Psychology and Neuroscience Graduate Program, University of Southern California, Hedco Neuroscience Building, Room 30A, Mail Code 2520, 3641 Watt Way, Los Angeles, CA 90089-2520, USA.

Publication information

Vision Res. 2005 Jan;45(2):205-31. doi: 10.1016/j.visres.2004.07.042.

DOI:10.1016/j.visres.2004.07.042

PMID:15581921

Abstract

We propose a computational model for the task-specific guidance of visual attention in real-world scenes. Our model emphasizes four aspects that are important in biological vision: determining task-relevance of an entity, biasing attention for the low-level visual features of desired targets, recognizing these targets using the same low-level features, and incrementally building a visual map of task-relevance at every scene location. Given a task definition in the form of keywords, the model first determines and stores the task-relevant entities in working memory, using prior knowledge stored in long-term memory. It attempts to detect the most relevant entity by biasing its visual attention system with the entity's learned low-level features. It attends to the most salient location in the scene, and attempts to recognize the attended object through hierarchical matching against object representations stored in long-term memory. It updates its working memory with the task-relevance of the recognized entity and updates a topographic task-relevance map with the location and relevance of the recognized entity. The model is tested on three types of tasks: single-target detection in 343 natural and synthetic images, where biasing for the target accelerates target detection over twofold on average; sequential multiple-target detection in 28 natural images, where biasing, recognition, working memory and long term memory contribute to rapidly finding all targets; and learning a map of likely locations of cars from a video clip filmed while driving on a highway. The model's performance on search for single features and feature conjunctions is consistent with existing psychophysical data. These results of our biologically-motivated architecture suggest that the model may provide a reasonable approximation to many brain processes involved in complex task-driven visual behaviors.
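The attend, recognize, and update cycle described in the abstract can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names (`biased_saliency`, `argmax2d`, `search`), the weighted-sum combination of feature maps, the dictionary lookup standing in for object recognition, and all numeric values.

```python
# Toy sketch only: names, weights, and the combination rule are illustrative
# assumptions, not the published model.

def biased_saliency(feature_maps, weights):
    """Weighted sum of per-feature maps (top-down bias as feature gains)."""
    first = next(iter(feature_maps.values()))
    rows, cols = len(first), len(first[0])
    sal = [[0.0] * cols for _ in range(rows)]
    for name, fmap in feature_maps.items():
        w = weights.get(name, 1.0)  # features without a bias keep gain 1.0
        for r in range(rows):
            for c in range(cols):
                sal[r][c] += w * fmap[r][c]
    return sal


def argmax2d(grid):
    """Return (row, col) of the largest value; ties go to the first found."""
    best = (0, 0)
    for r, row in enumerate(grid):
        for c, v in enumerate(row):
            if v > grid[best[0]][best[1]]:
                best = (r, c)
    return best


def search(feature_maps, weights, labels, relevance, target, max_fixations=10):
    """Attend to the peak, 'recognize' it, update the relevance map, repeat."""
    sal = biased_saliency(feature_maps, weights)
    rows, cols = len(sal), len(sal[0])
    trm = [[0.0] * cols for _ in range(rows)]   # task-relevance map
    for n in range(1, max_fixations + 1):
        r, c = argmax2d(sal)
        entity = labels.get((r, c))             # stand-in for recognition
        trm[r][c] = relevance.get(entity, 0.0)  # store relevance at location
        if entity == target:
            return n, trm
        sal[r][c] = float("-inf")               # do not revisit this location
    return None, trm


# Hypothetical 2x3 scene: a bright sign at (0, 0) and a red car at (1, 2).
feature_maps = {
    "intensity": [[0.9, 0.1, 0.1], [0.1, 0.1, 0.3]],
    "red":       [[0.0, 0.0, 0.0], [0.0, 0.0, 0.5]],
}
labels = {(0, 0): "sign", (1, 2): "car"}
relevance = {"car": 1.0, "sign": 0.2}

n_unbiased, trm_unbiased = search(feature_maps, {}, labels, relevance, "car")
n_biased, trm_biased = search(feature_maps, {"red": 3.0}, labels, relevance, "car")
print(n_unbiased, n_biased)  # 2 1
```

In this toy scene, raising the gain on the hypothetical "red" feature moves the car ahead of the brighter sign, so it is found on the first fixation instead of the second. The numbers say nothing about the speedups reported in the abstract.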

Similar articles

Performance of a Computational Model of the Mammalian Olfactory System

There's Waldo! A Normalization Model of Visual Search Predicts Single-Trial Human Fixations in an Object Search Task.

Cereb Cortex. 2016 Jul;26(7):3064-82. doi: 10.1093/cercor/bhv129. Epub 2015 Jun 19.

Involuntary top-down control by search-irrelevant features: Visual working memory biases attention in an object-based manner.

Cognition. 2018 Mar;172:37-45. doi: 10.1016/j.cognition.2017.12.002. Epub 2017 Dec 8.

Top-down attention based on object representation and incremental memory for knowledge building and inference.

Neural Netw. 2013 Oct;46:9-22. doi: 10.1016/j.neunet.2013.04.002. Epub 2013 Apr 8.

Explicit goal-driven attention, unlike implicitly learned attention, spreads to secondary tasks.

J Exp Psychol Hum Percept Perform. 2018 Mar;44(3):356-366. doi: 10.1037/xhp0000457. Epub 2017 Aug 10.

Long-term memory for distractors: Effects of involuntary attention from working memory.

Mem Cognit. 2024 Feb;52(2):401-416. doi: 10.3758/s13421-023-01469-5. Epub 2023 Sep 28.

Binding actions and scenes in visual long-term memory.

Psychon Bull Rev. 2013 Dec;20(6):1246-52. doi: 10.3758/s13423-013-0440-1.

A Computational Model of Visual Recognition Memory via Grid Cells.

Curr Biol. 2019 Mar 18;29(6):979-990.e4. doi: 10.1016/j.cub.2019.01.077. Epub 2019 Mar 7.

Dynamic interactions between visual working memory and saccade target selection.

J Vis. 2014 Sep 16;14(11):9. doi: 10.1167/14.11.9.

Cited by

Higher baseline alpha power is associated with faster responses in visual search.

bioRxiv. 2025 Aug 29:2025.08.29.673162. doi: 10.1101/2025.08.29.673162.

A Multimodal AI Framework for Automated Multiclass Lung Disease Diagnosis from Respiratory Sounds with Simulated Biomarker Fusion and Personalized Medication Recommendation.

Int J Mol Sci. 2025 Jul 24;26(15):7135. doi: 10.3390/ijms26157135.

Saliency Models Reveal Reduced Top-Down Attention in Attention-Deficit/Hyperactivity Disorder: A Naturalistic Eye-Tracking Study.

JAACAP Open. 2024 Apr 3;3(2):192-204. doi: 10.1016/j.jaacop.2024.03.001. eCollection 2025 Jun.

Guided visual search is associated with target boosting and distractor suppression in early visual cortex.

Commun Biol. 2025 Jun 11;8(1):912. doi: 10.1038/s42003-025-08321-3.

Brain-guided convolutional neural networks reveal task-specific representations in scene processing.

Sci Rep. 2025 Apr 15;15(1):13025. doi: 10.1038/s41598-025-96307-w.

Identification of a cognitive network with effective connectivity to post-stroke cognitive impairment.

Cogn Neurodyn. 2024 Dec;18(6):3741-3756. doi: 10.1007/s11571-024-10139-4. Epub 2024 Aug 12.

Saccades to partially occluded objects: Perceptual completion mediates oculomotor control.

J Vis. 2024 Mar 1;24(3):8. doi: 10.1167/jov.24.3.8.

Motor "laziness" constrains fixation selection in real-world tasks.

Proc Natl Acad Sci U S A. 2024 Mar 19;121(12):e2302239121. doi: 10.1073/pnas.2302239121. Epub 2024 Mar 12.

Objects guide human gaze behavior in dynamic real-world scenes.

PLoS Comput Biol. 2023 Oct 26;19(10):e1011512. doi: 10.1371/journal.pcbi.1011512. eCollection 2023 Oct.

Influence of eye movements on academic performance: A bibliometric and citation network analysis.

J Eye Mov Res. 2022 Sep 7;15(4). doi: 10.16910/jemr.15.4.4. eCollection 2022.
