• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于模拟自下而上视觉注意力的连贯计算方法。

A coherent computational approach to model bottom-up visual attention.

作者信息

Le Meur Olivier, Le Callet Patrick, Barba Dominique, Thoreau Dominique

机构信息

Video Compression Laboratory, Thomson, Cesson-Sévigné, France.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):802-17. doi: 10.1109/TPAMI.2006.86.

DOI:10.1109/TPAMI.2006.86
PMID:16640265
Abstract

Visual attention is a mechanism which filters out redundant visual information and detects the most relevant parts of our visual field. Automatic determination of the most visually relevant areas would be useful in many applications such as image and video coding, watermarking, video browsing, and quality assessment. Many research groups are currently investigating computational modeling of the visual attention system. The first published computational models have been based on some basic and well-understood Human Visual System (HVS) properties. These models feature a single perceptual layer that simulates only one aspect of the visual system. More recent models integrate complex features of the HVS and simulate hierarchical perceptual representation of the visual input. The bottom-up mechanism is the most occurring feature found in modern models. This mechanism refers to involuntary attention (i.e., salient spatial visual features that effortlessly or involuntary attract our attention). This paper presents a coherent computational approach to the modeling of the bottom-up visual attention. This model is mainly based on the current understanding of the HVS behavior. Contrast sensitivity functions, perceptual decomposition, visual masking, and center-surround interactions are some of the features implemented in this model. The performances of this algorithm are assessed by using natural images and experimental measurements from an eye-tracking system. Two adequate well-known metrics (correlation coefficient and Kullbacl-Leibler divergence) are used to validate this model. A further metric is also defined. The results from this model are finally compared to those from a reference bottom-up model.

摘要

视觉注意力是一种机制,它能过滤掉冗余的视觉信息,并检测我们视野中最相关的部分。自动确定视觉上最相关的区域在许多应用中都很有用,如图像和视频编码、水印、视频浏览以及质量评估。目前许多研究小组正在研究视觉注意力系统的计算建模。最早发表的计算模型是基于一些基本且已被充分理解的人类视觉系统(HVS)特性。这些模型具有一个单一的感知层,仅模拟视觉系统的一个方面。最近的模型整合了HVS的复杂特征,并模拟视觉输入的分层感知表示。自下而上的机制是现代模型中最常见的特征。这种机制指的是不随意注意(即,显著的空间视觉特征会毫不费力地或不自主地吸引我们的注意力)。本文提出了一种连贯的计算方法来对自下而上的视觉注意力进行建模。该模型主要基于对HVS行为的当前理解。对比度敏感函数、感知分解、视觉掩蔽和中心 - 周边相互作用是该模型中实现的一些特征。通过使用自然图像和来自眼动追踪系统的实验测量来评估该算法的性能。使用两个合适的知名指标(相关系数和库尔贝克 - 莱布勒散度)来验证该模型。还定义了另一个指标。最后将该模型的结果与参考自下而上模型的结果进行比较。

相似文献

1
A coherent computational approach to model bottom-up visual attention.一种用于模拟自下而上视觉注意力的连贯计算方法。
IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):802-17. doi: 10.1109/TPAMI.2006.86.
2
Predicting visual fixations on video based on low-level visual features.基于低级视觉特征预测视频中的视觉注视点。
Vision Res. 2007 Sep;47(19):2483-98. doi: 10.1016/j.visres.2007.06.015. Epub 2007 Aug 3.
3
Components of bottom-up gaze allocation in natural images.自然图像中自下而上注视分配的组成部分。
Vision Res. 2005 Aug;45(18):2397-416. doi: 10.1016/j.visres.2005.03.019.
4
A model of modes of attention and inattention for artificial perception.一种用于人工感知的注意力与非注意力模式模型。
Bioinspir Biomim. 2007 Sep;2(3):S94-S115. doi: 10.1088/1748-3182/2/3/S03. Epub 2007 Jun 22.
5
An object-based visual attention model for robotic applications.一种用于机器人应用的基于对象的视觉注意力模型。
IEEE Trans Syst Man Cybern B Cybern. 2010 Oct;40(5):1398-412. doi: 10.1109/TSMCB.2009.2038895. Epub 2010 Feb 2.
6
Automatic computation of an image's statistical surprise predicts performance of human observers on a natural image detection task.图像统计意外性的自动计算可预测人类观察者在自然图像检测任务中的表现。
Vision Res. 2009 Jun;49(13):1620-37. doi: 10.1016/j.visres.2009.03.025. Epub 2009 Apr 5.
7
Neural blackboard architectures: the realization of compositionality and systematicity in neural networks.神经黑板架构:神经网络中组合性和系统性的实现
J Neural Eng. 2006 Mar;3(1):R1-12. doi: 10.1088/1741-2560/3/1/R01. Epub 2006 Feb 6.
8
Visual attention on the sphere.对球体的视觉注意力。
IEEE Trans Image Process. 2008 Nov;17(11):2000-14. doi: 10.1109/TIP.2008.2003415.
9
Computational versus psychophysical bottom-up image saliency: a comparative evaluation study.计算与心理物理底层图像显著性:一项比较评估研究。
IEEE Trans Pattern Anal Mach Intell. 2011 Nov;33(11):2131-46. doi: 10.1109/TPAMI.2011.53.
10
Tuning curve shift by attention modulation in cortical neurons: a computational study of its mechanisms.皮层神经元中注意力调制引起的调谐曲线偏移:其机制的计算研究
Cereb Cortex. 2006 Jun;16(6):761-78. doi: 10.1093/cercor/bhj021. Epub 2005 Aug 31.

引用本文的文献

1
Noise correlations and neuronal diversity may limit the utility of winner-take-all readout in a pop out visual search task.噪声相关性和神经元多样性可能会限制“赢家通吃”读出机制在弹出式视觉搜索任务中的效用。
PLoS Comput Biol. 2025 May 7;21(5):e1013092. doi: 10.1371/journal.pcbi.1013092. eCollection 2025 May.
2
Toward viewing behavior for aerial scene categorization.面向空中场景分类的行为观察。
Cogn Res Princ Implic. 2024 Mar 26;9(1):17. doi: 10.1186/s41235-024-00541-1.
3
Pixel-Domain Just Noticeable Difference Modeling with Heterogeneous Color Features.
基于异质颜色特征的像素域可察觉差异建模。
Sensors (Basel). 2023 Feb 5;23(4):1788. doi: 10.3390/s23041788.
4
Multiscale Cascaded Attention Network for Saliency Detection Based on ResNet.基于 ResNet 的多尺度级联注意力网络的显著目标检测
Sensors (Basel). 2022 Dec 16;22(24):9950. doi: 10.3390/s22249950.
5
BIAS-3D: Brain inspired attentional search model fashioned after what and where/how pathways for target search in 3D environment.BIAS-3D:一种受大脑启发的注意力搜索模型,它仿照用于在三维环境中进行目标搜索的“什么”和“哪里/如何”通路构建而成。
Front Comput Neurosci. 2022 Nov 18;16:1012559. doi: 10.3389/fncom.2022.1012559. eCollection 2022.
6
Where to look at the movies: Analyzing visual attention to understand movie editing.在哪里观看电影:通过分析视觉注意力来理解电影剪辑。
Behav Res Methods. 2023 Sep;55(6):2940-2959. doi: 10.3758/s13428-022-01949-7. Epub 2022 Aug 24.
7
Do Children With Developmental Language Disorder Activate Scene Knowledge to Guide Visual Attention? Effect of Object-Scene Inconsistencies on Gaze Allocation.患有发育性语言障碍的儿童会激活场景知识来引导视觉注意力吗?物体-场景不一致对注视分配的影响。
Front Psychol. 2022 Jan 7;12:796459. doi: 10.3389/fpsyg.2021.796459. eCollection 2021.
8
The attention schema theory in a neural network agent: Controlling visuospatial attention using a descriptive model of attention.神经网络代理中的注意模式理论:使用注意描述模型控制视空间注意。
Proc Natl Acad Sci U S A. 2021 Aug 17;118(33). doi: 10.1073/pnas.2102421118.
9
Salient Region Guided Blind Image Sharpness Assessment.显著区域引导盲图像锐度评估。
Sensors (Basel). 2021 Jun 8;21(12):3963. doi: 10.3390/s21123963.
10
A novel fully convolutional network for visual saliency prediction.一种用于视觉显著性预测的新型全卷积网络。
PeerJ Comput Sci. 2020 Jul 13;6:e280. doi: 10.7717/peerj-cs.280. eCollection 2020.