显著性（扩展显著性）：使用随机图像建模的有意义的注意力。

Esaliency (extended saliency): meaningful attention using stochastic image modeling.

机构信息

Computer Science Department, Technion-Israel Institute of Technology, Haifa 32000, Israel.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2010 Apr;32(4):693-708. doi: 10.1109/TPAMI.2009.53.

DOI:10.1109/TPAMI.2009.53

PMID:20224124

Abstract

Computer vision attention processes assign variable-hypothesized importance to different parts of the visual input and direct the allocation of computational resources. This nonuniform allocation might help accelerate the image analysis process. This paper proposes a new bottom-up attention mechanism. Rather than taking the traditional approach, which tries to model human attention, we propose a validated stochastic model to estimate the probability that an image part is of interest. We refer to this probability as saliency and thus specify saliency in a mathematically well-defined sense. The model quantifies several intuitive observations, such as the greater likelihood of correspondence between visually similar image regions and the likelihood that only a few of interesting objects will be present in the scene. The latter observation, which implies that such objects are (relaxed) global exceptions, replaces the traditional preference for local contrast. The algorithm starts with a rough preattentive segmentation and then uses a graphical model approximation to efficiently reveal which segments are more likely to be of interest. Experiments on natural scenes containing a variety of objects demonstrate the proposed method and show its advantages over previous approaches.

摘要

计算机视觉注意力过程会对视觉输入的不同部分赋予可变的假设重要性，并指导计算资源的分配。这种非均匀分配可能有助于加速图像分析过程。本文提出了一种新的自下而上的注意力机制。我们没有采用传统的方法来模拟人类注意力，而是提出了一种经过验证的随机模型来估计图像某个部分是否引人关注的概率。我们将这个概率称为显著度，并因此以数学上定义明确的方式指定显著度。该模型量化了几个直观的观察结果，例如视觉相似的图像区域之间更有可能对应，以及场景中只会出现少数感兴趣的对象的可能性。后一个观察结果意味着这些对象是（放宽的）全局异常，取代了传统上对局部对比度的偏好。该算法从粗略的前注意分割开始，然后使用图形模型近似来有效地揭示哪些片段更有可能引起关注。对包含各种对象的自然场景的实验证明了所提出的方法，并展示了其相对于先前方法的优势。

相似文献

Esaliency (extended saliency): meaningful attention using stochastic image modeling.

IEEE Trans Pattern Anal Mach Intell. 2010 Apr;32(4):693-708. doi: 10.1109/TPAMI.2009.53.

The improbability of harris interest points.

IEEE Trans Pattern Anal Mach Intell. 2010 Jun;32(6):1141-7. doi: 10.1109/TPAMI.2010.53.

Visual attention on the sphere.

IEEE Trans Image Process. 2008 Nov;17(11):2000-14. doi: 10.1109/TIP.2008.2003415.

A multiresolution stochastic level set method for Mumford-Shah image segmentation.

IEEE Trans Image Process. 2008 Dec;17(12):2289-300. doi: 10.1109/TIP.2008.2005823.

A probabilistic model of gaze imitation and shared attention.

Neural Netw. 2006 Apr;19(3):299-310. doi: 10.1016/j.neunet.2006.02.008.

Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search.

Psychol Rev. 2006 Oct;113(4):766-86. doi: 10.1037/0033-295X.113.4.766.

A coherent computational approach to model bottom-up visual attention.

IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):802-17. doi: 10.1109/TPAMI.2006.86.

An object-based visual attention model for robotic applications.

IEEE Trans Syst Man Cybern B Cybern. 2010 Oct;40(5):1398-412. doi: 10.1109/TSMCB.2009.2038895. Epub 2010 Feb 2.

Fast and robust generation of feature maps for region-based visual attention.

IEEE Trans Image Process. 2008 May;17(5):633-44. doi: 10.1109/TIP.2008.919365.

Attention-based dynamic visual search using inner-scene similarity: algorithms and bounds.

IEEE Trans Pattern Anal Mach Intell. 2006 Feb;28(2):251-64. doi: 10.1109/TPAMI.2006.28.

引用本文的文献

Supervisors' Visual Attention Allocation Modeling Using Hybrid Entropy.

Entropy (Basel). 2019 Apr 12;21(4):393. doi: 10.3390/e21040393.

Fuzzy Adaptive-Sampling Block Compressed Sensing for Wireless Multimedia Sensor Networks.

Sensors (Basel). 2020 Oct 31;20(21):6217. doi: 10.3390/s20216217.

Learning to Model Task-Oriented Attention.

Comput Intell Neurosci. 2016;2016:2381451. doi: 10.1155/2016/2381451. Epub 2016 May 9.

What do saliency models predict?

J Vis. 2014 Mar 11;14(3):14. doi: 10.1167/14.3.14.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

显著性（扩展显著性）：使用随机图像建模的有意义的注意力。

Esaliency (extended saliency): meaningful attention using stochastic image modeling.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献