Analytics Department, ID Analytics, San Diego, CA, USA.
Statistical and Visual Computing Lab, Electrical and Computer Engineering, University of California San Diego, La Jolla, CA, USA.
Front Comput Neurosci. 2014 Sep 9;8:109. doi: 10.3389/fncom.2014.00109. eCollection 2014.
The benefits of integrating attention and object recognition are investigated. While attention is frequently modeled as a pre-processor for recognition, we investigate the hypothesis that attention is an intrinsic component of recognition and vice-versa. This hypothesis is tested with a recognition model, the hierarchical discriminant saliency network (HDSN), whose layers are top-down saliency detectors, tuned for a visual class according to the principles of discriminant saliency. As a model of neural computation, the HDSN has two possible implementations. In a biologically plausible implementation, all layers comply with the standard neurophysiological model of visual cortex, with sub-layers of simple and complex units that implement a combination of filtering, divisive normalization, pooling, and non-linearities. In a convolutional neural network implementation, all layers are convolutional and implement a combination of filtering, rectification, and pooling. The rectification is performed with a parametric extension of the now popular rectified linear units (ReLUs), whose parameters can be tuned for the detection of target object classes. This enables a number of functional enhancements over neural network models that lack a connection to saliency, including optimal feature denoising mechanisms for recognition, modulation of saliency responses by the discriminant power of the underlying features, and the ability to detect both feature presence and absence. In either implementation, each layer has a precise statistical interpretation, and all parameters are tuned by statistical learning. Each saliency detection layer learns more discriminant saliency templates than its predecessors and higher layers have larger pooling fields. This enables the HDSN to simultaneously achieve high selectivity to target object classes and invariance. 
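The convolutional implementation described above combines filtering, rectification, and pooling in each layer, with the rectification performed by a parametric extension of the ReLU. The following sketch illustrates that three-stage structure in NumPy. The threshold-shifted rectifier used here is only a stand-in: the paper derives its parametric form from discriminant saliency statistics, and all names (`saliency_layer`, `parametric_relu`, `theta`) are illustrative, not the authors' code.

```python
import numpy as np

def conv2d_valid(x, k):
    """Naive 'valid'-mode 2D cross-correlation (filtering stage; for illustration only)."""
    h, w = x.shape
    kh, kw = k.shape
    out = np.empty((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out

def parametric_relu(z, theta):
    """Rectification stage: a ReLU with a tunable threshold theta.
    This shift is a hypothetical simplification of the paper's parametric ReLU,
    whose parameters are tuned for the detection of target object classes."""
    return np.maximum(0.0, z - theta)

def max_pool(z, size=2):
    """Pooling stage: non-overlapping max pooling over size x size blocks."""
    h, w = (z.shape[0] // size) * size, (z.shape[1] // size) * size
    z = z[:h, :w]
    return z.reshape(h // size, size, w // size, size).max(axis=(1, 3))

def saliency_layer(image, filt, theta, pool_size=2):
    """One filtering -> rectification -> pooling stage, the combination
    each convolutional HDSN layer implements; higher layers would use
    larger pooling fields."""
    return max_pool(parametric_relu(conv2d_valid(image, filt), theta), pool_size)
```

As a usage example, applying `saliency_layer` to an 8x8 input with a 3x3 filter and `pool_size=2` yields a 3x3 non-negative response map; stacking such layers with growing pooling fields mirrors the selectivity/invariance trade-off the abstract describes.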
The performance of the network in saliency and object recognition tasks is compared to that of models from the biological and computer vision literatures. This demonstrates the benefits of all the functional enhancements of the HDSN, of the class tuning inherent to discriminant saliency, and of saliency layers based on templates of increasing target selectivity and invariance. Altogether, these experiments suggest that there are non-trivial benefits in integrating attention and recognition.