将灵活归一化集成到深度卷积神经网络的中级表示中。

Integrating Flexible Normalization into Midlevel Representations of Deep Convolutional Neural Networks.

作者信息

Giraldo Luis Gonzalo Sánchez, Schwartz Odelia

机构信息

Computer Science Department, University of Miami, Coral Gables, FL 33146, U.S.A.

出版信息

Neural Comput. 2019 Nov;31(11):2138-2176. doi: 10.1162/neco_a_01226. Epub 2019 Sep 16.

DOI:10.1162/neco_a_01226

PMID:31525314

Abstract

Deep convolutional neural networks (CNNs) are becoming increasingly popular models to predict neural responses in visual cortex. However, contextual effects, which are prevalent in neural processing and in perception, are not explicitly handled by current CNNs, including those used for neural prediction. In primary visual cortex, neural responses are modulated by stimuli spatially surrounding the classical receptive field in rich ways. These effects have been modeled with divisive normalization approaches, including flexible models, where spatial normalization is recruited only to the degree that responses from center and surround locations are deemed statistically dependent. We propose a flexible normalization model applied to midlevel representations of deep CNNs as a tractable way to study contextual normalization mechanisms in midlevel cortical areas. This approach captures nontrivial spatial dependencies among midlevel features in CNNs, such as those present in textures and other visual stimuli, that arise from tiling high-order features geometrically. We expect that the proposed approach can make predictions about when spatial normalization might be recruited in midlevel cortical areas. We also expect this approach to be useful as part of the CNN tool kit, therefore going beyond more restrictive fixed forms of normalization.

摘要

深度卷积神经网络（CNN）正日益成为预测视觉皮层神经反应的流行模型。然而，当前的CNN，包括用于神经预测的那些，并未明确处理在神经处理和感知中普遍存在的上下文效应。在初级视觉皮层中，神经反应会受到经典感受野周围空间刺激的丰富调制。这些效应已通过除法归一化方法进行建模，包括灵活模型，其中空间归一化仅在中心和周围位置的反应被认为具有统计依赖性的程度上被采用。我们提出一种应用于深度CNN中层表示的灵活归一化模型，作为研究中层皮层区域上下文归一化机制的一种易于处理的方法。这种方法捕捉了CNN中层特征之间重要的空间依赖性，例如纹理和其他视觉刺激中存在的那些，这些依赖性是通过几何方式平铺高阶特征而产生的。我们期望所提出的方法能够预测中层皮层区域何时可能采用空间归一化。我们还期望这种方法作为CNN工具包的一部分会很有用，因此超越了更具限制性的固定形式的归一化。

相似文献

Integrating Flexible Normalization into Midlevel Representations of Deep Convolutional Neural Networks.

Neural Comput. 2019 Nov;31(11):2138-2176. doi: 10.1162/neco_a_01226. Epub 2019 Sep 16.

The impact on midlevel vision of statistically optimal divisive normalization in V1.

J Vis. 2013 Jul 15;13(8):13. doi: 10.1167/13.8.13.

Learning divisive normalization in primary visual cortex.

PLoS Comput Biol. 2021 Jun 7;17(6):e1009028. doi: 10.1371/journal.pcbi.1009028. eCollection 2021 Jun.

Generalizing biological surround suppression based on center surround similarity via deep neural network models.

PLoS Comput Biol. 2023 Sep 22;19(9):e1011486. doi: 10.1371/journal.pcbi.1011486. eCollection 2023 Sep.

Approximating the Architecture of Visual Cortex in a Convolutional Network.

Neural Comput. 2019 Aug;31(8):1551-1591. doi: 10.1162/neco_a_01211. Epub 2019 Jul 1.

Visual attention and flexible normalization pools.

J Vis. 2013 Jan 23;13(1):25. doi: 10.1167/13.1.25.

Deep neural networks capture texture sensitivity in V2.

J Vis. 2020 Jul 1;20(7):21-1. doi: 10.1167/jov.20.7.21.

Convolutional neural network models of V1 responses to complex patterns.

J Comput Neurosci. 2019 Feb;46(1):33-54. doi: 10.1007/s10827-018-0687-7. Epub 2018 Jun 5.

Computational mechanisms underlying cortical responses to the affordance properties of visual scenes.

PLoS Comput Biol. 2018 Apr 23;14(4):e1006111. doi: 10.1371/journal.pcbi.1006111. eCollection 2018 Apr.

Nat Commun. 2021 Mar 25;12(1):1872. doi: 10.1038/s41467-021-22078-3.

引用本文的文献

Generalizing biological surround suppression based on center surround similarity via deep neural network models.

PLoS Comput Biol. 2023 Sep 22;19(9):e1011486. doi: 10.1371/journal.pcbi.1011486. eCollection 2023 Sep.

Texture Interpolation for Probing Visual Perception.

Adv Neural Inf Process Syst. 2020 Dec;33:22146-22157.

Deep neural networks capture texture sensitivity in V2.

J Vis. 2020 Jul 1;20(7):21-1. doi: 10.1167/jov.20.7.21.

Stimulus- and goal-oriented frameworks for understanding natural vision.

Nat Neurosci. 2019 Jan;22(1):15-24. doi: 10.1038/s41593-018-0284-0. Epub 2018 Dec 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

将灵活归一化集成到深度卷积神经网络的中级表示中。

Integrating Flexible Normalization into Midlevel Representations of Deep Convolutional Neural Networks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献