Giraldo Luis Gonzalo Sánchez, Schwartz Odelia
Computer Science Department, University of Miami, Coral Gables, FL 33146, U.S.A.
Neural Comput. 2019 Nov;31(11):2138-2176. doi: 10.1162/neco_a_01226. Epub 2019 Sep 16.
Deep convolutional neural networks (CNNs) are becoming increasingly popular models to predict neural responses in visual cortex. However, contextual effects, which are prevalent in neural processing and in perception, are not explicitly handled by current CNNs, including those used for neural prediction. In primary visual cortex, neural responses are modulated by stimuli spatially surrounding the classical receptive field in rich ways. These effects have been modeled with divisive normalization approaches, including flexible models, where spatial normalization is recruited only to the degree that responses from center and surround locations are deemed statistically dependent. We propose a flexible normalization model applied to midlevel representations of deep CNNs as a tractable way to study contextual normalization mechanisms in midlevel cortical areas. This approach captures nontrivial spatial dependencies among midlevel features in CNNs, such as those present in textures and other visual stimuli, that arise from tiling high-order features geometrically. We expect that the proposed approach can make predictions about when spatial normalization might be recruited in midlevel cortical areas. We also expect this approach to be useful as part of the CNN tool kit, therefore going beyond more restrictive fixed forms of normalization.
深度卷积神经网络(CNN)正日益成为预测视觉皮层神经反应的流行模型。然而,当前的CNN,包括用于神经预测的那些,并未明确处理在神经处理和感知中普遍存在的上下文效应。在初级视觉皮层中,神经反应会受到经典感受野周围空间刺激的丰富调制。这些效应已通过除法归一化方法进行建模,包括灵活模型,其中空间归一化仅在中心和周围位置的反应被认为具有统计依赖性的程度上被采用。我们提出一种应用于深度CNN中层表示的灵活归一化模型,作为研究中层皮层区域上下文归一化机制的一种易于处理的方法。这种方法捕捉了CNN中层特征之间重要的空间依赖性,例如纹理和其他视觉刺激中存在的那些,这些依赖性是通过几何方式平铺高阶特征而产生的。我们期望所提出的方法能够预测中层皮层区域何时可能采用空间归一化。我们还期望这种方法作为CNN工具包的一部分会很有用,因此超越了更具限制性的固定形式的归一化。