Vintch Brett, Movshon J Anthony, Simoncelli Eero P
Center for Neural Science.
Center for Neural Science, Courant Institute of Mathematical Sciences, and Howard Hughes Medical Institute, New York University, New York, New York 10003
J Neurosci. 2015 Nov 4;35(44):14829-41. doi: 10.1523/JNEUROSCI.2815-13.2015.
The response properties of neurons in the early stages of the visual system can be described using the rectified responses of a set of self-similar, spatially shifted linear filters. In macaque primary visual cortex (V1), simple cell responses can be captured with a single filter, whereas complex cells combine a set of filters, creating position invariance. These filters cannot be estimated using standard methods, such as spike-triggered averaging. Subspace methods like spike-triggered covariance can recover multiple filters but require substantial amounts of data, and recover an orthogonal basis for the subspace in which the filters reside, rather than the filters themselves. Here, we assume a linear-nonlinear-linear-nonlinear (LN-LN) cascade model in which the first LN stage consists of shifted ("convolutional") copies of a single filter, followed by a common instantaneous nonlinearity. We refer to these initial LN elements as the "subunits" of the receptive field, and we allow two independent sets of subunits, each with its own filter and nonlinearity. The second linear stage computes a weighted sum of the subunit responses and passes the result through a final instantaneous nonlinearity. We develop a procedure to directly fit this model to electrophysiological data. When fit to data from macaque V1, the subunit model significantly outperforms three alternatives in terms of cross-validated accuracy and efficiency, and provides a robust, biologically plausible account of receptive field structure for all cell types encountered in V1.
We present a new subunit model for neurons in primary visual cortex that significantly outperforms three alternative models in terms of cross-validated accuracy and efficiency, and provides a robust and biologically plausible account of the receptive field structure in these neurons across the full spectrum of response properties.
视觉系统早期阶段神经元的反应特性可以用一组自相似、空间移位的线性滤波器的整流反应来描述。在猕猴初级视觉皮层(V1)中,简单细胞的反应可以用单个滤波器捕获,而复杂细胞则组合一组滤波器,产生位置不变性。这些滤波器不能用标准方法估计,如脉冲触发平均法。像脉冲触发协方差这样的子空间方法可以恢复多个滤波器,但需要大量数据,并且恢复的是滤波器所在子空间的正交基,而不是滤波器本身。在这里,我们假设一个线性-非线性-线性-非线性(LN-LN)级联模型,其中第一个LN阶段由单个滤波器的移位(“卷积”)副本组成,接着是一个共同的瞬时非线性。我们将这些初始LN元素称为感受野的“亚单位”,并且允许两组独立的亚单位,每组都有自己的滤波器和非线性。第二个线性阶段计算亚单位反应的加权和,并将结果通过最终的瞬时非线性。我们开发了一种程序,将该模型直接拟合到电生理数据。当拟合猕猴V1的数据时,亚单位模型在交叉验证的准确性和效率方面显著优于三种替代模型,并为V1中遇到的所有细胞类型的感受野结构提供了一个稳健的、生物学上合理的解释。
我们提出了一种用于初级视觉皮层神经元的新亚单位模型,该模型在交叉验证的准确性和效率方面显著优于三种替代模型,并为这些神经元在整个反应特性范围内的感受野结构提供了一个稳健且生物学上合理的解释。