LAPC, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China.
BMC Neurosci. 2010 Mar 26;11:43. doi: 10.1186/1471-2202-11-43.
What kind of neural computation is actually performed by the primary visual cortex and how is this represented mathematically at the system level? It is an important problem in the visual information processing, but has not been well answered. In this paper, according to our understanding of retinal organization and parallel multi-channel topographical mapping between retina and primary visual cortex V1, we divide an image into orthogonal and orderly array of image primitives (or patches), in which each patch will evoke activities of simple cells in V1. From viewpoint of information processing, this activated process, essentially, involves optimal detection and optimal matching of receptive fields of simple cells with features contained in image patches. For the reconstruction of the visual image in the visual cortex V1 based on the principle of minimum mean squares error, it is natural to use the inner product expression in neural computation, which then is transformed into matrix form.
The inner product is carried out by using Kronecker product between patches and function architecture (or functional column) in localized and oriented neural computing. Compared with Fourier Transform, the mathematical description of Kronecker product is simple and intuitive, so is the algorithm more suitable for neural computation of visual cortex V1. Results of computer simulation based on two-dimensional Gabor pyramid wavelets show that the theoretical analysis and the proposed model are reasonable.
Our results are: 1. The neural computation of the retinal image in cortex V1 can be expressed to Kronecker product operation and its matrix form, this algorithm is implemented by the inner operation between retinal image primitives and primary visual cortex's column. It has simple, efficient and robust features, which is, therefore, such a neural algorithm, which can be completed by biological vision. 2. It is more suitable that the function of cortical column in cortex V1 is considered as the basic unit of visual image processing (such unit can implement basic multiplication of visual primitives, such as contour, line, and edge), rather than a set of tiled array filter. Fourier Transformation is replaced with Kronecker product, which greatly reduces the computational complexity. The neurobiological basis of this idea is that a visual image can be represented as a linear combination of orderly orthogonal primitive image containing some local feature. In the visual pathway, the image patches are topographically mapped onto cortex V1 through parallel multi-channels and then are processed independently by functional columns. Clearly, the above new perspective has some reference significance to exploring the neural mechanisms on the human visual information processing.
初级视觉皮层实际执行的是哪种神经计算,以及这种计算在系统水平上如何用数学表示?这是视觉信息处理中的一个重要问题,但尚未得到很好的回答。在本文中,根据我们对视网膜组织的理解以及视网膜和初级视觉皮层 V1 之间的并行多通道地形映射,我们将图像划分为正交有序的图像基元(或补丁)阵列,其中每个补丁都会引发 V1 中的简单细胞的活动。从信息处理的角度来看,这个激活过程本质上涉及到简单细胞的感受野与图像补丁中包含的特征的最佳检测和最佳匹配。为了根据最小均方误差原理在 V1 视觉皮层中重建视觉图像,使用内积在神经计算中的表示形式是自然的,然后将其转换为矩阵形式。
使用局部和定向神经计算中的补丁和功能架构(或功能柱)之间的 Kronecker 积来执行内积。与傅里叶变换相比,Kronecker 积的数学描述简单直观,因此该算法更适合 V1 视觉皮层的神经计算。基于二维 Gabor 金字塔小波的计算机模拟结果表明,理论分析和提出的模型是合理的。
我们的结果是:1. 皮层 V1 中视网膜图像的神经计算可以表示为 Kronecker 积运算及其矩阵形式,该算法通过视网膜图像基元和初级视觉皮层柱之间的内部运算来实现。它具有简单、高效和鲁棒的特点,因此是一种可以通过生物视觉完成的神经算法。2. 皮层 V1 中的皮层柱的功能被认为是视觉图像处理的基本单元(这种单元可以实现视觉基元的基本乘法,如轮廓、线和边缘),而不是一组平铺阵列滤波器,这更合适。用 Kronecker 积代替傅里叶变换,大大降低了计算复杂度。这种思想的神经生物学基础是,一个视觉图像可以表示为包含一些局部特征的有序正交基元图像的线性组合。在视觉通路中,图像补丁通过并行多通道映射到 V1 皮层,并由功能柱独立处理。显然,上述新视角对探索人类视觉信息处理的神经机制具有一定的参考意义。