Hyvärinen Aapo, Gutmann Michael, Hoyer Patrik O
HIIT Basic Research Unit and Dept of Computer Science, University of Helsinki, Finland.
BMC Neurosci. 2005 Feb 16;6:12. doi: 10.1186/1471-2202-6-12.
It has been shown that the classical receptive fields of simple and complex cells in the primary visual cortex emerge from the statistical properties of natural images by forcing the cell responses to be maximally sparse or independent. We investigate how to learn features beyond the primary visual cortex from the statistical properties of modelled complex-cell outputs. In previous work, we showed that a new model, non-negative sparse coding, led to the emergence of features which code for contours of a given spatial frequency band.
We applied ordinary independent component analysis to modelled outputs of complex cells that span different frequency bands. The analysis led to the emergence of features which pool spatially coherent across-frequency activity in the modelled primary visual cortex. Thus, the statistically optimal way of processing complex-cell outputs abandons separate frequency channels, while preserving and even enhancing orientation tuning and spatial localization. As a technical aside, we found that the non-negativity constraint is not necessary: ordinary independent component analysis produces essentially the same results as our previous work.
We propose that the pooling that emerges allows the features to code for realistic low-level image features related to step edges. Further, the results prove the viability of statistical modelling of natural images as a framework that produces quantitative predictions of visual processing.
研究表明,初级视觉皮层中简单细胞和复杂细胞的经典感受野是通过迫使细胞反应达到最大程度的稀疏或独立,从自然图像的统计特性中产生的。我们研究如何从模拟的复杂细胞输出的统计特性中学习初级视觉皮层之外的特征。在之前的工作中,我们表明一种新模型——非负稀疏编码,导致了编码给定空间频率带轮廓的特征的出现。
我们将普通独立成分分析应用于跨越不同频带的复杂细胞的模拟输出。该分析导致了在模拟的初级视觉皮层中汇聚跨频率空间相干活动的特征的出现。因此,处理复杂细胞输出的统计最优方法放弃了单独的频率通道,同时保留甚至增强了方向调谐和空间定位。作为一个技术附注,我们发现非负性约束并非必要:普通独立成分分析产生的结果与我们之前的工作基本相同。
我们提出出现的汇聚使得这些特征能够编码与阶跃边缘相关的现实低级图像特征。此外,这些结果证明了自然图像统计建模作为一个能够产生视觉处理定量预测的框架的可行性。