Honda Research Institute Europe GmbH, Carl-Legien-Strasse 30, Offenbach/Main, Germany.
Neural Netw. 2009 Oct;22(8):1055-70. doi: 10.1016/j.neunet.2009.07.021. Epub 2009 Jul 19.
Experimental data suggests that a first hypothesis about the content of a complex visual scene is available as early as 150 ms after stimulus presentation. Other evidence suggests that recognition in the visual cortex of mammals is a bidirectional, often top-down driven process. Here, we present a spiking neural network model that demonstrates how the cortex can use both strategies: Faced with a new stimulus, the cortex first tries to catch the gist of the scene. The gist is then fed back as global hypothesis to influence and redirect further bottom-up processing. We propose that these two modes of processing are carried out in different layers of the cortex. A cortical column may, thus, be primarily defined by the specific connectivity that links neurons in different layers into a functional circuit. Given an input, our model generates an initial hypothesis after only a few milliseconds. The first wave of action potentials traveling up the hierarchy activates representations of features and feature combinations. In most cases, the correct feature representation is activated strongest and precedes all other candidates with millisecond precision. Thus, our model codes the reliability of a response in the relative latency of spikes. In the subsequent refinement stage where high-level activity modulates lower stages, this activation dominance is propagated back, influencing its own afferent activity to establish a unique decision. Thus, top-down influence de-activates representations that have contributed to the initial hypothesis about the current stimulus, comparable to predictive coding. Features that do not match the top-down prediction trigger an error signal that can be the basis for learning new representations.
实验数据表明,在刺激呈现后 150 毫秒左右,复杂视觉场景的内容就可以提供第一个假设。其他证据表明,哺乳动物视觉皮层的识别是一个双向的、通常是自上而下驱动的过程。在这里,我们提出了一个尖峰神经网络模型,该模型展示了皮层如何使用这两种策略:面对新的刺激,皮层首先试图抓住场景的要点。然后,将要点作为全局假设反馈,以影响和重新引导进一步的自下而上的处理。我们提出,这两种处理模式在皮层的不同层中进行。因此,一个皮层柱可能主要由将不同层中的神经元连接成一个功能电路的特定连接来定义。给定一个输入,我们的模型仅在几毫秒后就会生成一个初始假设。沿层次结构传播的第一波动作电位激活特征和特征组合的表示。在大多数情况下,正确的特征表示被激活得最强,并以毫秒精度优先于所有其他候选者。因此,我们的模型通过相对尖峰延迟来编码响应的可靠性。在随后的细化阶段,高级活动调节低级阶段,这种激活优势会被反向传播,影响其自身的传入活动以建立独特的决策。因此,自上而下的影响会使对当前刺激的初始假设做出贡献的表示失活,类似于预测编码。与自上而下的预测不匹配的特征会触发错误信号,该信号可以作为学习新表示的基础。