Stringer S M, Rolls E T, Tromans J M
Department of Experimental Psychology, Oxford University, Centre for Computational Neuroscience, South Parks Road, Oxford OX1 3UD, England.
Network. 2007 Jun;18(2):161-87. doi: 10.1080/09548980701556055.
Over successive stages, the ventral visual system develops neurons that respond with view, size and position invariance to objects including faces. A major challenge is to explain how invariant representations of individual objects could develop given visual input from environments containing multiple objects. Here we show that the neurons in a 1-layer competitive network learn to represent combinations of three objects simultaneously present during training if the number of objects in the training set is low (e.g. 4), to represent combinations of two objects as the number of objects is increased to for e.g. 10, and to represent individual objects as the number of objects in the training set is increased further to for e.g. 20. We next show that translation invariant representations can be formed even when multiple stimuli are always present during training, by including a temporal trace in the learning rule. Finally, we show that these concepts can be extended to a multi-layer hierarchical network model (VisNet) of the ventral visual system. This approach provides a way to understand how a visual system can, by self-organizing competitive learning, form separate invariant representations of each object even when each object is presented in a scene with multiple other objects present, as in natural visual scenes.
在连续的阶段中,腹侧视觉系统发育出对包括面孔在内的物体具有视角、大小和位置不变性反应的神经元。一个主要挑战是解释在包含多个物体的环境中,给定视觉输入的情况下,单个物体的不变表示是如何形成的。在这里我们表明,如果训练集中物体的数量较少(例如4个),单层竞争网络中的神经元会学习同时表示训练期间同时出现的三个物体的组合;当物体数量增加到例如10个时,表示两个物体的组合;当训练集中物体的数量进一步增加到例如20个时,表示单个物体。接下来我们表明,通过在学习规则中纳入时间痕迹,即使在训练期间始终存在多个刺激,也可以形成平移不变表示。最后,我们表明这些概念可以扩展到腹侧视觉系统的多层层次网络模型(VisNet)。这种方法提供了一种途径,来理解视觉系统如何通过自组织竞争学习,即使每个物体都出现在有多个其他物体的场景中,如在自然视觉场景中那样,也能形成每个物体的单独不变表示。