Galeazzi Juan M, Minini Loredana, Stringer Simon M
Department of Experimental Psychology, Oxford Centre for Theoretical Neuroscience and Artificial Intelligence, University of Oxford Oxford, UK.
Front Comput Neurosci. 2015 Dec 15;9:147. doi: 10.3389/fncom.2015.00147. eCollection 2015.
Neurons that respond to visual targets in a hand-centered frame of reference have been found within various areas of the primate brain. We investigate how hand-centered visual representations may develop in a neural network model of the primate visual system called VisNet, when the model is trained on images of the hand seen against natural visual scenes. The simulations show how such neurons may develop through a biologically plausible process of unsupervised competitive learning and self-organization. In an advance on our previous work, the visual scenes consisted of multiple targets presented simultaneously with respect to the hand. Three experiments are presented. First, VisNet was trained with computerized images consisting of a realistic image of a hand and a variety of natural objects, presented in different textured backgrounds during training. The network was then tested with just one textured object near the hand in order to verify if the output cells were capable of building hand-centered representations with a single localized receptive field. We explain the underlying principles of the statistical decoupling that allows the output cells of the network to develop single localized receptive fields even when the network is trained with multiple objects. In a second simulation we examined how some of the cells with hand-centered receptive fields decreased their shape selectivity and started responding to a localized region of hand-centered space as the number of objects presented in overlapping locations during training increases. Lastly, we explored the same learning principles training the network with natural visual scenes collected by volunteers. These results provide an important step in showing how single, localized, hand-centered receptive fields could emerge under more ecologically realistic visual training conditions.
在灵长类动物大脑的各个区域中,已经发现了在手中心参照系中对视觉目标做出反应的神经元。我们研究了在一个名为VisNet的灵长类视觉系统神经网络模型中,当该模型在以自然视觉场景为背景的手部图像上进行训练时,以手为中心的视觉表征是如何发展的。模拟结果展示了这些神经元如何通过无监督竞争学习和自组织这一具有生物学合理性的过程得以发展。相较于我们之前的工作,此次视觉场景由相对于手部同时呈现的多个目标组成。本文呈现了三个实验。首先,使用由手部的真实图像和各种自然物体组成的计算机图像对VisNet进行训练,在训练过程中这些图像呈现于不同的纹理背景中。然后,仅用手部附近的一个纹理物体对该网络进行测试,以验证输出细胞是否能够通过单个局部感受野构建以手为中心的表征。我们解释了统计解耦的基本原理,正是这种原理使得网络的输出细胞即便在使用多个物体进行训练时,仍能发展出单个局部感受野。在第二个模拟实验中,我们研究了随着训练过程中在重叠位置呈现的物体数量增加,一些具有以手为中心感受野的细胞是如何降低其形状选择性并开始对手中心空间的局部区域做出反应的。最后,我们用志愿者收集的自然视觉场景对网络进行训练,探索相同的学习原理。这些结果为展示在更符合生态现实的视觉训练条件下,单个、局部的、以手为中心的感受野是如何出现的迈出了重要一步。