Einhäuser Wolfgang, Hipp Jörg, Eggert Julian, Körner Edgar, König Peter
Institute of Neuroinformatics, University & ETH Zürich, Zürich, Switzerland.
Biol Cybern. 2005 Jul;93(1):79-90. doi: 10.1007/s00422-005-0585-8. Epub 2005 Jul 13.
Invariant object recognition is arguably one of the major challenges for contemporary machine vision systems. In contrast, the mammalian visual system performs this task virtually effortlessly. How can we exploit our knowledge on the biological system to improve artificial systems? Our understanding of the mammalian early visual system has been augmented by the discovery that general coding principles could explain many aspects of neuronal response properties. How can such schemes be transferred to system level performance? In the present study we train cells on a particular variant of the general principle of temporal coherence, the "stability" objective. These cells are trained on unlabeled real-world images without a teaching signal. We show that after training, the cells form a representation that is largely independent of the viewpoint from which the stimulus is looked at. This finding includes generalization to previously unseen viewpoints. The achieved representation is better suited for view-point invariant object classification than the cells' input patterns. This property to facilitate view-point invariant classification is maintained even if training and classification take place in the presence of an--also unlabeled--distractor object. In summary, here we show that unsupervised learning using a general coding principle facilitates the classification of real-world objects, that are not segmented from the background and undergo complex, non-isomorphic, transformations.
不变目标识别可以说是当代机器视觉系统面临的主要挑战之一。相比之下,哺乳动物的视觉系统几乎可以毫不费力地完成这项任务。我们如何利用对生物系统的了解来改进人工系统呢?对哺乳动物早期视觉系统的发现增强了我们的理解,即通用编码原则可以解释神经元反应特性的许多方面。这样的方案如何转化为系统级性能呢?在本研究中,我们在时间相干性通用原则的一个特定变体——“稳定性”目标上训练细胞。这些细胞在没有教学信号的情况下,在未标记的真实世界图像上进行训练。我们表明,经过训练后,这些细胞形成的表征在很大程度上独立于观察刺激的视角。这一发现包括对以前未见过的视角的泛化。所实现的表征比细胞的输入模式更适合于视角不变的目标分类。即使在存在(同样未标记的)干扰物体的情况下进行训练和分类,这种促进视角不变分类的特性仍然得以保持。总之,我们在此表明,使用通用编码原则的无监督学习有助于对未从背景中分割出来且经历复杂、非同构变换的真实世界物体进行分类。