Institute of Neural Information Processing, Ulm University Ulm, Germany.
Department of Computer Science, University of Tübingen Tübingen, Germany.
Front Psychol. 2014 Dec 5;5:1287. doi: 10.3389/fpsyg.2014.01287. eCollection 2014.
The categorization of real world objects is often reflected in the similarity of their visual appearances. Such categories of objects do not necessarily form disjunct sets of objects, neither semantically nor visually. The relationship between categories can often be described in terms of a hierarchical structure. For instance, tigers and leopards build two separate mammalian categories, both of which are subcategories of the category Felidae. In the last decades, the unsupervised learning of categories of visual input stimuli has been addressed by numerous approaches in machine learning as well as in computational neuroscience. However, the question of what kind of mechanisms might be involved in the process of subcategory learning, or category refinement, remains a topic of active investigation. We propose a recurrent computational network architecture for the unsupervised learning of categorial and subcategorial visual input representations. During learning, the connection strengths of bottom-up weights from input to higher-level category representations are adapted according to the input activity distribution. In a similar manner, top-down weights learn to encode the characteristics of a specific stimulus category. Feedforward and feedback learning in combination realize an associative memory mechanism, enabling the selective top-down propagation of a category's feedback weight distribution. We suggest that the difference between the expected input encoded in the projective field of a category node and the current input pattern controls the amplification of feedforward-driven representations. Large enough differences trigger the recruitment of new representational resources and the establishment of additional (sub-) category representations. We demonstrate the temporal evolution of such learning and show how the proposed combination of an associative memory with a modulatory feedback integration successfully establishes category and subcategory representations.
真实世界物体的分类通常反映在它们视觉外观的相似性上。这些物体类别不一定在语义上或视觉上形成不相交的物体集合。类别之间的关系通常可以用层次结构来描述。例如,老虎和豹子构成了两个独立的哺乳动物类别,它们都是猫科的子类别。在过去的几十年中,机器学习和计算神经科学领域已经提出了许多方法来解决无监督学习视觉输入刺激的类别问题。然而,在子类别学习或类别细化过程中可能涉及到什么样的机制,仍然是一个活跃的研究课题。我们提出了一种用于无监督学习类别和子类别视觉输入表示的递归计算网络架构。在学习过程中,根据输入活动分布,从输入到更高层次类别表示的下传权重的连接强度会自适应调整。类似地,上传权重学会编码特定刺激类别的特征。前馈和反馈学习相结合实现了联想记忆机制,使特定类别反馈权重分布的选择性前馈传播成为可能。我们认为,在类别节点的投影场中编码的预期输入与当前输入模式之间的差异控制着前馈驱动表示的放大。足够大的差异会触发新的表示资源的招募和附加(子)类别表示的建立。我们演示了这种学习的时间演变,并展示了联想记忆与调制反馈集成的组合如何成功地建立了类别和子类别表示。