Kim Junkyung, Ricci Matthew, Serre Thomas
Department of Cognitive, Linguistic & Psychological Sciences, Carney Institute for Brain Science, Brown University, Providence, RI 02912, USA.
Interface Focus. 2018 Aug 6;8(4):20180011. doi: 10.1098/rsfs.2018.0011. Epub 2018 Jun 15.
The advent of deep learning has recently led to great successes in various engineering applications. As a prime example, convolutional neural networks, a type of feedforward neural network, now approach human accuracy on visual recognition tasks like image classification and face recognition. However, here we will show that feedforward neural networks struggle to learn abstract visual relations that are effortlessly recognized by non-human primates, birds, rodents and even insects. We systematically study the ability of feedforward neural networks to learn to recognize a variety of visual relations and demonstrate that same-different visual relations pose a particular strain on these networks. Networks fail to learn same-different visual relations when stimulus variability makes rote memorization difficult. Further, we show that learning same-different problems becomes trivial for a feedforward network that is fed with perceptually grouped stimuli. This demonstration and the comparative success of biological vision in learning visual relations suggests that feedback mechanisms such as attention, working memory and perceptual grouping may be the key components underlying human-level abstract visual reasoning.
深度学习的出现最近在各种工程应用中取得了巨大成功。一个典型的例子是卷积神经网络,它是前馈神经网络的一种,现在在图像分类和人脸识别等视觉识别任务上已接近人类的准确率。然而,我们将在此表明,前馈神经网络在学习抽象视觉关系方面存在困难,而这些关系非人类灵长类动物、鸟类、啮齿动物甚至昆虫都能轻松识别。我们系统地研究了前馈神经网络学习识别各种视觉关系的能力,并证明相同 - 不同的视觉关系给这些网络带来了特别的压力。当刺激的变异性使死记硬背变得困难时,网络无法学习相同 - 不同的视觉关系。此外,我们表明,对于输入经过感知分组刺激的前馈网络来说,学习相同 - 不同问题变得轻而易举。这一论证以及生物视觉在学习视觉关系方面的相对成功表明,诸如注意力、工作记忆和感知分组等反馈机制可能是人类水平抽象视觉推理的关键组成部分。