Centre for Neuroscience, Indian Institute of Science, 560012 Bangalore, India.
Centre for Neuroscience, Indian Institute of Science, 560012 Bangalore, India
Proc Natl Acad Sci U S A. 2018 Apr 3;115(14):E3276-E3285. doi: 10.1073/pnas.1714287115. Epub 2018 Mar 20.
Object recognition is challenging because the same object can produce vastly different images, mixing signals related to its identity with signals due to its image attributes, such as size, position, rotation, etc. Previous studies have shown that both signals are present in high-level visual areas, but precisely how they are combined has remained unclear. One possibility is that neurons might encode identity and attribute signals multiplicatively so that each can be efficiently decoded without interference from the other. Here, we show that, in high-level visual cortex, responses of single neurons can be explained better as a product rather than a sum of tuning for object identity and tuning for image attributes. This subtle effect in single neurons produced substantially better population decoding of object identity and image attributes in the neural population as a whole. This property was absent both in low-level vision models and in deep neural networks. It was also unique to invariances: when tested with two-part objects, neural responses were explained better as a sum than as a product of part tuning. Taken together, our results indicate that signals requiring separate decoding, such as object identity and image attributes, are combined multiplicatively in IT neurons, whereas signals that require integration (such as parts in an object) are combined additively.
对象识别具有挑战性,因为同一个对象可能会产生截然不同的图像,将与其身份相关的信号与由于其图像属性(例如大小、位置、旋转等)而产生的信号混合在一起。先前的研究表明,这两种信号都存在于高级视觉区域中,但它们是如何组合的仍然不清楚。一种可能性是,神经元可能会以乘法方式对身份和属性信号进行编码,以便可以在不受其他信号干扰的情况下有效地对每个信号进行解码。在这里,我们表明,在高级视觉皮层中,单个神经元的反应可以更好地解释为乘积,而不是对象身份调谐和图像属性调谐的和。这种单个神经元中的细微效果导致整个神经元群体对对象身份和图像属性的群体解码有了实质性的改善。这种特性在低级视觉模型和深度神经网络中都不存在。它也与不变性独特相关:当用两部分物体进行测试时,神经反应的和比乘积更能很好地解释。总的来说,我们的结果表明,需要单独解码的信号(例如对象身份和图像属性)在 IT 神经元中以乘法方式组合,而需要整合的信号(例如对象的各个部分)则以加法方式组合。