Department of Applied Science and Technology & Center for Computational Sciences, Politecnico di Torino, Torino, Italy.
PLoS Comput Biol. 2013;9(8):e1003167. doi: 10.1371/journal.pcbi.1003167. Epub 2013 Aug 8.
The anterior inferotemporal cortex (IT) is the highest stage along the hierarchy of visual areas that, in primates, processes visual objects. Although several lines of evidence suggest that IT primarily represents visual shape information, some recent studies have argued that neuronal ensembles in IT code the semantic membership of visual objects (i.e., represent conceptual classes such as animate and inanimate objects). In this study, we investigated to what extent semantic, rather than purely visual information, is represented in IT by performing a multivariate analysis of IT responses to a set of visual objects. By relying on a variety of machine-learning approaches (including a cutting-edge clustering algorithm that has been recently developed in the domain of statistical physics), we found that, in most instances, IT representation of visual objects is accounted for by their similarity at the level of shape or, more surprisingly, low-level visual properties. Only in a few cases we observed IT representations of semantic classes that were not explainable by the visual similarity of their members. Overall, these findings reassert the primary function of IT as a conveyor of explicit visual shape information, and reveal that low-level visual properties are represented in IT to a greater extent than previously appreciated. In addition, our work demonstrates how combining a variety of state-of-the-art multivariate approaches, and carefully estimating the contribution of shape similarity to the representation of object categories, can substantially advance our understanding of neuronal coding of visual objects in cortex.
前下颞叶皮层(IT)是沿着视觉区域层级结构的最高阶段,在灵长类动物中,它处理视觉对象。尽管有几条证据表明 IT 主要表示视觉形状信息,但最近的一些研究认为,IT 中的神经元集合对视觉对象的语义成员身份进行编码(即表示概念类别,如生物和非生物对象)。在这项研究中,我们通过对一组视觉对象的 IT 反应进行多元分析,研究了语义信息在多大程度上代表了 IT,而不仅仅是纯粹的视觉信息。通过依赖各种机器学习方法(包括最近在统计物理领域开发的一种前沿聚类算法),我们发现,在大多数情况下,视觉对象的 IT 表示是由其形状相似性或更令人惊讶的低水平视觉特性来解释的。只有在少数情况下,我们观察到 IT 对语义类别的表示,而这些表示不能用其成员的视觉相似性来解释。总的来说,这些发现再次强调了 IT 的主要功能是作为明确视觉形状信息的传递者,并且表明低水平视觉特性在 IT 中的表示程度比以前认为的要高。此外,我们的工作表明,如何结合各种最先进的多元方法,并仔细估计形状相似性对物体类别表示的贡献,可以大大提高我们对皮层中视觉物体的神经元编码的理解。