Laboratory of Brain and Cognition, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892, USA.
J Neurosci. 2011 May 18;31(20):7322-33. doi: 10.1523/JNEUROSCI.4588-10.2011.
Real-world scenes are incredibly complex and heterogeneous, yet we are able to identify and categorize them effortlessly. In humans, the ventral temporal parahippocampal place area (PPA) has been implicated in scene processing, but scene information is contained in many visual areas, leaving their specific contributions unclear. Although early theories of PPA emphasized its role in spatial processing, more recent reports of its function have emphasized semantic or contextual processing. Here, using functional imaging, we reconstructed the organization of scene representations across human ventral visual cortex by analyzing the distributed response to 96 diverse real-world scenes. We found that, although individual scenes could be decoded in both PPA and early visual cortex (EVC), the structure of representations in these regions was vastly different. In both regions, spatial rather than semantic factors defined the structure of representations. However, in PPA, representations were defined primarily by the spatial factor of expanse (open, closed) and in EVC primarily by distance (near, far). Furthermore, independent behavioral ratings of expanse and distance correlated strongly with representations in PPA and peripheral EVC, respectively. In neither region was content (manmade, natural) a major contributor to the overall organization. Furthermore, the response of PPA could not be used to decode the high-level semantic category of scenes even when spatial factors were held constant, nor could category be decoded across different distances. These findings demonstrate, contrary to recent reports, that the response of PPA primarily reflects spatial, not categorical or contextual, aspects of real-world scenes.
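The abstract's comparison between behavioral ratings and the structure of regional representations can be illustrated with a minimal sketch. This is not the authors' analysis pipeline: the data are synthetic, the use of correlation distance and a rank correlation is an assumption, and every function and variable name (`dissimilarity_matrix`, `structure_correlation`, `open_axis`, and so on) is hypothetical. The simulated region encodes expanse but not distance, so its representational structure should track expanse ratings more closely.

```python
import numpy as np

def dissimilarity_matrix(patterns):
    """1 - Pearson correlation between each pair of scene response patterns (rows)."""
    return 1.0 - np.corrcoef(patterns)

def rating_distance_matrix(ratings):
    """Absolute difference in a behavioral rating for each pair of scenes."""
    ratings = np.asarray(ratings, dtype=float)
    return np.abs(ratings[:, None] - ratings[None, :])

def upper_triangle(matrix):
    """Off-diagonal upper-triangle entries, one value per scene pair."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]

def ordinal_ranks(values):
    """Ranks 0..n-1; no tie correction, which is fine for continuous data."""
    order = np.argsort(values, kind="stable")
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(len(values))
    return ranks

def structure_correlation(patterns, ratings):
    """Rank correlation between neural and rating-based pairwise dissimilarities."""
    neural = upper_triangle(dissimilarity_matrix(patterns))
    behavioral = upper_triangle(rating_distance_matrix(ratings))
    return np.corrcoef(ordinal_ranks(neural), ordinal_ranks(behavioral))[0, 1]

# Synthetic data (assumption): 96 scenes, 200 voxels, expanse-driven responses.
rng = np.random.default_rng(0)
n_scenes, n_voxels = 96, 200
expanse = rng.uniform(0.0, 1.0, n_scenes)   # hypothetical open/closed ratings
distance = rng.uniform(0.0, 1.0, n_scenes)  # hypothetical near/far ratings

open_axis = rng.normal(size=n_voxels)       # voxel pattern for fully open scenes
closed_axis = rng.normal(size=n_voxels)     # voxel pattern for fully closed scenes
patterns = (np.outer(expanse, open_axis)
            + np.outer(1.0 - expanse, closed_axis)
            + 0.5 * rng.normal(size=(n_scenes, n_voxels)))

r_expanse = structure_correlation(patterns, expanse)
r_distance = structure_correlation(patterns, distance)
print(f"expanse: {r_expanse:.2f}, distance: {r_distance:.2f}")
```

With real data, `patterns` would hold one row of voxel responses per scene for a given region, and the two rating vectors would come from independent behavioral judgments.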