Groen Iris I A, Silson Edward H, Baker Chris I
Laboratory of Brain and Cognition, National Institutes of Health, 10 Center Drive 10-3N228, Bethesda, MD, USA
Laboratory of Brain and Cognition, National Institutes of Health, 10 Center Drive 10-3N228, Bethesda, MD, USA.
Philos Trans R Soc Lond B Biol Sci. 2017 Feb 19;372(1714). doi: 10.1098/rstb.2016.0102. Epub 2017 Jan 2.
Visual scene analysis in humans has been characterized by the presence of regions in extrastriate cortex that are selectively responsive to scenes compared with objects or faces. While these regions have often been interpreted as representing high-level properties of scenes (e.g. category), they also exhibit substantial sensitivity to low-level (e.g. spatial frequency) and mid-level (e.g. spatial layout) properties, and it is unclear how these disparate findings can be united in a single framework. In this opinion piece, we suggest that this problem can be resolved by questioning the utility of the classical low- to high-level framework of visual perception for scene processing, and discuss why low- and mid-level properties may be particularly diagnostic for the behavioural goals specific to scene perception as compared to object recognition. In particular, we highlight the contributions of low-level vision to scene representation by reviewing (i) retinotopic biases and receptive field properties of scene-selective regions and (ii) the temporal dynamics of scene perception that demonstrate overlap of low- and mid-level feature representations with those of scene category. We discuss the relevance of these findings for scene perception and suggest a more expansive framework for visual scene analysis.This article is part of the themed issue 'Auditory and visual scene analysis'.
人类视觉场景分析的特点是,与物体或面孔相比,纹外皮层中存在对场景有选择性反应的区域。虽然这些区域通常被解释为代表场景的高级属性(如类别),但它们对低级(如空间频率)和中级(如空间布局)属性也表现出相当的敏感性,目前尚不清楚如何在一个统一的框架中将这些不同的发现结合起来。在这篇观点文章中,我们认为通过质疑视觉感知的经典低级到高级框架对场景处理的实用性,可以解决这个问题,并讨论为什么与物体识别相比,低级和中级属性对于场景感知特有的行为目标可能具有特别的诊断意义。特别是,我们通过回顾(i)场景选择性区域的视网膜拓扑偏差和感受野特性,以及(ii)场景感知的时间动态,突出了低级视觉对场景表征的贡献,这些动态表明低级和中级特征表征与场景类别的特征表征存在重叠。我们讨论了这些发现与场景感知的相关性,并提出了一个更广泛的视觉场景分析框架。本文是主题为“听觉和视觉场景分析”的特刊的一部分。