Recognition of natural scenes from global properties: seeing the forest without representing the trees.

Author information

Greene Michelle R, Oliva Aude

Affiliation

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue 46-4078, Cambridge, MA 02139, USA.

Publication information

Cogn Psychol. 2009 Mar;58(2):137-76. doi: 10.1016/j.cogpsych.2008.06.001. Epub 2008 Aug 30.

Abstract

Human observers are able to rapidly and accurately categorize natural scenes, but the representation mediating this feat is still unknown. Here we propose a framework of rapid scene categorization that does not segment a scene into objects and instead uses a vocabulary of global, ecological properties that describe spatial and functional aspects of scene space (such as navigability or mean depth). In Experiment 1, we obtained ground truth rankings on global properties for use in Experiments 2-4. To what extent do human observers use global property information when rapidly categorizing natural scenes? In Experiment 2, we found that global property resemblance was a strong predictor of both false alarm rates and reaction times in a rapid scene categorization experiment. To what extent is global property information alone a sufficient predictor of rapid natural scene categorization? In Experiment 3, we found that the performance of a classifier representing only these properties is indistinguishable from human performance in a rapid scene categorization task in terms of both accuracy and false alarms. To what extent is this high predictability unique to a global property representation? In Experiment 4, we compared two models that represent scene object information to human categorization performance and found that these models had lower fidelity at representing the patterns of performance than the global property model. These results provide support for the hypothesis that rapid categorization of natural scenes may not be mediated primarily through objects and parts, but also through global properties of structure and affordance.
