物体识别中的度量不变性：综述及进一步证据

Metric invariance in object recognition: a review and further evidence.

作者信息

Cooper E E, Biederman I, Hummel J E

机构信息

University of Minnesota.

出版信息

Can J Psychol. 1992 Jun;46(2):191-214. doi: 10.1037/h0084317.

DOI:10.1037/h0084317

PMID:1451041

Abstract

Phenomenologically, human shape recognition appears to be invariant with changes of orientation in depth (up to parts occlusion), position in the visual field, and size. Recent versions of template theories (e.g., Ullman, 1989; Lowe, 1987) assume that these invariances are achieved through the application of transformations such as rotation, translation, and scaling of the image so that it can be matched metrically to a stored template. Presumably, such transformations would require time for their execution. We describe recent priming experiments in which the effects of a prior brief presentation of an image on its subsequent recognition are assessed. The results of these experiments indicate that the invariance is complete: The magnitude of visual priming (as distinct from name or basic level concept priming) is not affected by a change in position, size, orientation in depth, or the particular lines and vertices present in the image, as long as representations of the same components can be activated. An implemented seven layer neural network model (Hummel & Biederman, 1992) that captures these fundamental properties of human object recognition is described. Given a line drawing of an object, the model activates a viewpoint-invariant structural description of the object, specifying its parts and their interrelations. Visual priming is interpreted as a change in the connection weights for the activation of: a) cells, termed geon feature assemblies (GFAs), that conjoin the output of units that represent invariant, independent properties of a single geon and its relations (such as its type, aspect ratio, relations to other geons), or b) a change in the connection weights by which several GFAs activate a cell representing an object.

摘要

从现象学角度来看，人类形状识别似乎不会因深度方向的变化（直至部分遮挡）、视野中的位置以及大小而改变。模板理论的最新版本（例如，Ullman，1989；Lowe，1987）假定，这些不变性是通过应用诸如旋转、平移和缩放图像等变换来实现的，以便能够将其与存储的模板进行度量匹配。据推测，此类变换的执行需要时间。我们描述了最近的启动实验，其中评估了图像的先前简短呈现对其后续识别的影响。这些实验的结果表明，这种不变性是完全的：视觉启动的程度（与名称或基本水平概念启动不同）不受位置、大小、深度方向的变化或图像中存在的特定线条和顶点的影响，只要相同组件的表征能够被激活。文中描述了一个已实现的七层神经网络模型（Hummel和Biederman，1992），该模型捕捉了人类物体识别的这些基本特性。给定一个物体的线条图，该模型会激活该物体的视点不变结构描述，指定其部分及其相互关系。视觉启动被解释为激活以下内容的连接权重的变化：a）称为geon特征组件（GFA）的细胞，这些细胞结合了代表单个geon的不变、独立属性及其关系（如类型、纵横比、与其他geon的关系）的单元的输出，或者b）几个GFA激活代表物体的细胞的连接权重的变化。

相似文献

Metric invariance in object recognition: a review and further evidence.

Can J Psychol. 1992 Jun;46(2):191-214. doi: 10.1037/h0084317.

Priming contour-deleted images: evidence for intermediate representations in visual object recognition.

Cogn Psychol. 1991 Jul;23(3):393-419. doi: 10.1016/0010-0285(91)90014-f.

Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance.

J Exp Psychol Hum Percept Perform. 1993 Dec;19(6):1162-82. doi: 10.1037//0096-1523.19.6.1162.

Translational and reflectional priming invariance: a retrospective.

Perception. 2009;38(6):809-17. doi: 10.1068/pmkbie.

Testing conditions for viewpoint invariance in object recognition.

J Exp Psychol Hum Percept Perform. 1997 Oct;23(5):1511-21. doi: 10.1037//0096-1523.23.5.1511.

Neurocomputational bases of object and face recognition.

Philos Trans R Soc Lond B Biol Sci. 1997 Aug 29;352(1358):1203-19. doi: 10.1098/rstb.1997.0103.

Non-accidental properties, metric invariance, and encoding by neurons in a model of ventral stream visual object recognition, VisNet.

Neurobiol Learn Mem. 2018 Jul;152:20-31. doi: 10.1016/j.nlm.2018.04.017. Epub 2018 May 1.

Invariance of long-term visual priming to scale, reflection, translation, and hemisphere.

Vision Res. 2001 Jan 15;41(2):221-34. doi: 10.1016/s0042-6989(00)00234-0.

Is human object recognition better described by geon structural descriptions or by multiple views? Comment on Biederman and Gerhardstein (1993).

J Exp Psychol Hum Percept Perform. 1995 Dec;21(6):1494-505. doi: 10.1037//0096-1523.21.6.1494.

A Balanced Comparison of Object Invariances in Monkey IT Neurons.

eNeuro. 2017 Apr 13;4(2). doi: 10.1523/ENEURO.0333-16.2017. eCollection 2017 Mar-Apr.

引用本文的文献

An image-computable model of human visual shape similarity.

PLoS Comput Biol. 2021 Jun 1;17(6):e1008981. doi: 10.1371/journal.pcbi.1008981. eCollection 2021 Jun.

The human visual system and CNNs can both support robust online translation tolerance following extreme displacements.

J Vis. 2021 Feb 3;21(2):9. doi: 10.1167/jov.21.2.9.

Conflicting demands of abstract and specific visual object processing resolved by frontoparietal networks.

Cogn Affect Behav Neurosci. 2016 Jun;16(3):502-15. doi: 10.3758/s13415-016-0409-4.

Separability of abstract-category and specific-exemplar visual object subsystems: evidence from fMRI pattern analysis.

Brain Cogn. 2015 Feb;93:54-63. doi: 10.1016/j.bandc.2014.11.007. Epub 2014 Dec 18.

Local and global level-priming occurs for hierarchical stimuli composed of outlined, but not filled-in, elements.

J Vis. 2013 Feb 18;13(2):23. doi: 10.1167/13.2.23.

Changes in visual object recognition precede the shape bias in early noun learning.

Front Psychol. 2012 Dec 3;3:533. doi: 10.3389/fpsyg.2012.00533. eCollection 2012.

(In) sensitivity to spatial distortion in natural scenes.

J Vis. 2010 Feb 24;10(2):23.1-15. doi: 10.1167/10.2.23.

Developmental changes in visual object recognition between 18 and 24 months of age.

Dev Sci. 2009 Jan;12(1):67-80. doi: 10.1111/j.1467-7687.2008.00747.x.

Dissociable neural subsystems underlie visual working memory for abstract categories and specific exemplars.

Cogn Affect Behav Neurosci. 2008 Mar;8(1):17-24. doi: 10.3758/cabn.8.1.17.

Color and context: an ERP study on intrinsic and extrinsic feature binding in episodic memory.

Mem Cognit. 2007 Sep;35(6):1483-501. doi: 10.3758/bf03193618.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

物体识别中的度量不变性：综述及进一步证据

Metric invariance in object recognition: a review and further evidence.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献