Bülthoff H H, Edelman S Y, Tarr M J
Max-Planck-Institut für biologische Kybernetik, Tübingen, Germany.
Cereb Cortex. 1995 May-Jun;5(3):247-60. doi: 10.1093/cercor/5.3.247.
In this report we discuss a variety of psychophysical experiments that explore different aspects of the problem of object recognition and representation in human vision. In all experiments, subjects were presented with realistically rendered images of computer-generated 3D objects, with tight control over stimulus shape, surface properties, illumination, and viewpoint, as well as subjects' prior exposure to the stimulus objects. Contrary to the predictions of the paradigmatic theory of recognition, which holds that object representations are viewpoint invariant, performance in all experiments was consistently viewpoint dependent, was only partially aided by binocular stereo and other depth information, was specific to viewpoints that were familiar, and was systematically disrupted by rotation in depth more than by deforming the 2D images of the stimuli. The emerging concept of multiple-views representation supported by these results is consistent with recently advanced computational theories of recognition based on view interpolation. Moreover, in several simulated experiments employing the same stimuli used in experiments with human subjects, models based on multiple-views representations replicated many of the psychophysical results concerning the observed pattern of human performance.
在本报告中,我们讨论了各种心理物理学实验,这些实验探索了人类视觉中物体识别与表征问题的不同方面。在所有实验中,向受试者呈现计算机生成的3D物体的逼真渲染图像,对刺激形状、表面属性、光照和视角以及受试者对刺激物体的先前接触进行严格控制。与识别的范式理论的预测相反,该理论认为物体表征是视角不变的,所有实验中的表现始终依赖于视角,仅部分地得到双眼立体视觉和其他深度信息的辅助,特定于熟悉的视角,并且与通过使刺激的二维图像变形相比,深度旋转对其造成的系统性干扰更大。这些结果所支持的多视图表征这一新兴概念与最近基于视图插值的先进识别计算理论相一致。此外,在几个使用与人类受试者实验中相同刺激的模拟实验中,基于多视图表征的模型复制了许多关于观察到的人类表现模式的心理物理学结果。