Balas Benjamin J, Sinha Pawan
Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
Atten Percept Psychophys. 2009 May;71(4):712-23. doi: 10.3758/APP.71.4.712.
Objects are best recognized from so-called "canonical" views. The characteristics of canonical views of arbitrary objects have been qualitatively described using a variety of different criteria, but little is known regarding how these views might be acquired during object learning. We address this issue, in part, by examining the role of object motion in the selection of preferred views of novel objects. Specifically, we adopt a modeling approach to investigate whether or not the sequence of views seen during initial exposure to an object contributes to observers' preferences for particular images in the sequence. In two experiments, we exposed observers to short sequences depicting rigidly rotating novel objects and subsequently collected subjective ratings of view canonicality (Experiment 1) and recall rates for individual views (Experiment 2). Given these two operational definitions of view canonicality, we attempted to fit both sets of behavioral data with a computational model incorporating 3-D shape information (object foreshortening), as well as information relevant to the temporal order of views presented during training (the rate of change for object foreshortening). Both sets of ratings were reasonably well predicted using only 3-D shape; the inclusion of terms that capture sequence order improved model performance significantly.
物体最好从所谓的“典型”视角被识别。任意物体的典型视角特征已通过各种不同标准进行了定性描述,但对于这些视角在物体学习过程中是如何获得的却知之甚少。我们通过研究物体运动在新物体偏好视角选择中的作用,部分地解决了这个问题。具体而言,我们采用一种建模方法来探究在首次接触物体时所看到的视角序列是否会影响观察者对序列中特定图像的偏好。在两个实验中,我们让观察者观看描绘刚性旋转新物体的短序列,随后收集视角典型性的主观评分(实验1)以及各个视角的召回率(实验2)。基于视角典型性的这两种操作定义,我们试图用一个包含三维形状信息(物体缩短)以及与训练期间呈现的视角时间顺序相关信息(物体缩短变化率)的计算模型来拟合这两组行为数据。仅使用三维形状就能对两组评分进行合理预测;纳入捕捉序列顺序的项显著提高了模型性能。