Tian Moqian, Yamins Daniel, Grill-Spector Kalanit
J Vis. 2016 May 1;16(7):7. doi: 10.1167/16.7.7.
Humans can learn to recognize new objects just from observing example views. However, it is unknown what structural information enables this learning. To address this question, we manipulated the amount of structural information given to subjects during unsupervised learning by varying the format of the trained views. We then tested how format affected participants' ability to discriminate similar objects across views that were rotated 90° apart. We found that, after training, participants' performance increased and generalized to new views in the same format. Surprisingly, the improvement was similar across line drawings, shape from shading, and shape from shading + stereo even though the latter two formats provide richer depth information compared to line drawings. In contrast, participants' improvement was significantly lower when training used silhouettes, suggesting that silhouettes do not have enough information to generate a robust 3-D structure. To test whether the learned object representations were format-specific or format-invariant, we examined if learning novel objects from example views transfers across formats. We found that learning objects from example line drawings transferred to shape from shading and vice versa. These results have important implications for theories of object recognition because they suggest that (a) learning the 3-D structure of objects does not require rich structural cues during training as long as shape information of internal and external features is provided and (b) learning generates shape-based object representations independent of the training format.
人类仅通过观察示例视图就能学会识别新物体。然而,尚不清楚是什么结构信息促成了这种学习。为了解决这个问题,我们在无监督学习过程中,通过改变训练视图的格式来操纵给予受试者的结构信息量。然后,我们测试了格式如何影响参与者辨别相隔90°旋转的不同视图中相似物体的能力。我们发现,训练后,参与者的表现有所提高,并且能推广到相同格式的新视图。令人惊讶的是,尽管与线条图相比,后两种格式(从阴影中提取形状以及从阴影+立体中提取形状)提供了更丰富的深度信息,但在线条图、从阴影中提取形状以及从阴影+立体中提取形状这几种情况下,参与者的表现提升相似。相比之下,当训练使用轮廓时,参与者的表现提升显著较低,这表明轮廓没有足够的信息来生成强大的三维结构。为了测试所学的物体表征是特定于格式还是格式不变的,我们研究了从示例视图中学习新物体是否能跨格式迁移。我们发现,从示例线条图中学习物体可以迁移到从阴影中提取形状,反之亦然。这些结果对物体识别理论具有重要意义,因为它们表明:(a)只要提供了内部和外部特征的形状信息,在训练过程中学习物体的三维结构并不需要丰富的结构线索;(b)学习生成基于形状的物体表征,且与训练格式无关。