Department of Electrical Engineering, Shahid Rajaee Teacher Training University, Tehran, Iran.
Cognitive Science Research lab., Department of Computer Engineering, Shahid Rajaee Teacher Training University, Tehran, Iran.
Sci Rep. 2017 Oct 31;7(1):14402. doi: 10.1038/s41598-017-13756-8.
One key ability of human brain is invariant object recognition, which refers to rapid and accurate recognition of objects in the presence of variations such as size, rotation and position. Despite decades of research into the topic, it remains unknown how the brain constructs invariant representations of objects. Providing brain-plausible object representations and reaching human-level accuracy in recognition, hierarchical models of human vision have suggested that, human brain implements similar feed-forward operations to obtain invariant representations. However, conducting two psychophysical object recognition experiments on humans with systematically controlled variations of objects, we observed that humans relied on specific (diagnostic) object regions for accurate recognition which remained relatively consistent (invariant) across variations; but feed-forward feature-extraction models selected view-specific (non-invariant) features across variations. This suggests that models can develop different strategies, but reach human-level recognition performance. Moreover, human individuals largely disagreed on their diagnostic features and flexibly shifted their feature extraction strategy from view-invariant to view-specific when objects became more similar. This implies that, even in rapid object recognition, rather than a set of feed-forward mechanisms which extract diagnostic features from objects in a hard-wired fashion, the bottom-up visual pathways receive, through top-down connections, task-related information possibly processed in prefrontal cortex.
人类大脑的一个关键能力是不变物体识别,即指在大小、旋转和位置等变化的情况下,对物体进行快速而准确的识别。尽管对这个主题进行了几十年的研究,但仍然不清楚大脑如何构建物体的不变表示。人类视觉的分层模型提供了具有大脑似然的物体表示,并在识别方面达到了人类水平的准确性,它们表明,人类大脑通过类似的前馈操作来获得不变的表示。然而,我们通过对人类进行了两个具有系统控制的物体变化的心理物理学物体识别实验,观察到人类依赖于特定的(诊断性的)物体区域进行准确识别,这些区域在变化中保持相对一致(不变);但是前馈特征提取模型在变化中选择了特定视图的(非不变)特征。这表明模型可以采用不同的策略,但可以达到人类水平的识别性能。此外,人类个体在他们的诊断特征上存在很大的分歧,并在物体变得更加相似时,灵活地将他们的特征提取策略从视图不变转换为视图特定。这意味着,即使在快速的物体识别中,也不是一组通过前馈机制以硬连线的方式从物体中提取诊断特征,而是通过自上而下的连接,从底部向上的视觉通路接收可能在前额叶皮层中处理的与任务相关的信息。