Yue Xiaomin, Biederman Irving, Mangini Michael C, Malsburg Christoph von der, Amir Ori
Martinos Center for Biomedical Imaging, Massachusetts General Hospital, 149 13th Street, Suite 2301, Charlestown, MA 02129, USA.
Vision Res. 2012 Feb 15;55:41-6. doi: 10.1016/j.visres.2011.12.012. Epub 2012 Jan 5.
Shape representation is accomplished by a series of cortical stages. Cells in the first stage (V1) have local receptive fields tuned to contrast at a particular scale and orientation, each well modeled as a Gabor filter. In succeeding stages, the representation becomes largely invariant to Gabor coding (Kobatake & Tanaka, 1994). Because these later stages, with their non-Gabor tuning, must be engaged for a behavioral response (Tong, 2003; Tong et al., 1998), a V1-based measure of shape similarity built on Gabor filtering would not be expected to correlate highly with human performance in discriminating complex shapes (faces and teeth-like blobs) that differ metrically on a two-choice, match-to-sample task. Here we show that human performance is highly correlated with Gabor-based image measures (Gabor simple and complex cells), with values often in the mid 0.90s, even without discounting the variability in the speed and accuracy of performance not associated with the similarity of the distractors. This high correlation is generally maintained through the stages of HMAX, a model that builds upon the Gabor metric and develops units for complex features and larger receptive fields. This is the first report of the psychophysical similarity of complex shapes being predictable from a biologically motivated, physical measure of similarity. As accurate as these measures were in accounting for metric variation, a simple demonstration showed that all were insensitive to viewpoint-invariant (nonaccidental) differences in shape.
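As an illustration only, a Gabor-based image dissimilarity of the general kind described in the abstract might be sketched as follows. The function names, the filter parameters (wavelengths, number of orientations, envelope width), the use of response magnitudes as a stand-in for complex cells, and the Euclidean distance are all assumptions made for this sketch; they are not the settings or the exact measure used in the study.

```python
import numpy as np


def gabor_kernel(size, wavelength, theta, sigma):
    """Complex Gabor kernel: a plane wave at angle theta under a Gaussian envelope."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    x_rot = x * np.cos(theta) + y * np.sin(theta)
    envelope = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2))
    carrier = np.exp(1j * 2.0 * np.pi * x_rot / wavelength)
    kernel = envelope * carrier
    # Make the kernel sum to zero so that uniform regions give no response.
    kernel -= kernel.mean()
    return kernel


def gabor_magnitudes(image, wavelengths=(4.0, 8.0), n_orientations=8):
    """Response magnitudes at every pixel, scale and orientation, as one vector.

    The parameter values are illustrative assumptions. Filtering is done as a
    circular convolution via the FFT, which is adequate for a sketch.
    """
    image = np.asarray(image, dtype=float)
    image_f = np.fft.fft2(image)
    responses = []
    for wavelength in wavelengths:
        size = int(4 * wavelength) | 1  # force an odd kernel width
        if min(image.shape) < size:
            raise ValueError("image is smaller than the Gabor kernel")
        for k in range(n_orientations):
            theta = np.pi * k / n_orientations
            kernel = gabor_kernel(size, wavelength, theta, sigma=0.5 * wavelength)
            kernel_f = np.fft.fft2(kernel, s=image.shape)
            filtered = np.fft.ifft2(image_f * kernel_f)
            # The magnitude of the complex response discards local phase,
            # loosely analogous to a complex cell.
            responses.append(np.abs(filtered).ravel())
    return np.concatenate(responses)


def gabor_dissimilarity(image_a, image_b):
    """Euclidean distance between the Gabor magnitude vectors of two images."""
    diff = gabor_magnitudes(image_a) - gabor_magnitudes(image_b)
    return float(np.linalg.norm(diff))
```

Under these assumptions, one dissimilarity value would be computed per trial between the matching stimulus and the distractor, and the resulting values could then be correlated with error rates or reaction times, for example with `np.corrcoef`.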