Baker Nicholas, Elder James H
Department of Psychology, Loyola University of Chicago, Chicago, IL 60660, USA.
Centre for Vision Research, York University, Toronto, ON M3J 1P3, Canada.
iScience. 2022 Aug 11;25(9):104913. doi: 10.1016/j.isci.2022.104913. eCollection 2022 Sep 16.
A hallmark of human object perception is sensitivity to the holistic configuration of the local shape features of an object. Deep convolutional neural networks (DCNNs) are currently the dominant models for object recognition processing in the visual cortex, but do they capture this configural sensitivity? To answer this question, we employed a dataset of animal silhouettes and created a variant of this dataset that disrupts the configuration of each object while preserving local features. While human performance was impacted by this manipulation, DCNN performance was not, indicating insensitivity to object configuration. Modifications to training and architecture to make networks more brain-like did not lead to configural processing, and none of the networks were able to accurately predict trial-by-trial human object judgements. We speculate that to match human configural sensitivity, networks must be trained to solve a broader range of object tasks beyond category recognition.
人类物体感知的一个标志是对物体局部形状特征的整体构型敏感。深度卷积神经网络(DCNN)是目前视觉皮层中物体识别处理的主导模型,但它们是否捕捉到了这种构型敏感性呢?为了回答这个问题,我们使用了一个动物剪影数据集,并创建了该数据集的一个变体,在保留局部特征的同时打乱每个物体的构型。虽然这种操作影响了人类的表现,但DCNN的表现却没有,这表明它对物体构型不敏感。对训练和架构进行修改以使网络更像大脑,并没有导致构型处理,而且没有一个网络能够准确预测逐次试验的人类物体判断。我们推测,要匹配人类的构型敏感性,网络必须经过训练以解决类别识别之外更广泛的物体任务。