Baker Nicholas, Garrigan Patrick, Phillips Austin, Kellman Philip J
Department of Psychology, Loyola University Chicago, Chicago, IL, United States.
Department of Psychology, Saint Joseph's University, Philadelphia, PA, United States.
Front Artif Intell. 2023 Mar 1;5:961595. doi: 10.3389/frai.2022.961595. eCollection 2022.
Deep convolutional neural networks (DCNNs) have attracted considerable interest as useful devices and as possible windows into understanding perception and cognition in biological systems. In earlier work, we showed that DCNNs differ dramatically from human perceivers in that they have no sensitivity to global object shape. Here, we investigated whether those findings are symptomatic of broader limitations of DCNNs regarding the use of relations. We tested learning and generalization of DCNNs (AlexNet and ResNet-50) for several relations involving objects. One involved classifying two shapes in an otherwise empty field as same or different. Another involved enclosure. Every display contained a closed figure among contour noise fragments and one dot; correct responding depended on whether the dot was inside or outside the figure. The third relation we tested involved a classification that depended on which of two polygons had more sides. One polygon always contained a dot, and correct classification of each display depended on whether the polygon with the dot had a greater number of sides. We used DCNNs that had been trained on the ImageNet database, and we used both restricted and unrestricted transfer learning (connection weights at all layers could change with training). For the same-different experiment, there was little restricted transfer learning (82.2%). Generalization tests showed near chance performance for new shapes. Results for enclosure were at chance for restricted transfer learning and somewhat better for unrestricted (74%). Generalization with two new kinds of shapes showed reduced but above-chance performance (≈66%). Follow-up studies indicated that the networks did not access the enclosure relation in their responses. For the relation of more or fewer sides of polygons, DCNNs showed successful learning with polygons having 3-5 sides under unrestricted transfer learning, but showed chance performance in generalization tests with polygons having 6-10 sides. Experiments with human observers showed learning from relatively few examples of all of the relations tested and complete generalization of relational learning to new stimuli. These results using several different relations suggest that DCNNs have crucial limitations that derive from their lack of computations involving abstraction and relational processing of the sort that are fundamental in human perception.
深度卷积神经网络(DCNNs)作为一种有用的工具以及理解生物系统感知和认知的潜在窗口,已经引起了广泛关注。在早期的研究中,我们发现DCNNs与人类感知者存在显著差异,即它们对全局物体形状不敏感。在此,我们研究了这些发现是否表明DCNNs在关系运用方面存在更广泛的局限性。我们测试了DCNNs(AlexNet和ResNet - 50)对涉及物体的几种关系的学习和泛化能力。一种关系是在其他部分为空的区域中,对两个形状进行相同或不同的分类。另一种关系是包含关系。每个显示屏在轮廓噪声片段中包含一个封闭图形和一个点;正确的反应取决于点是在图形内部还是外部。我们测试的第三种关系是一种分类,它取决于两个多边形中哪个边更多。其中一个多边形总是包含一个点,每个显示屏的正确分类取决于带有点的多边形是否边数更多。我们使用在ImageNet数据库上训练过的DCNNs,并采用了受限和无受限的迁移学习(所有层的连接权重都可以随着训练而改变)。对于相同 - 不同实验,受限迁移学习的效果不佳(82.2%)。泛化测试表明,对于新形状的表现接近随机水平。包含关系的实验中,受限迁移学习的结果处于随机水平,无受限迁移学习的结果稍好一些(74%)。使用两种新形状进行泛化时,表现有所下降但仍高于随机水平(约66%)。后续研究表明,网络在反应中并未利用包含关系。对于多边形边数多少的关系,在无受限迁移学习下,DCNNs对边数为3 - 5的多边形表现出成功的学习,但在对边数为6 - 10的多边形进行泛化测试时,表现为随机水平。对人类观察者进行的实验表明,可以从相对较少的测试关系示例中进行学习,并将关系学习完全泛化到新刺激上。这些使用几种不同关系的结果表明,DCNNs存在关键局限性,这源于它们缺乏涉及人类感知中基本的抽象和关系处理的计算。