深度卷积神经网络能否支持相同-不同任务中的关系推理？

Can deep convolutional neural networks support relational reasoning in the same-different task?

机构信息

School of Psychological Science, University of Bristol, UK.

出版信息

J Vis. 2022 Sep 2;22(10):11. doi: 10.1167/jov.22.10.11.

DOI:10.1167/jov.22.10.11

PMID:36094524

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9482325/

Abstract

Same-different visual reasoning is a basic skill central to abstract combinatorial thought. This fact has lead neural networks researchers to test same-different classification on deep convolutional neural networks (DCNNs), which has resulted in a controversy regarding whether this skill is within the capacity of these models. However, most tests of same-different classification rely on testing on images that come from the same pixel-level distribution as the training images, yielding the results inconclusive. In this study, we tested relational same-different reasoning in DCNNs. In a series of simulations we show that models based on the ResNet architecture are capable of visual same-different classification, but only when the test images are similar to the training images at the pixel level. In contrast, when there is a shift in the testing distribution that does not change the relation between the objects in the image, the performance of DCNNs decreases substantially. This finding is true even when the DCNNs' training regime is expanded to include images taken from a wide range of different pixel-level distributions or when the model is trained on the testing distribution but on a different task in a multitask learning context. Furthermore, we show that the relation network, a deep learning architecture specifically designed to tackle visual relational reasoning problems, suffers the same kind of limitations. Overall, the results of this study suggest that learning same-different relations is beyond the scope of current DCNNs.

摘要

相同-不同视觉推理是抽象组合思维的基本技能。这一事实促使神经网络研究人员在深度卷积神经网络（DCNN）上测试相同-不同分类，这导致了关于这些模型是否具备这种能力的争议。然而，大多数相同-不同分类的测试都依赖于测试与训练图像来自相同像素级分布的图像，导致结果不确定。在这项研究中，我们在 DCNN 中测试了关系相同-不同推理。在一系列模拟中，我们表明基于 ResNet 架构的模型能够进行视觉相同-不同分类，但仅当测试图像在像素级别与训练图像相似时。相比之下，当测试分布发生变化但不改变图像中对象之间的关系时，DCNN 的性能会大幅下降。即使 DCNN 的训练方案扩展到包括来自广泛不同像素级分布的图像，或者模型在多任务学习环境中针对测试分布进行训练但针对不同任务进行训练时，这种发现仍然成立。此外，我们表明，关系网络，一种专门用于解决视觉关系推理问题的深度学习架构，也受到相同类型的限制。总体而言，这项研究的结果表明，学习相同-不同关系超出了当前 DCNN 的范围。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c49e/9482325/d576309b390c/jovi-22-10-11-f001.jpg

相似文献

Can deep convolutional neural networks support relational reasoning in the same-different task?

J Vis. 2022 Sep 2;22(10):11. doi: 10.1167/jov.22.10.11.

Configural relations in humans and deep convolutional neural networks.

Front Artif Intell. 2023 Mar 1;5:961595. doi: 10.3389/frai.2022.961595. eCollection 2022.

Local features and global shape information in object classification by deep convolutional neural networks.

Vision Res. 2020 Jul;172:46-61. doi: 10.1016/j.visres.2020.04.003. Epub 2020 May 12.

Improved object recognition using neural networks trained to mimic the brain's statistical properties.

Neural Netw. 2020 Nov;131:103-114. doi: 10.1016/j.neunet.2020.07.013. Epub 2020 Jul 29.

Dermatologist-level classification of malignant lip diseases using a deep convolutional neural network.

Br J Dermatol. 2020 Jun;182(6):1388-1394. doi: 10.1111/bjd.18459. Epub 2019 Nov 19.

Deep convolutional networks do not classify based on global object shape.

PLoS Comput Biol. 2018 Dec 7;14(12):e1006613. doi: 10.1371/journal.pcbi.1006613. eCollection 2018 Dec.

Face Recognition Depends on Specialized Mechanisms Tuned to View-Invariant Facial Features: Insights from Deep Neural Networks Optimized for Face or Object Recognition.

Cogn Sci. 2021 Sep;45(9):e13031. doi: 10.1111/cogs.13031.

Towards pixel-to-pixel deep nucleus detection in microscopy images.

BMC Bioinformatics. 2019 Sep 14;20(1):472. doi: 10.1186/s12859-019-3037-5.

Using Deep Convolutional Neural Networks for Image-Based Diagnosis of Nutrient Deficiencies in Rice.

Comput Intell Neurosci. 2020 Sep 9;2020:7307252. doi: 10.1155/2020/7307252. eCollection 2020.

Medical image classification using synergic deep learning.

Med Image Anal. 2019 May;54:10-19. doi: 10.1016/j.media.2019.02.010. Epub 2019 Feb 18.

引用本文的文献

A feedforward mechanism for human-like contour integration.

PLoS Comput Biol. 2025 Aug 18;21(8):e1013391. doi: 10.1371/journal.pcbi.1013391. eCollection 2025 Aug.

Mitigating data bias and ensuring reliable evaluation of AI models with shortcut hull learning.

Nat Commun. 2025 Jul 1;16(1):5513. doi: 10.1038/s41467-025-60801-6.

Visual homogeneity computations in the brain enable solving property-based visual tasks.

Elife. 2025 Feb 18;13:RP93033. doi: 10.7554/eLife.93033.

A Benchmark for Compositional Visual Reasoning.

Adv Neural Inf Process Syst. 2022 Dec;35(DB):29776-29788.

A brain-inspired object-based attention network for multiobject recognition and visual reasoning.

J Vis. 2023 May 2;23(5):16. doi: 10.1167/jov.23.5.16.

SpatialSim: Recognizing Spatial Configurations of Objects With Graph Neural Networks.

Front Artif Intell. 2022 Jan 26;4:782081. doi: 10.3389/frai.2021.782081. eCollection 2021.

本文引用的文献

Understanding the Computational Demands Underlying Visual Reasoning.

Neural Comput. 2022 Apr 15;34(5):1075-1099. doi: 10.1162/neco_a_01485.

Evaluating the progress of deep learning for visual relational concepts.

J Vis. 2021 Oct 5;21(11):8. doi: 10.1167/jov.21.11.8.

Five points to check when comparing visual perception in humans and machines.

J Vis. 2021 Mar 1;21(3):16. doi: 10.1167/jov.21.3.16.

Individual differences among deep neural network models.

Nat Commun. 2020 Nov 12;11(1):5725. doi: 10.1038/s41467-020-19632-w.

Not-So-CLEVR: learning same-different relations strains feedforward neural networks.

Interface Focus. 2018 Aug 6;8(4):20180011. doi: 10.1098/rsfs.2018.0011. Epub 2018 Jun 15.

Prelinguistic Relational Concepts: Investigating Analogical Processing in Infants.

Child Dev. 2015 Sep-Oct;86(5):1386-405. doi: 10.1111/cdev.12381. Epub 2015 May 20.

Comparing machines and humans on a visual categorization test.

Proc Natl Acad Sci U S A. 2011 Oct 25;108(43):17621-5. doi: 10.1073/pnas.1109168108. Epub 2011 Oct 17.

Darwin's mistake: explaining the discontinuity between human and nonhuman minds.

Behav Brain Sci. 2008 Apr;31(2):109-30; discussion 130-178. doi: 10.1017/S0140525X08003543.

Index for rating diagnostic tests.

Cancer. 1950 Jan;3(1):32-5. doi: 10.1002/1097-0142(1950)3:1<32::aid-cncr2820030106>3.0.co;2-3.

The parallel distributed processing approach to semantic cognition.

Nat Rev Neurosci. 2003 Apr;4(4):310-22. doi: 10.1038/nrn1076.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

深度卷积神经网络能否支持相同-不同任务中的关系推理？

Can deep convolutional neural networks support relational reasoning in the same-different task?

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献