Deep convolutional networks do not classify based on global object shape.

Affiliations

Department of Psychology, University of California, Los Angeles, Los Angeles, California, United States of America.

University of Nevada, Reno, Nevada, United States of America.

Publication information

PLoS Comput Biol. 2018 Dec 7;14(12):e1006613. doi: 10.1371/journal.pcbi.1006613. eCollection 2018 Dec.

DOI:10.1371/journal.pcbi.1006613

PMID:30532273

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6306249/

Abstract

Deep convolutional networks (DCNNs) are achieving previously unseen performance in object classification, raising questions about whether DCNNs operate similarly to human vision. In biological vision, shape is arguably the most important cue for recognition. We tested the role of shape information in DCNNs trained to recognize objects. In Experiment 1, we presented a trained DCNN with object silhouettes that preserved overall shape but were filled with surface texture taken from other objects. Shape cues appeared to play some role in the classification of artifacts, but little or none for animals. In Experiments 2-4, DCNNs showed no ability to classify glass figurines or outlines but correctly classified some silhouettes. Aspects of these results led us to hypothesize that DCNNs do not distinguish an object's bounding contours from other edges, and that DCNNs access some local shape features, but not global shape. In Experiment 5, we tested this hypothesis with displays that preserved local features but disrupted global shape, and vice versa. With disrupted global shape, which reduced human accuracy to 28%, DCNNs gave the same classification labels as with ordinary shapes. Conversely, local contour changes eliminated accurate DCNN classification but caused no difficulty for human observers. These results provide evidence that DCNNs have access to some local shape information in the form of local edge relations, but they have no access to global object shapes.
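As a rough illustration of the Experiment 1 manipulation (a silhouette that keeps one object's overall shape while carrying surface texture from a different object), the sketch below fills a binary silhouette mask with pixels from a texture image. It is not the authors' stimulus pipeline: the function name `texture_filled_silhouette`, the white background value, and the toy arrays are all assumptions made for illustration.

```python
import numpy as np


def texture_filled_silhouette(mask, texture, background=255):
    """Keep texture pixels inside the silhouette; paint everything else as background.

    mask:    2-D boolean array, True inside the object's silhouette (hypothetical input).
    texture: image array with the same height and width, grayscale or with channels.
    """
    mask = np.asarray(mask, dtype=bool)
    texture = np.asarray(texture)
    if texture.shape[:2] != mask.shape:
        raise ValueError("mask and texture must share height and width")
    out = np.full_like(texture, background)  # start from a uniform background
    out[mask] = texture[mask]                # copy texture only inside the shape
    return out


# Toy example: a 4x4 square "silhouette" in a 6x6 frame, filled with vertical stripes.
mask = np.zeros((6, 6), dtype=bool)
mask[1:5, 1:5] = True
texture = np.tile(np.array([0, 128], dtype=np.uint8), (6, 3))
stimulus = texture_filled_silhouette(mask, texture)
```

In a real shape-texture conflict stimulus, the mask would come from a segmented photograph of one object and the texture from a photograph of another, so that shape and texture point to different class labels.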

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60af/6306249/8fbbf4e1c799/pcbi.1006613.g001.jpg
