Jozwik Kamila M, Kriegeskorte Nikolaus, Storrs Katherine R, Mur Marieke
Neural Dynamics of Visual Cognition, Department of Education and Psychology, Free University of Berlin, Berlin, Germany.
Memory and Perception Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom.
Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.
Recent advances in deep convolutional neural networks (DNNs) have enabled unprecedentedly accurate computational models of brain representations, and present an exciting opportunity to model diverse cognitive functions. State-of-the-art DNNs achieve human-level performance on object categorization, but it is unclear how well they capture human behavior on complex cognitive tasks. Recent reports suggest that DNNs can explain significant variance in one such task, judging object similarity. Here, we extend these findings by replicating them for a rich set of object images, comparing performance across layers within two DNNs of different depths, and examining how the DNNs' performance compares to that of non-computational "conceptual" models. Human observers performed similarity judgments for a set of 92 images of real-world objects. Representations of the same images were obtained in each of the layers of two DNNs of different depths (8-layer AlexNet and 16-layer VGG-16). To create conceptual models, other human observers generated visual-feature labels (e.g., "eye") and category labels (e.g., "animal") for the same image set. Feature labels were divided into parts, colors, textures, and contours, while category labels were divided into subordinate, basic, and superordinate categories. We fitted models derived from the features, categories, and from each layer of each DNN to the similarity judgments, using representational similarity analysis to evaluate model performance. In both DNNs, similarity within the last layer explains most of the explainable variance in human similarity judgments. The last layer outperforms almost all feature-based models. Late and mid-level layers outperform some but not all feature-based models. Importantly, categorical models predict similarity judgments significantly better than any DNN layer. Our results provide further evidence for commonalities between DNNs and brain representations.
Models derived from visual features other than object parts perform relatively poorly, perhaps because DNNs more comprehensively capture the colors, textures and contours which matter to human object perception. However, categorical models outperform DNNs, suggesting that further work may be needed to bring high-level semantic representations in DNNs closer to those extracted by humans. Modern DNNs explain similarity judgments remarkably well considering they were not trained on this task, and are promising models for many aspects of human cognition.
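The model comparison described above rests on representational similarity analysis: each candidate model (a DNN layer, a feature-based model, or a categorical model) yields a representational dissimilarity matrix (RDM) over the 92 images, which is then compared against the RDM built from human similarity judgments. A minimal sketch of that evaluation step, using random arrays as stand-ins for the real activations and judgments (the actual study fitted weighted combinations of model RDMs; the simple rank correlation below is only the core comparison):

```python
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

def rdm(features):
    """Representational dissimilarity matrix: pairwise correlation
    distance between item feature vectors, returned as a condensed
    vector (the upper triangle of the square RDM)."""
    return pdist(features, metric="correlation")

rng = np.random.default_rng(0)
n_images = 92  # size of the stimulus set in the study

# Stand-ins (hypothetical data): human dissimilarity judgments as a
# condensed vector, and one DNN layer's activations for each image.
judged = rng.random(n_images * (n_images - 1) // 2)
layer_features = rng.random((n_images, 4096))

# Model performance: rank correlation between the model RDM and the
# human-judgment RDM (higher = model better predicts the judgments).
rho, _ = spearmanr(rdm(layer_features), judged)
print(f"Spearman rho: {rho:.3f}")
```

In practice each model's correlation is compared against a noise ceiling estimated from inter-observer reliability, which is what lets the study say the last DNN layer explains "most of the explainable variance" while still falling short of the categorical models.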