• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在解释物体相似性判断方面,深度卷积神经网络的表现优于基于特征的模型,但不优于分类模型。

Deep Convolutional Neural Networks Outperform Feature-Based But Not Categorical Models in Explaining Object Similarity Judgments.

作者信息

Jozwik Kamila M, Kriegeskorte Nikolaus, Storrs Katherine R, Mur Marieke

机构信息

Neural Dynamics of Visual Cognition, Department of Education and Psychology, Free University of Berlin, Berlin, Germany.

Memory and Perception Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom.

出版信息

Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.

DOI:10.3389/fpsyg.2017.01726
PMID:29062291
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5640771/
Abstract

Recent advances in Deep convolutional Neural Networks (DNNs) have enabled unprecedentedly accurate computational models of brain representations, and present an exciting opportunity to model diverse cognitive functions. State-of-the-art DNNs achieve human-level performance on object categorisation, but it is unclear how well they capture human behavior on complex cognitive tasks. Recent reports suggest that DNNs can explain significant variance in one such task, judging object similarity. Here, we extend these findings by replicating them for a rich set of object images, comparing performance across layers within two DNNs of different depths, and examining how the DNNs' performance compares to that of non-computational "conceptual" models. Human observers performed similarity judgments for a set of 92 images of real-world objects. Representations of the same images were obtained in each of the layers of two DNNs of different depths (8-layer AlexNet and 16-layer VGG-16). To create conceptual models, other human observers generated visual-feature labels (e.g., "eye") and category labels (e.g., "animal") for the same image set. Feature labels were divided into parts, colors, textures and contours, while category labels were divided into subordinate, basic, and superordinate categories. We fitted models derived from the features, categories, and from each layer of each DNN to the similarity judgments, using representational similarity analysis to evaluate model performance. In both DNNs, similarity within the last layer explains most of the explainable variance in human similarity judgments. The last layer outperforms almost all feature-based models. Late and mid-level layers outperform some but not all feature-based models. Importantly, categorical models predict similarity judgments significantly better than any DNN layer. Our results provide further evidence for commonalities between DNNs and brain representations. Models derived from visual features other than object parts perform relatively poorly, perhaps because DNNs more comprehensively capture the colors, textures and contours which matter to human object perception. However, categorical models outperform DNNs, suggesting that further work may be needed to bring high-level semantic representations in DNNs closer to those extracted by humans. Modern DNNs explain similarity judgments remarkably well considering they were not trained on this task, and are promising models for many aspects of human cognition.

摘要

深度卷积神经网络(DNN)的最新进展已经实现了前所未有的精确大脑表征计算模型,并为模拟多种认知功能提供了一个令人兴奋的机会。最先进的DNN在物体分类方面达到了人类水平的性能,但它们在复杂认知任务中对人类行为的捕捉程度尚不清楚。最近的报告表明,DNN可以解释一项此类任务(判断物体相似性)中的显著差异。在这里,我们通过对一组丰富的物体图像重复这些发现、比较两个不同深度的DNN各层之间的性能以及研究DNN的性能与非计算性“概念性”模型的性能对比,来扩展这些发现。人类观察者对一组92张真实世界物体的图像进行相似性判断。在两个不同深度的DNN(8层AlexNet和16层VGG - 16)的每一层中获取相同图像的表征。为了创建概念性模型,其他人类观察者为同一图像集生成视觉特征标签(例如“眼睛”)和类别标签(例如“动物”)。特征标签分为部分、颜色、纹理和轮廓,而类别标签分为从属、基本和上级类别。我们使用表征相似性分析来评估模型性能,将从特征、类别以及每个DNN的每一层导出的模型拟合到相似性判断上。在两个DNN中,最后一层内的相似性解释了人类相似性判断中大部分可解释的差异。最后一层的表现优于几乎所有基于特征的模型。中晚期层的表现优于一些但并非所有基于特征的模型。重要的是,类别模型对相似性判断的预测明显优于任何DNN层。我们的结果为DNN与大脑表征之间的共性提供了进一步的证据。从物体部分以外的视觉特征导出的模型表现相对较差,可能是因为DNN更全面地捕捉了对人类物体感知重要的颜色、纹理和轮廓。然而,类别模型的表现优于DNN,这表明可能需要进一步的工作来使DNN中的高级语义表征更接近人类提取的表征。考虑到现代DNN并非针对此任务进行训练,它们对相似性判断的解释非常出色,并且是人类认知许多方面的有前途的模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/fab701484027/fpsyg-08-01726-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/e7a6f428a385/fpsyg-08-01726-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/cb725bfdf7b4/fpsyg-08-01726-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/6b20d9889b35/fpsyg-08-01726-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/9fc00549a977/fpsyg-08-01726-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/401e73384999/fpsyg-08-01726-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/7d7faac051e3/fpsyg-08-01726-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/303c7bfccbe6/fpsyg-08-01726-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/8e920aa62713/fpsyg-08-01726-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/a452a5051e46/fpsyg-08-01726-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/4cff18ab50b9/fpsyg-08-01726-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/fab701484027/fpsyg-08-01726-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/e7a6f428a385/fpsyg-08-01726-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/cb725bfdf7b4/fpsyg-08-01726-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/6b20d9889b35/fpsyg-08-01726-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/9fc00549a977/fpsyg-08-01726-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/401e73384999/fpsyg-08-01726-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/7d7faac051e3/fpsyg-08-01726-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/303c7bfccbe6/fpsyg-08-01726-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/8e920aa62713/fpsyg-08-01726-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/a452a5051e46/fpsyg-08-01726-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/4cff18ab50b9/fpsyg-08-01726-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abce/5640771/fab701484027/fpsyg-08-01726-g011.jpg

相似文献

1
Deep Convolutional Neural Networks Outperform Feature-Based But Not Categorical Models in Explaining Object Similarity Judgments.在解释物体相似性判断方面,深度卷积神经网络的表现优于基于特征的模型,但不优于分类模型。
Front Psychol. 2017 Oct 9;8:1726. doi: 10.3389/fpsyg.2017.01726. eCollection 2017.
2
Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.深度神经网络和视语义模型解释了人类腹侧流表象动态的互补组成部分。
J Neurosci. 2023 Mar 8;43(10):1731-1741. doi: 10.1523/JNEUROSCI.1424-22.2022. Epub 2023 Feb 9.
3
Visual features as stepping stones toward semantics: Explaining object similarity in IT and perception with non-negative least squares.作为通向语义学垫脚石的视觉特征:用非负最小二乘法解释信息技术与感知中的物体相似性。
Neuropsychologia. 2016 Mar;83:201-226. doi: 10.1016/j.neuropsychologia.2015.10.023. Epub 2015 Oct 19.
4
Divergences in color perception between deep neural networks and humans.深度神经网络与人类在颜色感知上的差异。
Cognition. 2023 Dec;241:105621. doi: 10.1016/j.cognition.2023.105621. Epub 2023 Sep 14.
5
Which deep learning model can best explain object representations of within-category exemplars?哪种深度学习模型最能解释类别内样本的对象表示?
J Vis. 2021 Sep 1;21(10):12. doi: 10.1167/jov.21.10.12.
6
Diverse Deep Neural Networks All Predict Human Inferior Temporal Cortex Well, After Training and Fitting.各种深度神经网络在经过训练和适配后都能很好地预测人类的下颞叶皮质。
J Cogn Neurosci. 2021 Sep 1;33(10):2044-2064. doi: 10.1162/jocn_a_01755.
7
Deep Neural Networks as a Computational Model for Human Shape Sensitivity.深度神经网络作为人类形状敏感度的计算模型
PLoS Comput Biol. 2016 Apr 28;12(4):e1004896. doi: 10.1371/journal.pcbi.1004896. eCollection 2016 Apr.
8
Fixed versus mixed RSA: Explaining visual representations by fixed and mixed feature sets from shallow and deep computational models.固定与混合随机语义分析:用浅层和深层计算模型中的固定和混合特征集解释视觉表征
J Math Psychol. 2017 Feb;76(Pt B):184-197. doi: 10.1016/j.jmp.2016.10.007.
9
Brain-like illusion produced by Skye's Oblique Grating in deep neural networks.神经网络中的 Skye 斜栅产生的类脑错觉。
PLoS One. 2024 Feb 23;19(2):e0299083. doi: 10.1371/journal.pone.0299083. eCollection 2024.
10
Approximating Human-Level 3D Visual Inferences With Deep Neural Networks.利用深度神经网络逼近人类水平的3D视觉推理
Open Mind (Camb). 2025 Feb 16;9:305-324. doi: 10.1162/opmi_a_00189. eCollection 2025.

引用本文的文献

1
Dimensions underlying the representational alignment of deep neural networks with humans.深度神经网络与人类表征对齐背后的维度。
Nat Mach Intell. 2025;7(6):848-859. doi: 10.1038/s42256-025-01041-7. Epub 2025 Jun 23.
2
Human shape perception spontaneously discovers the biological origin of novel, but natural, stimuli.人类形状感知能自发地发现新颖但自然的刺激的生物学起源。
J R Soc Interface. 2025 May;22(226):20240931. doi: 10.1098/rsif.2024.0931. Epub 2025 May 21.
3
Computational biology and artificial intelligence in mRNA vaccine design for cancer immunotherapy.

本文引用的文献

1
Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.深度神经网络:一种用于模拟生物视觉和大脑信息处理的新框架。
Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.
2
Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.将深度神经网络与人类视觉物体识别的时空皮层动力学进行比较,揭示了层级对应关系。
Sci Rep. 2016 Jun 10;6:27755. doi: 10.1038/srep27755.
3
Deep Neural Networks as a Computational Model for Human Shape Sensitivity.
用于癌症免疫治疗的mRNA疫苗设计中的计算生物学与人工智能
Front Cell Infect Microbiol. 2025 Jan 20;14:1501010. doi: 10.3389/fcimb.2024.1501010. eCollection 2024.
4
The canonical deep neural network as a model for human symmetry processing.作为人类对称性处理模型的典型深度神经网络。
iScience. 2024 Dec 5;28(1):111540. doi: 10.1016/j.isci.2024.111540. eCollection 2025 Jan 17.
5
Dorsoventral comparison of intraspecific variation in the butterfly wing pattern using a convolutional neural network.使用卷积神经网络对蝴蝶翅膀图案种内变异进行背腹比较。
Biol Lett. 2025 Jan;21(1):20240446. doi: 10.1098/rsbl.2024.0446. Epub 2025 Jan 15.
6
A computational deep learning investigation of animacy perception in the human brain.对人类大脑中生物运动感知的计算深度学习研究。
Commun Biol. 2024 Dec 31;7(1):1718. doi: 10.1038/s42003-024-07415-8.
7
On the ability of standard and brain-constrained deep neural networks to support cognitive superposition: a position paper.论标准和脑约束深度神经网络支持认知叠加的能力:一篇立场文件。
Cogn Neurodyn. 2024 Dec;18(6):3383-3400. doi: 10.1007/s11571-023-10061-1. Epub 2024 Feb 4.
8
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听觉中精确时间编码的任务依赖性必要性。
Nat Commun. 2024 Dec 4;15(1):10590. doi: 10.1038/s41467-024-54700-5.
9
Visual search and real-image similarity: An empirical assessment through the lens of deep learning.视觉搜索与真实图像相似度:基于深度学习视角的实证评估
Psychon Bull Rev. 2025 Apr;32(2):822-838. doi: 10.3758/s13423-024-02583-4. Epub 2024 Sep 26.
10
Hearing temperatures: employing machine learning for elucidating the cross-modal perception of thermal properties through audition.听觉温度:运用机器学习通过听觉阐明热特性的跨模态感知。
Front Psychol. 2024 Aug 2;15:1353490. doi: 10.3389/fpsyg.2024.1353490. eCollection 2024.
深度神经网络作为人类形状敏感度的计算模型
PLoS Comput Biol. 2016 Apr 28;12(4):e1004896. doi: 10.1371/journal.pcbi.1004896. eCollection 2016 Apr.
4
Eight open questions in the computational modeling of higher sensory cortex.关于高级感觉皮层计算建模的八个开放性问题。
Curr Opin Neurobiol. 2016 Apr;37:114-120. doi: 10.1016/j.conb.2016.02.001. Epub 2016 Feb 26.
5
Explicit information for category-orthogonal object properties increases along the ventral stream.明确的类别正交物体属性信息沿腹侧流增加。
Nat Neurosci. 2016 Apr;19(4):613-22. doi: 10.1038/nn.4247. Epub 2016 Feb 22.
6
Visual features as stepping stones toward semantics: Explaining object similarity in IT and perception with non-negative least squares.作为通向语义学垫脚石的视觉特征:用非负最小二乘法解释信息技术与感知中的物体相似性。
Neuropsychologia. 2016 Mar;83:201-226. doi: 10.1016/j.neuropsychologia.2015.10.023. Epub 2015 Oct 19.
7
Comparison of Object Recognition Behavior in Human and Monkey.人类与猴子物体识别行为的比较
J Neurosci. 2015 Sep 2;35(35):12127-36. doi: 10.1523/JNEUROSCI.0573-15.2015.
8
Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。
J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.
9
Deep supervised, but not unsupervised, models may explain IT cortical representation.深度监督模型而非无监督模型可能解释IT皮层表征。
PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.
10
Performance-optimized hierarchical models predict neural responses in higher visual cortex.性能优化的层次模型预测高级视觉皮层中的神经反应。
Proc Natl Acad Sci U S A. 2014 Jun 10;111(23):8619-24. doi: 10.1073/pnas.1403112111. Epub 2014 May 8.