• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度监督模型而非无监督模型可能解释IT皮层表征。

Deep supervised, but not unsupervised, models may explain IT cortical representation.

作者信息

Khaligh-Razavi Seyed-Mahdi, Kriegeskorte Nikolaus

机构信息

Medical Research Council, Cognition and Brain Sciences Unit, Cambridge, United Kingdom.

出版信息

PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.

DOI:10.1371/journal.pcbi.1003915
PMID:25375136
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4222664/
Abstract

Inferior temporal (IT) cortex in human and nonhuman primates serves visual object recognition. Computational object-vision models, although continually improving, do not yet reach human performance. It is unclear to what extent the internal representations of computational models can explain the IT representation. Here we investigate a wide range of computational model representations (37 in total), testing their categorization performance and their ability to account for the IT representational geometry. The models include well-known neuroscientific object-recognition models (e.g. HMAX, VisNet) along with several models from computer vision (e.g. SIFT, GIST, self-similarity features, and a deep convolutional neural network). We compared the representational dissimilarity matrices (RDMs) of the model representations with the RDMs obtained from human IT (measured with fMRI) and monkey IT (measured with cell recording) for the same set of stimuli (not used in training the models). Better performing models were more similar to IT in that they showed greater clustering of representational patterns by category. In addition, better performing models also more strongly resembled IT in terms of their within-category representational dissimilarities. Representational geometries were significantly correlated between IT and many of the models. However, the categorical clustering observed in IT was largely unexplained by the unsupervised models. The deep convolutional network, which was trained by supervision with over a million category-labeled images, reached the highest categorization performance and also best explained IT, although it did not fully explain the IT data. Combining the features of this model with appropriate weights and adding linear combinations that maximize the margin between animate and inanimate objects and between faces and other objects yielded a representation that fully explained our IT data. Overall, our results suggest that explaining IT requires computational features trained through supervised learning to emphasize the behaviorally important categorical divisions prominently reflected in IT.

摘要

人类和非人类灵长类动物的颞下回(IT)皮质负责视觉物体识别。尽管计算物体视觉模型在不断改进,但仍未达到人类的表现水平。目前尚不清楚计算模型的内部表征在多大程度上能够解释IT表征。在此,我们研究了广泛的计算模型表征(总共37种),测试它们的分类性能以及解释IT表征几何结构的能力。这些模型包括著名的神经科学物体识别模型(如HMAX、VisNet)以及计算机视觉领域的几种模型(如尺度不变特征变换(SIFT)、全局特征描述符(GIST)、自相似特征和深度卷积神经网络)。我们将模型表征的表征差异矩阵(RDM)与从人类IT(通过功能磁共振成像(fMRI)测量)和猴子IT(通过细胞记录测量)获得的RDM进行了比较,这些RDM是针对同一组刺激(未用于模型训练)。表现较好的模型与IT更为相似,因为它们按类别显示出更大的表征模式聚类。此外,表现较好的模型在类别内表征差异方面也与IT更为相似。IT与许多模型之间的表征几何结构显著相关。然而,IT中观察到的类别聚类在很大程度上无法由无监督模型解释。通过对超过一百万个带有类别标签的图像进行监督训练的深度卷积神经网络达到了最高的分类性能,并且对IT的解释也最好,尽管它没有完全解释IT数据。将该模型的特征与适当的权重相结合,并添加能够最大化有生命与无生命物体之间以及面部与其他物体之间边界的线性组合,得到了一种能够完全解释我们IT数据的表征。总体而言,我们的结果表明,解释IT需要通过监督学习训练的计算特征,以突出在IT中显著体现的行为上重要的类别划分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/c9ecded677c1/pcbi.1003915.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/b0a7edada374/pcbi.1003915.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/41ac3fee9779/pcbi.1003915.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/f3268b541124/pcbi.1003915.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/019c45a019df/pcbi.1003915.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/c9627daaca92/pcbi.1003915.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/76d3c45c21ec/pcbi.1003915.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/8cf161ab9e3b/pcbi.1003915.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/10e27dbc36a8/pcbi.1003915.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/f9d36745a7cb/pcbi.1003915.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/695830dde0b1/pcbi.1003915.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/d91004e5b189/pcbi.1003915.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/c9ecded677c1/pcbi.1003915.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/b0a7edada374/pcbi.1003915.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/41ac3fee9779/pcbi.1003915.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/f3268b541124/pcbi.1003915.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/019c45a019df/pcbi.1003915.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/c9627daaca92/pcbi.1003915.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/76d3c45c21ec/pcbi.1003915.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/8cf161ab9e3b/pcbi.1003915.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/10e27dbc36a8/pcbi.1003915.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/f9d36745a7cb/pcbi.1003915.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/695830dde0b1/pcbi.1003915.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/d91004e5b189/pcbi.1003915.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e9b/4222664/c9ecded677c1/pcbi.1003915.g012.jpg

相似文献

1
Deep supervised, but not unsupervised, models may explain IT cortical representation.深度监督模型而非无监督模型可能解释IT皮层表征。
PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.
2
Matching categorical object representations in inferior temporal cortex of man and monkey.人类和猴子颞下皮质中分类物体表征的匹配
Neuron. 2008 Dec 26;60(6):1126-41. doi: 10.1016/j.neuron.2008.10.043.
3
Fixed versus mixed RSA: Explaining visual representations by fixed and mixed feature sets from shallow and deep computational models.固定与混合随机语义分析:用浅层和深层计算模型中的固定和混合特征集解释视觉表征
J Math Psychol. 2017 Feb;76(Pt B):184-197. doi: 10.1016/j.jmp.2016.10.007.
4
Unsupervised changes in core object recognition behavior are predicted by neural plasticity in inferior temporal cortex.无监督的核心物体识别行为变化可由下颞叶皮层的神经可塑性预测。
Elife. 2021 Jun 11;10:e60830. doi: 10.7554/eLife.60830.
5
Within-category representational stability through the lens of manipulable objects.通过可操作性物体的视角看类别内的代表性稳定性。
Cortex. 2021 Apr;137:282-291. doi: 10.1016/j.cortex.2020.12.026. Epub 2021 Feb 5.
6
Shape similarity, better than semantic membership, accounts for the structure of visual object representations in a population of monkey inferotemporal neurons.形状相似性比语义成员关系更能解释猴子下颞叶神经元群体中视觉物体表示的结构。
PLoS Comput Biol. 2013;9(8):e1003167. doi: 10.1371/journal.pcbi.1003167. Epub 2013 Aug 8.
7
Visual representations are dominated by intrinsic fluctuations correlated between areas.视觉表征由区域间相关的内在波动主导。
Neuroimage. 2015 Jul 1;114:275-86. doi: 10.1016/j.neuroimage.2015.04.026. Epub 2015 Apr 17.
8
Disentangling Representations of Object Shape and Object Category in Human Visual Cortex: The Animate-Inanimate Distinction.解析人类视觉皮层中物体形状与物体类别的表征:有生命与无生命的区分
J Cogn Neurosci. 2016 May;28(5):680-92. doi: 10.1162/jocn_a_00924. Epub 2016 Jan 14.
9
Human Object-Similarity Judgments Reflect and Transcend the Primate-IT Object Representation.人类的物体相似性判断反映并超越了灵长类 IT 物体的表示。
Front Psychol. 2013 Mar 22;4:128. doi: 10.3389/fpsyg.2013.00128. eCollection 2013.
10
Representations of regular and irregular shapes by deep Convolutional Neural Networks, monkey inferotemporal neurons and human judgments.深度卷积神经网络、猴子下颞叶神经元和人类判断对规则和不规则形状的表示。
PLoS Comput Biol. 2018 Oct 26;14(10):e1006557. doi: 10.1371/journal.pcbi.1006557. eCollection 2018 Oct.

引用本文的文献

1
Improving the Interpretability of fMRI Decoding using Deep Neural Networks and Adversarial Robustness.利用深度神经网络和对抗鲁棒性提高功能磁共振成像解码的可解释性
Apert Neuro. 2023;3. doi: 10.52294/001c.85074. Epub 2023 Aug 7.
2
Enhancing neural rehabilitation insights: on the path of bridging artificial and biological neural networks.增强神经康复见解:在连接人工神经网络与生物神经网络的道路上
Brain Inform. 2025 Aug 29;12(1):21. doi: 10.1186/s40708-025-00266-x.
3
Invariant inter-subject relational structures in high order human visual cortex.

本文引用的文献

1
Learning Invariance from Transformation Sequences.从变换序列中学习不变性。
Neural Comput. 1991 Summer;3(2):194-200. doi: 10.1162/neco.1991.3.2.194.
2
Deep neural networks rival the representation of primate IT cortex for core visual object recognition.深度神经网络在核心视觉目标识别方面可与灵长类动物的颞下皮质表征相媲美。
PLoS Comput Biol. 2014 Dec 18;10(12):e1003963. doi: 10.1371/journal.pcbi.1003963. eCollection 2014 Dec.
3
Feedforward object-vision models only tolerate small image variations compared to human.与人类相比,前馈目标视觉模型只能容忍很小的图像变化。
高阶人类视觉皮层中的主体间不变关系结构。
Nat Commun. 2025 Aug 27;16(1):8015. doi: 10.1038/s41467-025-62551-x.
4
High-level visual representations in the human brain are aligned with large language models.人类大脑中的高级视觉表征与大语言模型相一致。
Nat Mach Intell. 2025;7(8):1220-1234. doi: 10.1038/s42256-025-01072-0. Epub 2025 Aug 7.
5
A feedforward mechanism for human-like contour integration.一种类人轮廓整合的前馈机制。
PLoS Comput Biol. 2025 Aug 18;21(8):e1013391. doi: 10.1371/journal.pcbi.1013391. eCollection 2025 Aug.
6
Convolutional architectures are cortex-aligned de novo.卷积架构是从头开始与皮质对齐的。
bioRxiv. 2025 Jul 29:2024.05.10.593623. doi: 10.1101/2024.05.10.593623.
7
Temporal asymmetry of neural pattern similarity predicts recognition memory decisions.神经模式相似性的时间不对称性预测识别记忆决策。
Commun Biol. 2025 Jul 31;8(1):1138. doi: 10.1038/s42003-025-08569-9.
8
Human and artificial visual systems share a computational principle for transforming binocular disparity into depth representation.人类视觉系统和人工视觉系统共享一种将双眼视差转化为深度表征的计算原理。
Commun Biol. 2025 Jul 11;8(1):1042. doi: 10.1038/s42003-025-08474-1.
9
Dimensions underlying the representational alignment of deep neural networks with humans.深度神经网络与人类表征对齐背后的维度。
Nat Mach Intell. 2025;7(6):848-859. doi: 10.1038/s42256-025-01041-7. Epub 2025 Jun 23.
10
Using fMRI to examine nonlinear mixed selectivity tuning to task and category in the human brain.使用功能磁共振成像(fMRI)来检测人类大脑中对任务和类别进行非线性混合选择性调谐的情况。
Imaging Neurosci (Camb). 2024;2. doi: 10.1162/imag_a_00354. Epub 2024 Nov 8.
Front Comput Neurosci. 2014 Jul 18;8:74. doi: 10.3389/fncom.2014.00074. eCollection 2014.
4
The impact of the lateral geniculate nucleus and corticogeniculate interactions on efficient coding and higher-order visual object processing.外侧膝状体核及皮质-膝状体相互作用对高效编码和高阶视觉物体处理的影响。
Vision Res. 2014 Aug;101:82-93. doi: 10.1016/j.visres.2014.05.006. Epub 2014 Jun 6.
5
Performance-optimized hierarchical models predict neural responses in higher visual cortex.性能优化的层次模型预测高级视觉皮层中的神经反应。
Proc Natl Acad Sci U S A. 2014 Jun 10;111(23):8619-24. doi: 10.1073/pnas.1403112111. Epub 2014 May 8.
6
A toolbox for representational similarity analysis.用于表征相似性分析的工具箱。
PLoS Comput Biol. 2014 Apr 17;10(4):e1003553. doi: 10.1371/journal.pcbi.1003553. eCollection 2014 Apr.
7
Object-specific semantic coding in human perirhinal cortex.人类旁嗅皮层中的目标特定语义编码。
J Neurosci. 2014 Apr 2;34(14):4766-75. doi: 10.1523/JNEUROSCI.2828-13.2014.
8
Resolving human object recognition in space and time.解析人类在空间和时间中的物体识别
Nat Neurosci. 2014 Mar;17(3):455-62. doi: 10.1038/nn.3635. Epub 2014 Jan 26.
9
Representational similarity analysis reveals commonalities and differences in the semantic processing of words and objects.表象相似性分析揭示了词和物体语义处理的共性和差异。
J Neurosci. 2013 Nov 27;33(48):18906-16. doi: 10.1523/JNEUROSCI.3809-13.2013.
10
Comparing visual representations across human fMRI and computational vision.比较人类功能磁共振成像和计算视觉中的视觉表征。
J Vis. 2013 Nov 22;13(13):25. doi: 10.1167/13.13.25.