• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类大脑和卷积神经网络中对多个视觉对象的表征

Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks.

作者信息

Mocz Viola, Jeong Su Keun, Chun Marvin, Xu Yaoda

机构信息

Visual Cognitive Neuroscience Lab, Department of Psychology, Yale University, New Haven, CT 06520, USA.

Department of Psychology, Chungbuk National University, South Korea.

出版信息

bioRxiv. 2023 Mar 1:2023.02.28.530472. doi: 10.1101/2023.02.28.530472.

DOI:10.1101/2023.02.28.530472
PMID:36909506
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10002658/
Abstract

Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain.

摘要

现实世界中的物体常常与其他物体一同出现。为了识别一个物体的身份,无论其他物体是否同时被编码,在灵长类动物的物体处理区域,对一对物体的神经反应已被证明可以很好地用对单独呈现的每个组成物体的平均反应来近似,这表明整体等于其部分的平均值。这在猕猴颞下(IT)神经元对配对物体和单个物体的反应幅度斜率的单个单元水平上存在,并且在人类腹侧物体处理区域(例如外侧枕叶区(LO))的功能磁共振成像(fMRI)体素的群体水平反应模式中也存在。在这里,我们表明在人类外侧枕叶区的单个fMRI体素和体素群体反应中都存在平均化现象,单个体素中更好的平均化导致fMRI反应模式中更好的平均化,这表明在fMRI单元和群体水平上的平均化具有密切的对应关系。为了了解在为物体分类而预训练的卷积神经网络(CNN)中是否存在类似的平均化机制,我们研究了五个具有不同架构、深度以及是否存在循环处理的卷积神经网络。我们观察到在卷积神经网络单元水平存在平均化现象,但在群体水平很少见,在大多数情况下卷积神经网络单元反应分布与人类外侧枕叶区或猕猴颞下神经元反应不同。因此,在卷积神经网络中整体不等于其部分的平均值,这可能使得在视觉处理过程中,卷积神经网络中一对物体中的单个物体比在人类大脑中更难被识别。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/8734830f7731/nihpp-2023.02.28.530472v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/5ae126448dba/nihpp-2023.02.28.530472v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/27420b1c9b00/nihpp-2023.02.28.530472v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/ff7df3ca7524/nihpp-2023.02.28.530472v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/4481a4280901/nihpp-2023.02.28.530472v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/d65819fdb77c/nihpp-2023.02.28.530472v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/8734830f7731/nihpp-2023.02.28.530472v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/5ae126448dba/nihpp-2023.02.28.530472v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/27420b1c9b00/nihpp-2023.02.28.530472v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/ff7df3ca7524/nihpp-2023.02.28.530472v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/4481a4280901/nihpp-2023.02.28.530472v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/d65819fdb77c/nihpp-2023.02.28.530472v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d7c/10002658/8734830f7731/nihpp-2023.02.28.530472v1-f0006.jpg

相似文献

1
Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks.人类大脑和卷积神经网络中对多个视觉对象的表征
bioRxiv. 2023 Mar 1:2023.02.28.530472. doi: 10.1101/2023.02.28.530472.
2
Multiple visual objects are represented differently in the human brain and convolutional neural networks.人类大脑和卷积神经网络对多个视觉对象的表示方式不同。
Sci Rep. 2023 Jun 5;13(1):9088. doi: 10.1038/s41598-023-36029-z.
3
Understanding transformation tolerant visual object representations in the human brain and convolutional neural networks.理解人类大脑和卷积神经网络中对变换具有容忍度的视觉对象表示。
Neuroimage. 2022 Nov;263:119635. doi: 10.1016/j.neuroimage.2022.119635. Epub 2022 Sep 15.
4
Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.检查人类枕颞叶皮层和卷积神经网络中对象身份和非身份特征的编码强度。
J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.
5
Limits to visual representational correspondence between convolutional neural networks and the human brain.卷积神经网络与人类大脑之间视觉表示对应关系的局限性。
Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.
6
Task-context-dependent Linear Representation of Multiple Visual Objects in Human Parietal Cortex.人类顶叶皮层中与任务相关的多个视觉对象的线性表示
J Cogn Neurosci. 2017 Oct;29(10):1778-1789. doi: 10.1162/jocn_a_01156. Epub 2017 Jun 9.
7
There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision.视觉中的杂讯中有个“U”:人类视觉中杂讯容忍背后强大稀疏编码的证据。
J Neurosci. 2015 Oct 21;35(42):14148-59. doi: 10.1523/JNEUROSCI.1211-15.2015.
8
Predicting Identity-Preserving Object Transformations in Human Posterior Parietal Cortex and Convolutional Neural Networks.预测人类后顶叶皮层和卷积神经网络中的身份保持对象转换。
J Cogn Neurosci. 2022 Nov 1;34(12):2406-2435. doi: 10.1162/jocn_a_01916.
9
Representations of regular and irregular shapes by deep Convolutional Neural Networks, monkey inferotemporal neurons and human judgments.深度卷积神经网络、猴子下颞叶神经元和人类判断对规则和不规则形状的表示。
PLoS Comput Biol. 2018 Oct 26;14(10):e1006557. doi: 10.1371/journal.pcbi.1006557. eCollection 2018 Oct.
10
The contribution of object identity and configuration to scene representation in convolutional neural networks.卷积神经网络中目标身份和配置对场景表示的贡献。
PLoS One. 2022 Jun 28;17(6):e0270667. doi: 10.1371/journal.pone.0270667. eCollection 2022.

本文引用的文献

1
Predicting Identity-Preserving Object Transformations in Human Posterior Parietal Cortex and Convolutional Neural Networks.预测人类后顶叶皮层和卷积神经网络中的身份保持对象转换。
J Cogn Neurosci. 2022 Nov 1;34(12):2406-2435. doi: 10.1162/jocn_a_01916.
2
Understanding transformation tolerant visual object representations in the human brain and convolutional neural networks.理解人类大脑和卷积神经网络中对变换具有容忍度的视觉对象表示。
Neuroimage. 2022 Nov;263:119635. doi: 10.1016/j.neuroimage.2022.119635. Epub 2022 Sep 15.
3
The contribution of object identity and configuration to scene representation in convolutional neural networks.
卷积神经网络中目标身份和配置对场景表示的贡献。
PLoS One. 2022 Jun 28;17(6):e0270667. doi: 10.1371/journal.pone.0270667. eCollection 2022.
4
Limits to visual representational correspondence between convolutional neural networks and the human brain.卷积神经网络与人类大脑之间视觉表示对应关系的局限性。
Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.
5
Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.检查人类枕颞叶皮层和卷积神经网络中对象身份和非身份特征的编码强度。
J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.
6
Qualitative similarities and differences in visual object representations between brains and deep networks.大脑和深度网络中视觉目标表示的定性相似性和差异性。
Nat Commun. 2021 Mar 25;12(1):1872. doi: 10.1038/s41467-021-22078-3.
7
The Functional Organization of High-Level Visual Cortex Determines the Representation of Complex Visual Stimuli.高级视觉皮层的功能组织决定了复杂视觉刺激的表现。
J Neurosci. 2020 Sep 23;40(39):7545-7558. doi: 10.1523/JNEUROSCI.0446-20.2020. Epub 2020 Aug 28.
8
Reliability-based voxel selection.基于可靠性的体素选择。
Neuroimage. 2020 Feb 15;207:116350. doi: 10.1016/j.neuroimage.2019.116350. Epub 2019 Nov 14.
9
Recurrence is required to capture the representational dynamics of the human visual system.为了捕捉人类视觉系统的表示动态,需要进行再现。
Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.
10
Deep Learning: The Good, the Bad, and the Ugly.深度学习:好的、坏的和丑的。
Annu Rev Vis Sci. 2019 Sep 15;5:399-426. doi: 10.1146/annurev-vision-091718-014951. Epub 2019 Aug 8.