• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

卷积神经网络中目标身份和配置对场景表示的贡献。

The contribution of object identity and configuration to scene representation in convolutional neural networks.

机构信息

Department of Psychology, Yale University, New Haven, CT, United States of America.

出版信息

PLoS One. 2022 Jun 28;17(6):e0270667. doi: 10.1371/journal.pone.0270667. eCollection 2022.

DOI:10.1371/journal.pone.0270667
PMID:35763531
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9239439/
Abstract

Scene perception involves extracting the identities of the objects comprising a scene in conjunction with their configuration (the spatial layout of the objects in the scene). How object identity and configuration information is weighted during scene processing and how this weighting evolves over the course of scene processing however, is not fully understood. Recent developments in convolutional neural networks (CNNs) have demonstrated their aptitude at scene processing tasks and identified correlations between processing in CNNs and in the human brain. Here we examined four CNN architectures (Alexnet, Resnet18, Resnet50, Densenet161) and their sensitivity to changes in object and configuration information over the course of scene processing. Despite differences among the four CNN architectures, across all CNNs, we observed a common pattern in the CNN's response to object identity and configuration changes. Each CNN demonstrated greater sensitivity to configuration changes in early stages of processing and stronger sensitivity to object identity changes in later stages. This pattern persists regardless of the spatial structure present in the image background, the accuracy of the CNN in classifying the scene, and even the task used to train the CNN. Importantly, CNNs' sensitivity to a configuration change is not the same as their sensitivity to any type of position change, such as that induced by a uniform translation of the objects without a configuration change. These results provide one of the first documentations of how object identity and configuration information are weighted in CNNs during scene processing.

摘要

场景感知涉及提取构成场景的物体的身份及其配置(场景中物体的空间布局)。然而,在场景处理过程中,对象身份和配置信息是如何加权的,以及这种加权是如何随场景处理过程的发展而演变的,目前还不完全清楚。卷积神经网络(CNN)的最新发展表明了它们在场景处理任务中的能力,并确定了 CNN 与人类大脑之间的处理相关性。在这里,我们研究了四个 CNN 架构(Alexnet、Resnet18、Resnet50、Densenet161)及其对场景处理过程中对象和配置信息变化的敏感性。尽管这四个 CNN 架构存在差异,但在所有 CNN 中,我们观察到了 CNN 对对象身份和配置变化的响应存在一种共同模式。每个 CNN 在处理的早期阶段对配置变化的敏感性更强,而在后期阶段对对象身份变化的敏感性更强。这种模式与图像背景中的空间结构、CNN 对场景分类的准确性甚至用于训练 CNN 的任务无关。重要的是,CNN 对配置变化的敏感性与它们对任何类型的位置变化的敏感性不同,例如在不改变配置的情况下对物体进行均匀平移所引起的位置变化。这些结果提供了关于对象身份和配置信息在场景处理过程中在 CNN 中加权的首批文档之一。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/22ae7defdb7d/pone.0270667.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/0b7102a43825/pone.0270667.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/c0b92536d043/pone.0270667.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/4b4aa8e9fc08/pone.0270667.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/26659a23d92b/pone.0270667.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/22ae7defdb7d/pone.0270667.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/0b7102a43825/pone.0270667.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/c0b92536d043/pone.0270667.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/4b4aa8e9fc08/pone.0270667.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/26659a23d92b/pone.0270667.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e28/9239439/22ae7defdb7d/pone.0270667.g005.jpg

相似文献

1
The contribution of object identity and configuration to scene representation in convolutional neural networks.卷积神经网络中目标身份和配置对场景表示的贡献。
PLoS One. 2022 Jun 28;17(6):e0270667. doi: 10.1371/journal.pone.0270667. eCollection 2022.
2
Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.检查人类枕颞叶皮层和卷积神经网络中对象身份和非身份特征的编码强度。
J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.
3
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.
4
Transfer of Learning in the Convolutional Neural Networks on Classifying Geometric Shapes Based on Local or Global Invariants.基于局部或全局不变量的卷积神经网络在几何形状分类中的学习迁移
Front Comput Neurosci. 2021 Feb 19;15:637144. doi: 10.3389/fncom.2021.637144. eCollection 2021.
5
Limits to visual representational correspondence between convolutional neural networks and the human brain.卷积神经网络与人类大脑之间视觉表示对应关系的局限性。
Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.
6
White blood cells detection and classification based on regional convolutional neural networks.基于区域卷积神经网络的白细胞检测与分类。
Med Hypotheses. 2020 Feb;135:109472. doi: 10.1016/j.mehy.2019.109472. Epub 2019 Nov 4.
7
Understanding transformation tolerant visual object representations in the human brain and convolutional neural networks.理解人类大脑和卷积神经网络中对变换具有容忍度的视觉对象表示。
Neuroimage. 2022 Nov;263:119635. doi: 10.1016/j.neuroimage.2022.119635. Epub 2022 Sep 15.
8
A comparative study of pre-trained convolutional neural networks for semantic segmentation of breast tumors in ultrasound.用于超声乳腺肿瘤语义分割的预训练卷积神经网络的比较研究
Comput Biol Med. 2020 Nov;126:104036. doi: 10.1016/j.compbiomed.2020.104036. Epub 2020 Oct 8.
9
A novel feature-scrambling approach reveals the capacity of convolutional neural networks to learn spatial relations.一种新颖的特征混淆方法揭示了卷积神经网络学习空间关系的能力。
Neural Netw. 2023 Oct;167:400-414. doi: 10.1016/j.neunet.2023.08.021. Epub 2023 Aug 18.
10
Predicting Identity-Preserving Object Transformations in Human Posterior Parietal Cortex and Convolutional Neural Networks.预测人类后顶叶皮层和卷积神经网络中的身份保持对象转换。
J Cogn Neurosci. 2022 Nov 1;34(12):2406-2435. doi: 10.1162/jocn_a_01916.

引用本文的文献

1
Multiple visual objects are represented differently in the human brain and convolutional neural networks.人类大脑和卷积神经网络对多个视觉对象的表示方式不同。
Sci Rep. 2023 Jun 5;13(1):9088. doi: 10.1038/s41598-023-36029-z.
2
Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks.人类大脑和卷积神经网络中对多个视觉对象的表征
bioRxiv. 2023 Mar 1:2023.02.28.530472. doi: 10.1101/2023.02.28.530472.
3
Comparing the Dominance of Color and Form Information across the Human Ventral Visual Pathway and Convolutional Neural Networks.

本文引用的文献

1
Three cortical scene systems and their development.三个皮质场景系统及其发育。
Trends Cogn Sci. 2022 Feb;26(2):117-127. doi: 10.1016/j.tics.2021.11.002. Epub 2021 Nov 29.
2
Joint representation of color and form in convolutional neural networks: A stimulus-rich network perspective.卷积神经网络中颜色和形状的联合表示:一种富刺激网络视角。
PLoS One. 2021 Jun 30;16(6):e0253442. doi: 10.1371/journal.pone.0253442. eCollection 2021.
3
Limits to visual representational correspondence between convolutional neural networks and the human brain.
比较人类腹侧视觉通路和卷积神经网络中颜色和形状信息的优势。
J Cogn Neurosci. 2023 May 1;35(5):816-840. doi: 10.1162/jocn_a_01979.
卷积神经网络与人类大脑之间视觉表示对应关系的局限性。
Nat Commun. 2021 Apr 6;12(1):2065. doi: 10.1038/s41467-021-22244-7.
4
Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks.检查人类枕颞叶皮层和卷积神经网络中对象身份和非身份特征的编码强度。
J Neurosci. 2021 May 12;41(19):4234-4252. doi: 10.1523/JNEUROSCI.1993-20.2021. Epub 2021 Mar 31.
5
Integrative Benchmarking to Advance Neurally Mechanistic Models of Human Intelligence.综合基准测试以推进人类智能的神经机制模型。
Neuron. 2020 Nov 11;108(3):413-423. doi: 10.1016/j.neuron.2020.07.040. Epub 2020 Sep 11.
6
Unraveling Representations in Scene-selective Brain Regions Using Scene-Parsing Deep Neural Networks.使用场景解析深度神经网络揭示场景选择性脑区的表示。
J Cogn Neurosci. 2021 Sep 1;33(10):2032-2043. doi: 10.1162/jocn_a_01624.
7
Deep Learning: The Good, the Bad, and the Ugly.深度学习:好的、坏的和丑的。
Annu Rev Vis Sci. 2019 Sep 15;5:399-426. doi: 10.1146/annurev-vision-091718-014951. Epub 2019 Aug 8.
8
Scene Perception in the Human Brain.人类大脑中的场景感知。
Annu Rev Vis Sci. 2019 Sep 15;5:373-397. doi: 10.1146/annurev-vision-091718-014809. Epub 2019 Jun 21.
9
Similarity judgments and cortical visual responses reflect different properties of object and scene categories in naturalistic images.相似性判断和皮质视觉反应反映了自然图像中物体和场景类别的不同属性。
Neuroimage. 2019 Aug 15;197:368-382. doi: 10.1016/j.neuroimage.2019.04.079. Epub 2019 May 1.
10
Predicting eye movement patterns from fMRI responses to natural scenes.从 fMRI 对自然场景的反应中预测眼球运动模式。
Nat Commun. 2018 Dec 4;9(1):5159. doi: 10.1038/s41467-018-07471-9.