整合深度视觉和语义吸引子神经网络可预测腹侧物体处理通路中的 fMRI 模式信息。

Integrated deep visual and semantic attractor neural networks predict fMRI pattern-information along the ventral object processing pathway.

机构信息

Department of Psychology, University of Cambridge, Downing Street, Cambridge, CB2 3EB, United Kingdom.

Institute of Electronics, Communications & Information Technology, Queen's University, Belfast, UK.

出版信息

Sci Rep. 2018 Jul 13;8(1):10636. doi: 10.1038/s41598-018-28865-1.

DOI:10.1038/s41598-018-28865-1

PMID:30006530

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6045572/

Abstract

Recognising an object involves rapid visual processing and activation of semantic knowledge about the object, but how visual processing activates and interacts with semantic representations remains unclear. Cognitive neuroscience research has shown that while visual processing involves posterior regions along the ventral stream, object meaning involves more anterior regions, especially perirhinal cortex. Here we investigate visuo-semantic processing by combining a deep neural network model of vision with an attractor network model of semantics, such that visual information maps onto object meanings represented as activation patterns across features. In the combined model, concept activation is driven by visual input and co-occurrence of semantic features, consistent with neurocognitive accounts. We tested the model's ability to explain fMRI data where participants named objects. Visual layers explained activation patterns in early visual cortex, whereas pattern-information in perirhinal cortex was best explained by later stages of the attractor network, when detailed semantic representations are activated. Posterior ventral temporal cortex was best explained by intermediate stages corresponding to initial semantic processing, when visual information has the greatest influence on the emerging semantic representation. These results provide proof of principle of how a mechanistic model of combined visuo-semantic processing can account for pattern-information in the ventral stream.

摘要

识别物体涉及快速的视觉处理和对物体语义知识的激活，但视觉处理如何激活和与语义表示相互作用仍不清楚。认知神经科学研究表明，虽然视觉处理涉及腹侧流的后部区域，但物体的意义涉及更靠前的区域，特别是在眶额皮层。在这里，我们通过将视觉的深度神经网络模型与语义的吸引子网络模型相结合来研究视-语义处理，使得视觉信息映射到作为特征之间激活模式表示的物体意义上。在组合模型中，概念激活由视觉输入和语义特征的共同出现驱动，这与神经认知解释一致。我们测试了该模型解释 fMRI 数据的能力，其中参与者命名物体。视觉层解释了早期视觉皮层中的激活模式，而在吸引子网络的后期阶段，当激活详细的语义表示时，对眶额皮层的模式信息的解释最好。当视觉信息对新兴的语义表示有最大影响时，与初始语义处理相对应的中间阶段可以最好地解释后腹侧颞叶皮层。这些结果提供了一个原理证明，即联合视-语义处理的机制模型如何可以解释腹侧流中的模式信息。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/27e4/6045572/091460f21a8f/41598_2018_28865_Fig1_HTML.jpg

相似文献

Integrated deep visual and semantic attractor neural networks predict fMRI pattern-information along the ventral object processing pathway.整合深度视觉和语义吸引子神经网络可预测腹侧物体处理通路中的 fMRI 模式信息。

Sci Rep. 2018 Jul 13;8(1):10636. doi: 10.1038/s41598-018-28865-1.

Objects and categories: feature statistics and object processing in the ventral stream.物体与类别：腹侧流中的特征统计与物体加工

J Cogn Neurosci. 2013 Oct;25(10):1723-35. doi: 10.1162/jocn_a_00419. Epub 2013 May 10.

Object-specific semantic coding in human perirhinal cortex.人类旁嗅皮层中的目标特定语义编码。

J Neurosci. 2014 Apr 2;34(14):4766-75. doi: 10.1523/JNEUROSCI.2828-13.2014.

Deep Neural Networks and Visuo-Semantic Models Explain Complementary Components of Human Ventral-Stream Representational Dynamics.深度神经网络和视语义模型解释了人类腹侧流表象动态的互补组成部分。

J Neurosci. 2023 Mar 8;43(10):1731-1741. doi: 10.1523/JNEUROSCI.1424-22.2022. Epub 2023 Feb 9.

Perceptual and Semantic Representations at Encoding Contribute to True and False Recognition of Objects.在编码时的知觉和语义表示有助于对物体的真实和错误识别。

J Neurosci. 2021 Oct 6;41(40):8375-8389. doi: 10.1523/JNEUROSCI.0677-21.2021. Epub 2021 Aug 19.

Conjunctive Coding of Complex Object Features.复杂对象特征的联合编码

Cereb Cortex. 2016 May;26(5):2271-2282. doi: 10.1093/cercor/bhv081. Epub 2015 Apr 28.

Predicting Identity-Preserving Object Transformations across the Human Ventral Visual Stream.预测人类腹侧视觉流中的保持身份的物体转换。

J Neurosci. 2021 Sep 1;41(35):7403-7419. doi: 10.1523/JNEUROSCI.2137-20.2021. Epub 2021 Jul 12.

The Ventral Visual Pathway Represents Animal Appearance over Animacy, Unlike Human Behavior and Deep Neural Networks.腹侧视觉通路代表动物的外观而不是能动性，与人类行为和深度神经网络不同。

J Neurosci. 2019 Aug 14;39(33):6513-6525. doi: 10.1523/JNEUROSCI.1714-18.2019. Epub 2019 Jun 13.

The relative contributions of visual and semantic information in the neural representation of object categories.视觉信息和语义信息在物体类别神经表示中的相对贡献。

Brain Behav. 2019 Oct;9(10):e01373. doi: 10.1002/brb3.1373. Epub 2019 Sep 27.

Mid-level visual features underlie the high-level categorical organization of the ventral stream.中层视觉特征是腹侧流高级类别组织的基础。

Proc Natl Acad Sci U S A. 2018 Sep 18;115(38):E9015-E9024. doi: 10.1073/pnas.1719616115. Epub 2018 Aug 31.

引用本文的文献

Representational similarity learning reveals a graded multidimensional semantic space in the human anterior temporal cortex.表征相似性学习揭示了人类前颞叶皮质中的一个分级多维语义空间。

Imaging Neurosci (Camb). 2024 Feb 22;2. doi: 10.1162/imag_a_00093. eCollection 2024.

Object fine-grained discrimination as a sensitive cognitive marker of transentorhinal integrity.作为内嗅皮层完整性敏感认知标志物的客体细粒度辨别

Commun Biol. 2025 May 25;8(1):800. doi: 10.1038/s42003-025-08201-w.

Fine-Grained Concreteness Effects on Word Processing and Representation Across Three Tasks: An ERP Study.三项任务中细粒度具体性对词汇加工与表征的影响：一项事件相关电位研究

Psychophysiology. 2025 May;62(5):e70074. doi: 10.1111/psyp.70074.

Convolutional networks can model the functional modulation of the MEG responses associated with feed-forward processes during visual word recognition.卷积网络可以对与视觉单词识别过程中的前馈过程相关的脑磁图反应的功能调制进行建模。

Elife. 2025 May 13;13:RP96217. doi: 10.7554/eLife.96217.

The Scope and Limits of Fine-Grained Image and Category Information in the Ventral Visual Pathway.腹侧视觉通路中细粒度图像和类别信息的范围与局限

J Neurosci. 2025 Jan 15;45(3):e0936242024. doi: 10.1523/JNEUROSCI.0936-24.2024.

Application of the artificial intelligence system based on graphics and vision in ethnic tourism of subtropical grasslands.基于图形与视觉的人工智能系统在亚热带草原民族旅游中的应用。

Heliyon. 2024 May 17;10(11):e31442. doi: 10.1016/j.heliyon.2024.e31442. eCollection 2024 Jun 15.

Visual Recognition Memory of Scenes Is Driven by Categorical, Not Sensory, Visual Representations.视觉场景识别记忆是由类别而非感觉视觉表象驱动的。

J Neurosci. 2024 May 22;44(21):e1479232024. doi: 10.1523/JNEUROSCI.1479-23.2024.

Using deep neural networks to disentangle visual and semantic information in human perception and memory.利用深度神经网络分离人类感知和记忆中的视觉和语义信息。

Nat Hum Behav. 2024 Apr;8(4):702-717. doi: 10.1038/s41562-024-01816-9. Epub 2024 Feb 8.

Representation of event and object concepts in ventral anterior temporal lobe and angular gyrus.腹侧前颞叶和角回中事件与物体概念的表征。

Cereb Cortex. 2024 Jan 31;34(2). doi: 10.1093/cercor/bhad519.

Hippocampal Functions Modulate Transfer-Appropriate Cortical Representations Supporting Subsequent Memory.海马功能调节转移适当的皮层代表，支持后续的记忆。

J Neurosci. 2024 Jan 3;44(1):e1135232023. doi: 10.1523/JNEUROSCI.1135-23.2023.

本文引用的文献

Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing.深度神经网络：一种用于模拟生物视觉和大脑信息处理的新框架。

Annu Rev Vis Sci. 2015 Nov 24;1:417-446. doi: 10.1146/annurev-vision-082114-035447.

Fractionating the anterior temporal lobe: MVPA reveals differential responses to input and conceptual modality.对颞叶前部进行分割：多变量模式分析揭示了对输入和概念模态的不同反应。

Neuroimage. 2017 Feb 15;147:19-31. doi: 10.1016/j.neuroimage.2016.11.067. Epub 2016 Nov 28.

Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.将深度神经网络与人类视觉物体识别的时空皮层动力学进行比较，揭示了层级对应关系。

Sci Rep. 2016 Jun 10;6:27755. doi: 10.1038/srep27755.

Understanding What We See: How We Derive Meaning From Vision.理解我们所看到的：我们如何从视觉中获取意义。

Trends Cogn Sci. 2015 Nov;19(11):677-687. doi: 10.1016/j.tics.2015.08.008.

Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。

J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.

Feature Statistics Modulate the Activation of Meaning During Spoken Word Processing.特征统计在口语单词处理过程中调节语义激活。

Cogn Sci. 2016 Mar;40(2):325-50. doi: 10.1111/cogs.12234. Epub 2015 Jun 4.

Dynamic information processing states revealed through neurocognitive models of object semantics.通过物体语义神经认知模型揭示的动态信息处理状态

Lang Cogn Neurosci. 2015 Apr 21;30(4):409-419. doi: 10.1080/23273798.2014.970652.

The perirhinal cortex and conceptual processing: Effects of feature-based statistics following damage to the anterior temporal lobes.嗅周皮层与概念加工：颞叶前部受损后基于特征的统计效应

Neuropsychologia. 2015 Sep;76:192-207. doi: 10.1016/j.neuropsychologia.2015.01.041. Epub 2015 Jan 29.

Deep neural networks rival the representation of primate IT cortex for core visual object recognition.深度神经网络在核心视觉目标识别方面可与灵长类动物的颞下皮质表征相媲美。

PLoS Comput Biol. 2014 Dec 18;10(12):e1003963. doi: 10.1371/journal.pcbi.1003963. eCollection 2014 Dec.

Deep supervised, but not unsupervised, models may explain IT cortical representation.深度监督模型而非无监督模型可能解释IT皮层表征。

PLoS Comput Biol. 2014 Nov 6;10(11):e1003915. doi: 10.1371/journal.pcbi.1003915. eCollection 2014 Nov.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

整合深度视觉和语义吸引子神经网络可预测腹侧物体处理通路中的 fMRI 模式信息。

Integrated deep visual and semantic attractor neural networks predict fMRI pattern-information along the ventral object processing pathway.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献