Cortical Encoding of Spatial Structure and Semantic Content in 3D Natural Scenes.

Authors

Mononen Riikka, Saarela Toni, Vallinoja Jaakko, Olkkonen Maria, Henriksson Linda

Affiliations

Department of Neuroscience and Biomedical Engineering, Aalto University, Espoo FI-00076, Finland.

MEG Core, Aalto NeuroImaging, Aalto University, Espoo FI-00076, Finland.

Publication information

J Neurosci. 2025 Feb 26;45(9):e2157232024. doi: 10.1523/JNEUROSCI.2157-23.2024.

Abstract

Our visual system enables us to effortlessly navigate and recognize real-world visual environments. Functional magnetic resonance imaging (fMRI) studies suggest a network of scene-responsive cortical visual areas, but much less is known about the temporal order in which different scene properties are analyzed by the human visual system. In this study, we selected a set of 36 full-color natural scenes that varied in spatial structure and semantic content that our male and female human participants viewed both in 2D and 3D while we recorded magnetoencephalography (MEG) data. MEG enables tracking of cortical activity in humans at millisecond timescale. We compared the representational geometry in the MEG responses with predictions based on the scene stimuli using the representational similarity analysis framework. The representational structure first reflected the spatial structure in the scenes in time window 90-125 ms, followed by the semantic content in time window 140-175 ms after stimulus onset. The 3D stereoscopic viewing of the scenes affected the responses relatively late, from ∼140 ms from stimulus onset. Taken together, our results indicate that the human visual system rapidly encodes a scene's spatial structure and suggest that this information is based on monocular instead of binocular depth cues.
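The comparison between MEG response geometry and stimulus-based predictions can be illustrated with a minimal sketch of time-resolved representational similarity analysis. This is not the authors' pipeline: the data are simulated, and the Euclidean distance measure, the Spearman comparison, the three-dimensional `features` model, the sensor count, and all function names are assumptions chosen for illustration. Only the count of 36 scenes and the 90-125 ms window are taken from the abstract, and the window is used here solely to decide where the simulated signal is injected.

```python
import numpy as np

def euclidean_rdm(patterns):
    """Representational dissimilarity matrix: pairwise Euclidean distance between rows."""
    diff = patterns[:, None, :] - patterns[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def upper_triangle(rdm):
    """Flatten the off-diagonal upper triangle of a square RDM."""
    rows, cols = np.triu_indices(rdm.shape[0], k=1)
    return rdm[rows, cols]

def spearman(a, b):
    """Spearman correlation via simple ranks (assumes no ties, true for continuous data)."""
    rank_a = np.argsort(np.argsort(a))
    rank_b = np.argsort(np.argsort(b))
    return np.corrcoef(rank_a, rank_b)[0, 1]

def rsa_time_course(data, model_rdm):
    """Correlate the data RDM at each time point with a fixed model RDM.

    data: array of shape (n_conditions, n_sensors, n_times)
    """
    model_vec = upper_triangle(model_rdm)
    return np.array([
        spearman(upper_triangle(euclidean_rdm(data[:, :, t])), model_vec)
        for t in range(data.shape[2])
    ])

# Simulated example (hypothetical values, not the study's data).
rng = np.random.default_rng(0)
n_scenes, n_sensors = 36, 20
times = np.arange(0, 300, 10)             # ms after stimulus onset
window = (times >= 90) & (times <= 125)   # where the simulated signal is present

features = rng.normal(size=(n_scenes, 3))  # stand-in for a scene-property model
mixing = rng.normal(size=(3, n_sensors))   # arbitrary projection onto sensors
data = rng.normal(scale=0.5, size=(n_scenes, n_sensors, times.size))
data[:, :, window] += (features @ mixing)[:, :, None]

time_course = rsa_time_course(data, euclidean_rdm(features))
print(np.round(time_course[window], 2))
```

In this toy setup the model correlation rises only at the time points where the simulated feature signal was added, which is the kind of temporal profile the abstract reports for spatial structure and, later, semantic content.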

