Raposo Francisco Afonso, Martins de Matos David, Ribeiro Ricardo
INESC-ID Lisboa, R. Alves Redol 9, Lisboa, 1000-029, Portugal.
Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais, Lisboa, 1049-001, Portugal.
Neuroinformatics. 2022 Apr;20(2):451-461. doi: 10.1007/s12021-021-09560-5. Epub 2022 Jan 7.
Embodied Cognition (EC) states that semantics is encoded in the brain as firing patterns of neural circuits, which are learned according to the statistical structure of human multimodal experience. However, each human brain is idiosyncratically biased, according to its subjective experience, making this biological semantic machinery noisy with respect to semantics inherent to media, such as music and language. We propose to represent media semantics using low-dimensional vector embeddings by jointly modeling the functional Magnetic Resonance Imaging (fMRI) activity of several brains via Generalized Canonical Correlation Analysis (GCCA). We evaluate the semantic richness of the resulting latent space in appropriate semantic classification tasks: music genres and language topics. We show that the resulting unsupervised representations outperform the original high-dimensional fMRI voxel spaces in these downstream tasks while being more computationally efficient. Furthermore, we show that joint modeling of several subjects increases the semantic richness of the learned latent vector spaces as the number of subjects increases. Quantitative results and corresponding statistical significance testing demonstrate the instantiation of music and language semantics in the brain, thereby providing further evidence for multimodal embodied cognition as well as a method for extraction of media semantics from multi-subject brain dynamics.
具身认知(EC)认为,语义在大脑中被编码为神经回路的放电模式,这些模式是根据人类多模态体验的统计结构学习而来的。然而,每个人的大脑根据其主观体验都存在独特的偏差,这使得这种生物语义机制在涉及音乐和语言等媒介所固有的语义时变得嘈杂。我们建议通过广义典型相关分析(GCCA)联合对多个大脑的功能磁共振成像(fMRI)活动进行建模,使用低维向量嵌入来表示媒介语义。我们在适当的语义分类任务(音乐流派和语言主题)中评估所得潜在空间的语义丰富度。我们表明,在这些下游任务中,所得的无监督表示优于原始的高维fMRI体素空间,同时计算效率更高。此外,我们表明,随着受试者数量的增加,对多个受试者进行联合建模会增加所学潜在向量空间的语义丰富度。定量结果和相应的统计显著性检验证明了音乐和语言语义在大脑中的实例化,从而为多模态具身认知提供了进一步的证据,以及一种从多受试者大脑动态中提取媒介语义的方法。