Traces of Meaning Itself: Encoding Distributional Word Vectors in Brain Activity.

Authors

Sassenhagen Jona, Fiebach Christian J

Affiliations

Department of Psychology, Goethe University Frankfurt, Germany.

Brain Imaging Center, Goethe University Frankfurt, Germany.

Publication information

Neurobiol Lang (Camb). 2020 Mar 1;1(1):54-76. doi: 10.1162/nol_a_00003. eCollection 2020.

DOI:10.1162/nol_a_00003

PMID:36794005

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9923691/

Abstract

How is semantic information stored in the human mind and brain? Some philosophers and cognitive scientists argue for vectorial representations of concepts, where the meaning of a word is represented as its position in a high-dimensional neural state space. At the intersection of natural language processing and artificial intelligence, a class of very successful distributional word vector models has developed that can account for classic EEG findings of language, that is, the ease versus difficulty of integrating a word with its sentence context. However, models of semantics have to account not only for context-based word processing, but should also describe how word meaning is represented. Here, we investigate whether distributional vector representations of word meaning can model brain activity induced by words presented without context. Using EEG activity (event-related brain potentials) collected while participants in two experiments (English and German) read isolated words, we encoded and decoded word vectors taken from the family of prediction-based Word2vec algorithms. We found that, first, the position of a word in vector space allows the prediction of the pattern of corresponding neural activity over time, in particular during a time window of 300 to 500 ms after word onset. Second, distributional models perform better than a human-created taxonomic baseline model (WordNet), and this holds for several distinct vector-based models. Third, multiple latent semantic dimensions of word meaning can be decoded from brain activity. Combined, these results suggest that empiricist, prediction-based vectorial representations of meaning are a viable candidate for the representational architecture of human semantic knowledge.
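The encoding analysis described in the abstract, predicting the pattern of neural activity from a word's position in vector space, can be illustrated with a minimal sketch. This is not the authors' pipeline: the data are synthetic, ridge regression is an assumed choice of linear model, and all names (`fit_ridge`, `columnwise_correlation`, the array sizes) are hypothetical. Each row of `X` stands in for one word's vector, and each row of `Y` stands in for the flattened EEG response (channels by time points) to that word.

```python
import numpy as np

def fit_ridge(X, Y, alpha=1.0):
    """Closed-form ridge regression: W = (X'X + alpha*I)^-1 X'Y."""
    n_dims = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_dims), X.T @ Y)

def columnwise_correlation(A, B):
    """Pearson correlation between matching columns of A and B."""
    A = A - A.mean(axis=0)
    B = B - B.mean(axis=0)
    return (A * B).sum(axis=0) / np.sqrt((A ** 2).sum(axis=0) * (B ** 2).sum(axis=0))

# Synthetic stand-ins (assumption): real analyses would use trained word
# vectors and recorded event-related potentials instead.
rng = np.random.default_rng(0)
n_words, n_dims, n_features = 200, 10, 20
X = rng.standard_normal((n_words, n_dims))            # word vectors
W_true = rng.standard_normal((n_dims, n_features))    # hidden linear mapping
Y = X @ W_true + 0.1 * rng.standard_normal((n_words, n_features))  # "EEG"

# Fit on some words, then predict brain activity for held-out words.
train, test = slice(0, 150), slice(150, None)
W_hat = fit_ridge(X[train], Y[train])
Y_pred = X[test] @ W_hat

# Score the encoding model by the correlation between predicted and
# observed activity for the unseen words, averaged over features.
mean_r = float(columnwise_correlation(Y_pred, Y[test]).mean())
print(f"mean held-out correlation: {mean_r:.3f}")
```

Evaluating on held-out words is the key design choice: a high score then reflects a mapping that generalizes to new words rather than one that memorizes the training set. Decoding runs the same logic in the opposite direction, predicting vector dimensions from brain activity.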

Similar articles

Traces of Meaning Itself: Encoding Distributional Word Vectors in Brain Activity.

Neurobiol Lang (Camb). 2020 Mar 1;1(1):54-76. doi: 10.1162/nol_a_00003. eCollection 2020.

Deep Artificial Neural Networks Reveal a Distributed Cortical Network Encoding Propositional Sentence-Level Meaning.

J Neurosci. 2021 May 5;41(18):4100-4119. doi: 10.1523/JNEUROSCI.1152-20.2021. Epub 2021 Mar 22.

An Integrated Neural Decoder of Linguistic and Experiential Meaning.

J Neurosci. 2019 Nov 6;39(45):8969-8987. doi: 10.1523/JNEUROSCI.2575-18.2019. Epub 2019 Sep 30.

Distinct fronto-temporal substrates of distributional and taxonomic similarity among words: evidence from RSA of BOLD signals.

Neuroimage. 2021 Jan 1;224:117408. doi: 10.1016/j.neuroimage.2020.117408. Epub 2020 Oct 10.

Probing Lexical Ambiguity: Word Vectors Encode Number and Relatedness of Senses.

Cogn Sci. 2021 May;45(5):e12943. doi: 10.1111/cogs.12943.

Vector representations of multi-word terms for semantic relatedness.

J Biomed Inform. 2018 Jan;77:111-119. doi: 10.1016/j.jbi.2017.12.006. Epub 2017 Dec 13.

Exploring the Representations of Individual Entities in the Brain Combining EEG and Distributional Semantics.

Front Artif Intell. 2022 Feb 23;5:796793. doi: 10.3389/frai.2022.796793. eCollection 2022.

EEG decoding of spoken words in bilingual listeners: from words to language invariant semantic-conceptual representations.

Front Psychol. 2015 Feb 6;6:71. doi: 10.3389/fpsyg.2015.00071. eCollection 2015.

Exploring What Is Encoded in Distributional Word Vectors: A Neurobiologically Motivated Analysis.

Cogn Sci. 2020 Jun;44(6):e12844. doi: 10.1111/cogs.12844.

Delta-band neural activity primarily tracks sentences instead of semantic properties of words.

Neuroimage. 2022 May 1;251:118979. doi: 10.1016/j.neuroimage.2022.118979. Epub 2022 Feb 7.

Cited by

: Editorial.

Neurobiol Lang (Camb). 2020 Mar 1;1(1):1-8. doi: 10.1162/nol_e_00009. eCollection 2020.

Exploring the Representations of Individual Entities in the Brain Combining EEG and Distributional Semantics.

Front Artif Intell. 2022 Feb 23;5:796793. doi: 10.3389/frai.2022.796793. eCollection 2022.

Brains and algorithms partially converge in natural language processing.

Commun Biol. 2022 Feb 16;5(1):134. doi: 10.1038/s42003-022-03036-1.

Decoding EEG Brain Activity for Multi-Modal Natural Language Processing.

Front Hum Neurosci. 2021 Jul 13;15:659410. doi: 10.3389/fnhum.2021.659410. eCollection 2021.

References

Voxelwise encoding models with non-spherical multivariate normal priors.

Neuroimage. 2019 Aug 15;197:482-492. doi: 10.1016/j.neuroimage.2019.04.012. Epub 2019 May 7.

Toward a universal decoder of linguistic meaning from brain activation.

Nat Commun. 2018 Mar 6;9(1):963. doi: 10.1038/s41467-018-03068-4.

Electrophysiological Correlates of Semantic Dissimilarity Reflect the Comprehension of Natural, Narrative Speech.

Curr Biol. 2018 Mar 5;28(5):803-809.e3. doi: 10.1016/j.cub.2018.01.080. Epub 2018 Feb 22.

The neuro-cognitive representations of symbols: the case of concrete words.

Neuropsychologia. 2017 Oct;105:4-17. doi: 10.1016/j.neuropsychologia.2017.06.026. Epub 2017 Jun 23.

Predicting Lexical Priming Effects from Distributional Semantic Similarities: A Replication with Extension.

Front Psychol. 2016 Oct 24;7:1646. doi: 10.3389/fpsyg.2016.01646. eCollection 2016.

Natural speech reveals the semantic maps that tile human cerebral cortex.

Nature. 2016 Apr 28;532(7600):453-8. doi: 10.1038/nature17637.

A Thousand Words Are Worth a Picture: Snapshots of Printed-Word Processing in an Event-Related Potential Megastudy.

Psychol Sci. 2015 Dec;26(12):1887-97. doi: 10.1177/0956797615603934. Epub 2015 Nov 2.

Latent semantic analysis cosines as a cognitive similarity measure: Evidence from priming studies.

Q J Exp Psychol (Hove). 2016;69(4):626-53. doi: 10.1080/17470218.2015.1038280. Epub 2015 May 8.

Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses.

PLoS One. 2014 Nov 26;9(11):e112575. doi: 10.1371/journal.pone.0112575. eCollection 2014.

Characterizing the dynamics of mental representations: the temporal generalization method.

Trends Cogn Sci. 2014 Apr;18(4):203-10. doi: 10.1016/j.tics.2014.01.002. Epub 2014 Mar 2.
