Georgia-Ann Carter, Frank Keller, Paul Hoffman
Institute for Language, Cognition and Computation, School of Informatics, The University of Edinburgh.
School of Philosophy, Psychology and Language Sciences, The University of Edinburgh.
Cogn Sci. 2025 Jun;49(6):e70072. doi: 10.1111/cogs.70072.
Word embeddings derived from large language corpora have been successfully used in cognitive science and artificial intelligence to represent linguistic meaning. However, there is continued debate as to how well they encode useful information about the perceptual qualities of concepts. This debate is critical to identifying the scope of embodiment in human semantics. If perceptual object properties can be inferred from word embeddings derived from language alone, this suggests that language provides a useful adjunct to direct perceptual experience for acquiring this kind of conceptual knowledge. Previous research has shown mixed performance when embeddings are used to predict perceptual qualities. Here, we tested whether we could improve performance by leveraging the ability of Transformer-based language models to represent word meaning in context. To this end, we conducted two experiments. Our first experiment investigated noun representations. We generated decontextualized ("charcoal") and contextualized ("the brightness of charcoal") Word2Vec and BERT embeddings for a large set of concepts and compared their ability to predict human ratings of the concepts' brightness. We repeated this procedure to probe the shape of those concepts. In general, we found very good prediction performance for shape and more modest performance for brightness. The addition of context did not improve perceptual prediction performance. In Experiment 2, we investigated representations of adjective-noun phrases. Perceptual prediction performance was generally good, with the nonadditive nature of adjective brightness reflected in the word embeddings. We also found that the addition of context had a limited impact on how well perceptual features could be predicted. We frame these results against current work on the interpretability of language models and debates surrounding embodiment in human conceptual processing.
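The core analysis the abstract describes, predicting human perceptual ratings (e.g., brightness) from word embeddings, can be sketched as a regularized regression from embedding vectors to ratings. The snippet below is a minimal illustration only: the embeddings and ratings are synthetic stand-ins (the study used Word2Vec/BERT vectors and human judgments), and ridge regression is one common choice for this mapping, not necessarily the authors' exact method.

```python
import numpy as np

# Hypothetical setup: n_words concepts, each with a dim-dimensional embedding.
rng = np.random.default_rng(0)
n_words, dim = 200, 50

X = rng.normal(size=(n_words, dim))               # stand-in word embeddings
w_true = rng.normal(size=dim)                     # latent embedding-to-rating mapping
y = X @ w_true + 0.1 * rng.normal(size=n_words)   # synthetic "brightness" ratings

# Ridge regression, closed form: w = (X^T X + lambda * I)^{-1} X^T y
lam = 1.0
w = np.linalg.solve(X.T @ X + lam * np.eye(dim), X.T @ y)

# Prediction performance as the correlation between predicted and actual ratings.
y_pred = X @ w
r = np.corrcoef(y, y_pred)[0, 1]
print(f"correlation: {r:.3f}")
```

In the paper's setting, "good prediction performance" corresponds to a high correlation between model-predicted and human-rated perceptual values on held-out concepts; a proper evaluation would use cross-validation rather than the in-sample fit shown here.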