Department of Psychology, McGill University.
Cogn Sci. 2024 Feb;48(2):e13413. doi: 10.1111/cogs.13413.
Distributional models of lexical semantics are capable of acquiring sophisticated representations of word meanings. The main theoretical insight provided by these models is that they demonstrate the systematic connection between the knowledge that people acquire and the experience that they have with the natural language environment. However, linguistic experience is inherently variable and differs radically across people due to demographic and cultural variables. Recently, distributional models have been used to examine how word meanings vary across languages and it was found that there is considerable variability in the meanings of words across languages for most semantic categories. The goal of this article is to examine how variable word meanings are across individual language users within a single language. This was accomplished by assembling 500 individual user corpora attained from the online forum Reddit. Each user corpus ranged between 3.8 and 32.3 million words each, and a count-based distributional framework was used to extract word meanings for each user. These representations were then used to estimate the semantic alignment of word meanings across individual language users. It was found that there are significant levels of relativity in word meanings across individuals, and these differences are partially explained by other psycholinguistic factors, such as concreteness, semantic diversity, and social aspects of language usage. These results point to word meanings being fundamentally relative and contextually fluid, with this relativeness being related to the individualized nature of linguistic experience.
词汇语义分布模型能够获取到复杂的单词含义表示。这些模型提供的主要理论见解是,它们展示了人们所获得的知识与他们在自然语言环境中所获得的经验之间的系统联系。然而,语言经验本质上是可变的,并且由于人口统计学和文化变量在人与人之间存在很大差异。最近,分布模型被用于研究单词含义在不同语言之间如何变化,结果发现,对于大多数语义类别,单词的含义在不同语言之间存在相当大的可变性。本文的目的是研究在单一语言内,个体语言使用者之间的单词含义变化有多大。这是通过从在线论坛 Reddit 上收集的 500 个个人用户语料库来实现的。每个用户语料库的字数在 380 万到 3230 万之间,并且使用基于计数的分布框架从每个用户中提取单词含义。然后,使用这些表示来估计个体语言使用者之间的单词含义的语义对齐。结果发现,个体之间的单词含义存在显著的相对性,这些差异部分可以用其他心理语言学因素来解释,如具体性、语义多样性和语言使用的社会方面。这些结果表明,单词含义从根本上是相对的,并且具有语境流动性,这种相对性与语言经验的个体化性质有关。