Suppr超能文献

语境多样性的剖析:交际需求作为词汇组织工具。

Disentangling contextual diversity: Communicative need as a lexical organizer.

机构信息

McGill University.

出版信息

Psychol Rev. 2021 Apr;128(3):525-557. doi: 10.1037/rev0000265. Epub 2021 Feb 11.

Abstract

Contextual diversity (CD; Adelman, Brown, & Quesada, 2006) modifies word frequency by ignoring word repetition in context. It has been repeatedly found that a CD count provides a better fit to lexical organization data than does word frequency (e.g., Adelman & Brown, 2008; Brysbaert & New, 2009). The importance of CD has been interpreted with the principle of likely need, adapted from the rational analysis of memory (Anderson & Schooler, 1991), which states that words that have been used in many past contexts are more likely to be needed in a future context. Central to the cognitive mechanisms of computing likely need is a definition of linguistic context itself. Typically, linguistic context is defined by relatively small units of language, such as a document within a corpus. However, recent research has demonstrated that larger definitions of context, some spanning tens or hundreds of thousands of words, provide a better accounting of lexical organization data (Johns, Dye, & Jones, 2020). This article attempts to redefine the notion of linguistic context by using socially based contextual measures, derived from the online communication patterns of hundreds of thousands of individuals from the discussion forum Reddit, consisting of over 55 billion words. Multiple count-based and semantic diversity models of contextual diversity were derived from this data. The results demonstrate that the communication patterns of individuals across discourses provides the best accounting of lexical organization data, indicating that classic notions of using local linguistic context to update a word's strength in the lexicon need to be reevaluated. (PsycInfo Database Record (c) 2021 APA, all rights reserved).

摘要

语境多样性(CD;Adelman、Brown 和 Quesada,2006)通过忽略语境中的单词重复来修改单词频率。已经反复发现,CD 计数比单词频率更能很好地拟合词汇组织数据(例如,Adelman 和 Brown,2008;Brysbaert 和 New,2009)。从记忆的理性分析(Anderson 和 Schooler,1991)中改编的可能需要原则解释了 CD 的重要性,该原则指出,在许多过去的语境中使用过的单词在未来的语境中更有可能被需要。计算可能需要的认知机制的核心是对语言语境本身的定义。通常,语言语境是通过相对较小的语言单位来定义的,例如语料库中的一个文档。然而,最近的研究表明,更大的语境定义,有些定义跨度达数十万或数百万个单词,可以更好地解释词汇组织数据(Johns、Dye 和 Jones,2020)。本文试图通过使用源自 Reddit 论坛上数十万个人在线交流模式的基于社会的语境度量来重新定义语言语境的概念,该论坛由超过 550 亿个单词组成。从这个数据中得出了基于计数和语义多样性的多个语境多样性模型。结果表明,个体在不同话语中的交流模式能够很好地解释词汇组织数据,这表明需要重新评估使用局部语言语境来更新词汇中单词强度的经典概念。(PsycInfo 数据库记录(c)2021 APA,保留所有权利)。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验