Contextual dynamics in lexical encoding across the ageing spectrum: A simulation study.

Affiliations

Department of Psychology, McGill University, Montreal, Quebec, Canada.

University of Ottawa, Ottawa, Ontario, Canada.

Publication information

Q J Exp Psychol (Hove). 2023 Sep;76(9):2164-2182. doi: 10.1177/17470218221145685. Epub 2022 Dec 27.

Abstract

The field of psycholinguistics has recently questioned the primacy of word frequency (WF) in influencing word recognition and production, instead focusing on the importance of a word's contextual diversity (CD). WF is operationalised by counting the number of occurrences of a word in a corpus, while a word's CD is a count of the number of contexts that a word occurs in, with repetitions within a context being ignored. Numerous studies have converged on the conclusion that CD is a better predictor of word recognition latency and accuracy than frequency. These findings support a cognitive mechanism based on the principle of likely need over the principle of repetition in lexical organisation. In the current study, we trained the semantic distinctiveness model on communication patterns in social media platforms consisting of over 55 billion word tokens and examined the ability of theoretically distinct models to explain word recognition latency and accuracy data from over 1 million participants from the Mandera et al. English Crowdsourcing Project norms, consisting of approximately 59,000 words across six age bands ranging from ages 10 to 60 years. There was a clear quantitative trend across the age bands, with a shift from a social environment-based attention mechanism in the "younger" models to a clear dominance of a discourse-based attention mechanism as models "aged." This pattern suggests that there is a dynamical interaction between the cognitive mechanisms of lexical organisation and environmental information that emerges across ageing.
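
The difference between the two counts defined in the abstract can be illustrated with a minimal Python sketch. It is not the semantic distinctiveness model used in the study; it only computes raw WF and CD as defined above. The function names and the toy corpus are hypothetical, and a "context" is assumed here to be a pre-tokenised list of words (for example, one document or one post).

```python
from collections import Counter


def word_frequency(contexts):
    """WF: total number of occurrences of each word across all contexts."""
    counts = Counter()
    for tokens in contexts:
        counts.update(tokens)
    return counts


def contextual_diversity(contexts):
    """CD: number of contexts a word occurs in; repeats within a context are ignored."""
    counts = Counter()
    for tokens in contexts:
        # set() collapses repeated tokens so each context contributes at most 1 per word
        counts.update(set(tokens))
    return counts


# Hypothetical toy corpus: each inner list is one context.
toy_contexts = [
    ["the", "dog", "saw", "a", "dog", "and", "another", "dog"],
    ["the", "cat", "slept"],
    ["the", "bird", "sang"],
]

wf = word_frequency(toy_contexts)
cd = contextual_diversity(toy_contexts)

for word in ("dog", "the"):
    print(word, "WF =", wf[word], "CD =", cd[word])
```

In this toy corpus "dog" and "the" have the same WF (3), but "dog" appears in only one context while "the" appears in all three, so their CD values differ (1 versus 3). This is the kind of dissociation that lets the two measures be compared as predictors.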

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e555/10466941/aabdf87942d5/10.1177_17470218221145685-fig1.jpg