• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

估计书面语言中的单词的流行度和多样性。

Estimating the prevalence and diversity of words in written language.

机构信息

Department of Communicative Disorders and Sciences, University at Buffalo, Buffalo, NY, USA.

University of California, Berkeley, CA, USA.

出版信息

Q J Exp Psychol (Hove). 2020 Jun;73(6):841-855. doi: 10.1177/1747021819897560. Epub 2020 Feb 14.

DOI:10.1177/1747021819897560
PMID:31826715
Abstract

Recently, a new crowd-sourced language metric has been introduced, entitled word prevalence, which estimates the proportion of the population that knows a given word. This measure has been shown to account for unique variance in large sets of lexical performance. This article aims to build on the work of Brysbaert et al. and Keuleers et al. by introducing new corpus-based metrics that estimate how likely a word is to be an active member of the natural language environment, and hence known by a larger subset of the general population. This metric is derived from an analysis of a newly collected corpus of over 25,000 fiction and non-fiction books and will be shown that it is capable of accounting for significantly more variance than past corpus-based measures.

摘要

最近,一种新的众包语言指标被引入,称为单词流行度,它估计了知道某个单词的人群比例。这一指标已被证明可以解释词汇表现的大型数据集的独特差异。本文旨在通过引入新的基于语料库的指标来扩展 Brysbaert 等人和 Keuleers 等人的工作,这些指标估计了一个单词成为自然语言环境中活跃成员的可能性,从而被更大比例的一般大众所知晓。这一指标是从对一个新收集的超过 25000 本小说和非小说类书籍的语料库进行分析得出的,结果表明,它能够解释更多的方差,而不仅仅是基于过去的语料库的测量方法。

相似文献

1
Estimating the prevalence and diversity of words in written language.估计书面语言中的单词的流行度和多样性。
Q J Exp Psychol (Hove). 2020 Jun;73(6):841-855. doi: 10.1177/1747021819897560. Epub 2020 Feb 14.
2
The impact of word prevalence on lexical decision times: Evidence from the Dutch Lexicon Project 2.词频对词汇判断时间的影响:来自荷兰词汇项目2的证据
J Exp Psychol Hum Percept Perform. 2016 Mar;42(3):441-58. doi: 10.1037/xhp0000159. Epub 2015 Oct 26.
3
Contributions of semantic and contextual diversity to the word frequency effect in L2 lexical access.语义和语境多样性对二语词汇提取中词频效应的贡献。
Can J Exp Psychol. 2020 Mar;74(1):25-34. doi: 10.1037/cep0000189. Epub 2019 Oct 3.
4
Determining the Relativity of Word Meanings Through the Construction of Individualized Models of Semantic Memory.通过构建语义记忆的个体化模型来确定词义的相对性。
Cogn Sci. 2024 Feb;48(2):e13413. doi: 10.1111/cogs.13413.
5
Contextual dynamics in lexical encoding across the ageing spectrum: A simulation study.语境动态在老化谱中的词汇编码中:一项模拟研究。
Q J Exp Psychol (Hove). 2023 Sep;76(9):2164-2182. doi: 10.1177/17470218221145685. Epub 2022 Dec 27.
6
Semantic diversity: a measure of semantic ambiguity based on variability in the contextual usage of words.语义多样性:一种基于词汇上下文使用变化的语义模糊性度量。
Behav Res Methods. 2013 Sep;45(3):718-30. doi: 10.3758/s13428-012-0278-x.
7
NSP-SCD: A corpus construction protocol for child-directed print in understudied languages.NSP-SCD:面向欠研究语言的面向儿童的印刷品语料库构建协议。
Behav Res Methods. 2024 Apr;56(4):2751-2764. doi: 10.3758/s13428-024-02339-x. Epub 2024 Feb 15.
8
Comparing word frequency, semantic diversity, and semantic distinctiveness in lexical organization.比较词汇组织中的词频、语义多样性和语义独特性。
J Exp Psychol Gen. 2023 Jun;152(6):1814-1823. doi: 10.1037/xge0001407.
9
A further examination of word frequency and age-of-acquisition effects in English lexical decision task performance: The role of frequency trajectory.英语词汇判断任务表现中词频和习得年龄效应的进一步考察:频率轨迹的作用
J Exp Psychol Learn Mem Cogn. 2019 Jan;45(1):82-96. doi: 10.1037/xlm0000564. Epub 2018 Apr 23.
10
Is vocabulary growth influenced by the relations among words in a language learner's vocabulary?词汇量的增长是否受到语言学习者词汇中单词之间关系的影响?
J Exp Psychol Learn Mem Cogn. 2013 Sep;39(5):1657-62. doi: 10.1037/a0032993. Epub 2013 May 6.

引用本文的文献

1
Lexical innovations are rarely passed on during one's lifetime: Epidemiological perspectives on estimating the basic reproductive ratio of words.词汇创新在一个人的一生中很少会传承下去:关于估计词汇基本繁殖率的流行病学观点。
PLoS One. 2024 Dec 5;19(12):e0312336. doi: 10.1371/journal.pone.0312336. eCollection 2024.
2
The Children and Young People's Books Lexicon (CYP-LEX): A large-scale lexical database of books read by children and young people in the United Kingdom.《儿童与青少年书籍词汇表》(CYP-LEX):一个大规模的词汇数据库,收录了英国儿童和青少年阅读的书籍。
Q J Exp Psychol (Hove). 2024 Dec;77(12):2418-2438. doi: 10.1177/17470218241229694. Epub 2024 Mar 12.
3
Contextual dynamics in lexical encoding across the ageing spectrum: A simulation study.
语境动态在老化谱中的词汇编码中:一项模拟研究。
Q J Exp Psychol (Hove). 2023 Sep;76(9):2164-2182. doi: 10.1177/17470218221145685. Epub 2022 Dec 27.
4
Using big data to understand bilingual performance in semantic fluency: Findings from the Canadian Longitudinal Study on Aging.利用大数据了解语义流畅性的双语表现:来自加拿大老龄化纵向研究的发现。
PLoS One. 2022 Nov 28;17(11):e0277660. doi: 10.1371/journal.pone.0277660. eCollection 2022.