• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

40777 个加泰罗尼亚语单词的使用频率规范:词汇量的在线巨量研究。

Prevalence norms for 40,777 Catalan words: An online megastudy of vocabulary size.

机构信息

Department of Psychology and CRAMC, Universitat Rovira i Virgili, Tarragona, Spain.

Centro de Investigación Nebrija en Cognición, Universidad Antonio de Nebrija, Madrid, Spain.

出版信息

Behav Res Methods. 2023 Sep;55(6):3198-3217. doi: 10.3758/s13428-022-01959-5. Epub 2022 Sep 9.

DOI:10.3758/s13428-022-01959-5
PMID:36085541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10556174/
Abstract

In this study, we present word prevalence data (i.e., the number of people who know a given word) for 40,777 Catalan words. An online massive visual lexical decision task involving more than 200,000 native speakers of this language was carried out. The characteristics of the participants as well as those of the words which mostly influence word knowledge were examined. Regarding the participants, the analysis of the data revealed that their age was the main factor influencing vocabulary size, followed by their educational level and other variables such as the number of languages spoken and their level of proficiency in Catalan. Concerning the words, by far the most determining factor was lexical frequency, with a minor influence of both length and the size of the orthographic neighborhood. These data mainly agree with those reported in other languages in which the same variables have been analyzed (Dutch, English, and Spanish, thus far). Therefore, the list is increased with Catalan, a language which, due to its use in an essentially bilingual context, is of special interest to researchers interested in the field of bilingualism and second language acquisition.

摘要

在这项研究中,我们呈现了 40777 个加泰罗尼亚语单词的词频数据(即知道给定单词的人数)。我们进行了一项涉及超过 20 万母语为该语言的人的在线大规模视觉词汇决策任务。我们检查了参与者的特征以及对词汇知识影响最大的单词特征。关于参与者,数据分析表明,年龄是影响词汇量的主要因素,其次是教育水平以及其他变量,如使用的语言数量和他们在加泰罗尼亚语方面的熟练程度。关于单词,到目前为止,最具决定性的因素是词汇频率,长度和正字法邻域的大小也有一定的影响。这些数据主要与在其他语言中分析的相同变量(迄今为止的荷兰语、英语和西班牙语)报告的数据一致。因此,由于其在本质上双语环境中的使用,加泰罗尼亚语的词汇表得到了扩充,这对双语和第二语言习得领域的研究人员特别感兴趣。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/bfbbb1fcfe60/13428_2022_1959_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/48e8b7ef98f6/13428_2022_1959_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/65c08573287d/13428_2022_1959_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/2a14067566fc/13428_2022_1959_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/2fe4a35f3eb6/13428_2022_1959_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/beeebd4a336c/13428_2022_1959_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/98d33a5679ab/13428_2022_1959_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/f37307491f95/13428_2022_1959_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/54a3623c219c/13428_2022_1959_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/147474901137/13428_2022_1959_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/87289e4e79dd/13428_2022_1959_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/a808eb009c9a/13428_2022_1959_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/763b67828797/13428_2022_1959_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/21f712936e51/13428_2022_1959_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/bfbbb1fcfe60/13428_2022_1959_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/48e8b7ef98f6/13428_2022_1959_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/65c08573287d/13428_2022_1959_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/2a14067566fc/13428_2022_1959_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/2fe4a35f3eb6/13428_2022_1959_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/beeebd4a336c/13428_2022_1959_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/98d33a5679ab/13428_2022_1959_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/f37307491f95/13428_2022_1959_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/54a3623c219c/13428_2022_1959_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/147474901137/13428_2022_1959_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/87289e4e79dd/13428_2022_1959_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/a808eb009c9a/13428_2022_1959_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/763b67828797/13428_2022_1959_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/21f712936e51/13428_2022_1959_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2abc/10556174/bfbbb1fcfe60/13428_2022_1959_Fig14_HTML.jpg

相似文献

1
Prevalence norms for 40,777 Catalan words: An online megastudy of vocabulary size.40777 个加泰罗尼亚语单词的使用频率规范:词汇量的在线巨量研究。
Behav Res Methods. 2023 Sep;55(6):3198-3217. doi: 10.3758/s13428-022-01959-5. Epub 2022 Sep 9.
2
How do Spanish speakers read words? Insights from a crowdsourced lexical decision megastudy.西班牙语使用者如何阅读单词?一项众包词汇判断巨量研究的启示。
Behav Res Methods. 2020 Oct;52(5):1867-1882. doi: 10.3758/s13428-020-01357-9.
3
Word knowledge in the crowd: Measuring vocabulary size and word prevalence in a massive online experiment.群体中的词汇知识:在一项大规模在线实验中测量词汇量和词汇流行度
Q J Exp Psychol (Hove). 2015;68(8):1665-92. doi: 10.1080/17470218.2015.1022560. Epub 2015 Apr 8.
4
Translation norms for Malay and English words: The effects of word class, semantic variability, lexical characteristics, and language proficiency on translation.马来语和英语单词的翻译规范:词类、语义可变性、词汇特征和语言水平对翻译的影响。
Behav Res Methods. 2023 Oct;55(7):3585-3601. doi: 10.3758/s13428-022-01977-3. Epub 2022 Oct 11.
5
Word prevalence norms for 62,000 English lemmas.62000 个英语词汇的词频规范。
Behav Res Methods. 2019 Apr;51(2):467-479. doi: 10.3758/s13428-018-1077-9.
6
Statistical word learning in Catalan-Spanish and English-speaking children with and without developmental language disorder.发展性语言障碍儿童与非发展性语言障碍儿童的加泰罗尼亚语-西班牙语和英语统计单词学习。
Int J Lang Commun Disord. 2022 Jan;57(1):42-62. doi: 10.1111/1460-6984.12673. Epub 2021 Oct 6.
7
The role of translation equivalents in bilingual word learning.翻译等价物在双语单词学习中的作用。
Dev Sci. 2024 Jul;27(4):e13476. doi: 10.1111/desc.13476. Epub 2024 Jan 16.
8
Can Lextale-Esp discriminate between groups of highly proficient Catalan-Spanish bilinguals with different language dominances?Lextale-Esp能否区分具有不同语言优势的高度熟练的加泰罗尼亚语-西班牙语双语者群体?
Behav Res Methods. 2017 Apr;49(2):717-723. doi: 10.3758/s13428-016-0728-y.
9
The word frequency effect in first- and second-language word recognition: a lexical entrenchment account.第一语言和第二语言词汇识别中的词频效应:一种词汇固化解释。
Q J Exp Psychol (Hove). 2013;66(5):843-63. doi: 10.1080/17470218.2012.720994. Epub 2012 Oct 2.
10
Crosslinguistic Influence (CLI) of Lexical Breadth and Depth in the Vocabulary of Bilingual Kindergarten Children - A Bilingual Intervention Study.双语幼儿园儿童词汇广度和深度的跨语言影响——一项双语干预研究
Front Psychol. 2021 Sep 30;12:671928. doi: 10.3389/fpsyg.2021.671928. eCollection 2021.

引用本文的文献

1
The polish vocabulary size test: A novel adaptive test for receptive vocabulary assessment.波兰语词汇量测试:一种用于接受性词汇评估的新型自适应测试。
Behav Res Methods. 2025 Aug 11;57(9):254. doi: 10.3758/s13428-025-02775-3.
2
Lexical decision times for nouns from the Croatian Psycholinguistic Database.来自克罗地亚心理语言学数据库的名词的词汇判断时间。
Behav Res Methods. 2025 Apr 25;57(6):156. doi: 10.3758/s13428-025-02676-5.
3
Jiwar: A database and calculator for word neighborhood measures in 40 languages.吉瓦尔:一个包含40种语言的词邻域度量的数据库和计算器。

本文引用的文献

1
How do Spanish speakers read words? Insights from a crowdsourced lexical decision megastudy.西班牙语使用者如何阅读单词?一项众包词汇判断巨量研究的启示。
Behav Res Methods. 2020 Oct;52(5):1867-1882. doi: 10.3758/s13428-020-01357-9.
2
SUBTLEX-CAT: Subtitle word frequencies and contextual diversity for Catalan.SUBTLEX-CAT:加泰罗尼亚语字幕词频和上下文多样性。
Behav Res Methods. 2020 Feb;52(1):360-375. doi: 10.3758/s13428-019-01233-1.
3
SPALEX: A Spanish Lexical Decision Database From a Massive Online Data Collection.SPALEX:一个源自大规模在线数据收集的西班牙语词汇判断数据库。
Behav Res Methods. 2025 Feb 19;57(3):98. doi: 10.3758/s13428-025-02612-7.
Front Psychol. 2018 Nov 12;9:2156. doi: 10.3389/fpsyg.2018.02156. eCollection 2018.
4
Word prevalence norms for 62,000 English lemmas.62000 个英语词汇的词频规范。
Behav Res Methods. 2019 Apr;51(2):467-479. doi: 10.3758/s13428-018-1077-9.
5
How Many Words Do We Know? Practical Estimates of Vocabulary Size Dependent on Word Definition, the Degree of Language Input and the Participant's Age.我们认识多少单词?基于单词定义、语言输入程度和参与者年龄的词汇量实际估算
Front Psychol. 2016 Jul 29;7:1116. doi: 10.3389/fpsyg.2016.01116. eCollection 2016.
6
The impact of word prevalence on lexical decision times: Evidence from the Dutch Lexicon Project 2.词频对词汇判断时间的影响:来自荷兰词汇项目2的证据
J Exp Psychol Hum Percept Perform. 2016 Mar;42(3):441-58. doi: 10.1037/xhp0000159. Epub 2015 Oct 26.
7
Foreign language comprehension achievement: insights from the cognate facilitation effect.外语理解成就:同源促进效应的见解
Front Psychol. 2015 May 6;6:588. doi: 10.3389/fpsyg.2015.00588. eCollection 2015.
8
Megastudies, crowdsourcing, and large datasets in psycholinguistics: An overview of recent developments.心理语言学中的大型研究、众包和大型数据集:近期发展概述。
Q J Exp Psychol (Hove). 2015;68(8):1457-68. doi: 10.1080/17470218.2015.1051065.
9
Word knowledge in the crowd: Measuring vocabulary size and word prevalence in a massive online experiment.群体中的词汇知识:在一项大规模在线实验中测量词汇量和词汇流行度
Q J Exp Psychol (Hove). 2015;68(8):1665-92. doi: 10.1080/17470218.2015.1022560. Epub 2015 Apr 8.
10
jsPsych: a JavaScript library for creating behavioral experiments in a Web browser.jsPsych:一个在网页浏览器中创建行为实验的 JavaScript 库。
Behav Res Methods. 2015 Mar;47(1):1-12. doi: 10.3758/s13428-014-0458-y.