• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

意大利众包项目:130495个意大利语单词的视觉单词识别时间

The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words.

作者信息

Amenta Simona, de Varda Andrea Gregor, Mandera Pawel, Keuleers Emmanuel, Brysbaert Marc, Marelli Marco

机构信息

Department of Psychology, University of Milano-Bicocca, P.zza dell'Ateneo Nuovo, 1, 20126, Milano, Italy.

Lingvist Technologies, Tallinn, Estonia.

出版信息

Behav Res Methods. 2024 Dec 28;57(1):26. doi: 10.3758/s13428-024-02548-4.

DOI:10.3758/s13428-024-02548-4
PMID:39733067
Abstract

Despite being largely spoken and studied by language and cognitive scientists, Italian lacks large resources of language processing data. The Italian Crowdsourcing Project (ICP) is a dataset of word recognition times and accuracy including responses to 130,465 words, which makes it the largest dataset of its kind item-wise. The data were collected in an online word knowledge task in which over 156,000 native speakers of Italian took part. We validated the ICP dataset by (1) showing that ICP reaction times correlate strongly (r = .78) with lexical decision latencies collected in a traditional lab experiment, (2) showing that the effect of major psycholinguistic variables (e.g., frequency, length, etc.) can be replicated in this dataset, and (3) replicating the effect of word prevalence, which we compute here for the first time for Italian. Given the inclusion of many inflectional forms of verbs, adjectives, and nouns, we further showcase the potential of this dataset by exploring two phenomena (inflectional entropy in verb paradigms and the clitic effect in isolated word recognition) that build on the peculiar properties of Italian. In this paper we present the ICP resource and release response times, accuracy, and prevalence estimates for all the words included.

摘要

尽管意大利语在很大程度上被语言和认知科学家所使用和研究,但它缺乏大量的语言处理数据资源。意大利众包项目(ICP)是一个单词识别时间和准确率的数据集,包含对130465个单词的反应,这使其成为同类项目中按项目计算最大的数据集。这些数据是在一项在线单词知识任务中收集的,超过156000名意大利语母语者参与了该任务。我们通过以下方式验证了ICP数据集:(1)表明ICP反应时间与传统实验室实验中收集的词汇判断潜伏期高度相关(r = 0.78);(2)表明主要心理语言学变量(如频率、长度等)的影响可以在该数据集中复制;(3)复制单词流行度的影响,这是我们首次为意大利语计算的。鉴于该数据集包含了动词、形容词和名词的许多屈折形式,我们通过探索基于意大利语特殊属性的两种现象(动词范式中的屈折熵和孤立单词识别中的小品词效应)进一步展示了该数据集的潜力。在本文中,我们介绍了ICP资源,并公布了所有包含单词的反应时间、准确率和流行度估计值。

相似文献

1
The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words.意大利众包项目:130495个意大利语单词的视觉单词识别时间
Behav Res Methods. 2024 Dec 28;57(1):26. doi: 10.3758/s13428-024-02548-4.
2
Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project.识别 62000 个英语单词所需的时间:来自英语众包项目的数据。
Behav Res Methods. 2020 Apr;52(2):741-760. doi: 10.3758/s13428-019-01272-8.
3
Lexical decision times for nouns from the Croatian Psycholinguistic Database.来自克罗地亚心理语言学数据库的名词的词汇判断时间。
Behav Res Methods. 2025 Apr 25;57(6):156. doi: 10.3758/s13428-025-02676-5.
4
Word knowledge in the crowd: Measuring vocabulary size and word prevalence in a massive online experiment.群体中的词汇知识:在一项大规模在线实验中测量词汇量和词汇流行度
Q J Exp Psychol (Hove). 2015;68(8):1665-92. doi: 10.1080/17470218.2015.1022560. Epub 2015 Apr 8.
5
How do Spanish speakers read words? Insights from a crowdsourced lexical decision megastudy.西班牙语使用者如何阅读单词?一项众包词汇判断巨量研究的启示。
Behav Res Methods. 2020 Oct;52(5):1867-1882. doi: 10.3758/s13428-020-01357-9.
6
There is Something About Grammatical Category in Chinese Visual Word Recognition.汉语视觉词汇识别中的语法类别问题
J Psycholinguist Res. 2016 Oct;45(5):1067-87. doi: 10.1007/s10936-015-9392-0.
7
MELD-SCH: A megastudy of lexical decision in simplified Chinese.MELD-SCH:简体中文词汇判断的一项巨量研究。
Behav Res Methods. 2018 Oct;50(5):1763-1777. doi: 10.3758/s13428-017-0944-0.
8
Prevalence norms for 40,777 Catalan words: An online megastudy of vocabulary size.40777 个加泰罗尼亚语单词的使用频率规范:词汇量的在线巨量研究。
Behav Res Methods. 2023 Sep;55(6):3198-3217. doi: 10.3758/s13428-022-01959-5. Epub 2022 Sep 9.
9
Individual differences in visual word recognition: insights from the English Lexicon Project.个体在视觉词汇识别上的差异:来自英语词汇项目的启示。
J Exp Psychol Hum Percept Perform. 2012 Feb;38(1):53-79. doi: 10.1037/a0024177. Epub 2011 Jul 4.
10
How strongly do word reading times and lexical decision times correlate? Combining data from eye movement corpora and megastudies.单词阅读时间和词汇判断时间的相关性有多强?结合来自眼动语料库和大型研究的数据。
Q J Exp Psychol (Hove). 2013;66(3):563-80. doi: 10.1080/17470218.2012.658820. Epub 2012 Apr 24.

引用本文的文献

1
Lexical decision times for nouns from the Croatian Psycholinguistic Database.来自克罗地亚心理语言学数据库的名词的词汇判断时间。
Behav Res Methods. 2025 Apr 25;57(6):156. doi: 10.3758/s13428-025-02676-5.

本文引用的文献

1
Form to meaning mapping and the impact of explicit morpheme combination in novel word processing.形态到意义的映射以及在新词处理中明确词素组合的影响。
Cogn Psychol. 2023 Sep;145:101594. doi: 10.1016/j.cogpsych.2023.101594. Epub 2023 Aug 18.
2
A cross-linguistic study of spatial parameters of eye-movement control during reading.阅读过程中眼球运动控制的空间参数的跨语言研究。
J Exp Psychol Hum Percept Perform. 2022 Nov;48(11):1213-1228. doi: 10.1037/xhp0001038. Epub 2022 Sep 1.
3
Affect across adulthood: Evidence from English, Dutch, and Spanish.
成年期的情感变化:来自英语、荷兰语和西班牙语的证据。
J Exp Psychol Gen. 2021 Apr;150(4):792-812. doi: 10.1037/xge0000950.
4
How do Spanish speakers read words? Insights from a crowdsourced lexical decision megastudy.西班牙语使用者如何阅读单词?一项众包词汇判断巨量研究的启示。
Behav Res Methods. 2020 Oct;52(5):1867-1882. doi: 10.3758/s13428-020-01357-9.
5
Perceptual modality norms for 1,121 Italian words: A comparison with concreteness and imageability scores and an analysis of their impact in word processing tasks.1,121 个意大利语单词的知觉方式规范:与具体性和形象性得分的比较,以及对其在文字处理任务中的影响的分析。
Behav Res Methods. 2020 Aug;52(4):1599-1616. doi: 10.3758/s13428-019-01337-8.
6
The Lancaster Sensorimotor Norms: multidimensional measures of perceptual and action strength for 40,000 English words.兰开斯特感觉运动规范:40000 个英语单词的感知和动作强度的多维测量
Behav Res Methods. 2020 Jun;52(3):1271-1291. doi: 10.3758/s13428-019-01316-z.
7
Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project.识别 62000 个英语单词所需的时间:来自英语众包项目的数据。
Behav Res Methods. 2020 Apr;52(2):741-760. doi: 10.3758/s13428-019-01272-8.
8
Recognition Times for 54 Thousand Dutch Words: Data from the Dutch Crowdsourcing Project.54000个荷兰语单词的识别时间:来自荷兰众包项目的数据。
Psychol Belg. 2019 Jul 17;59(1):281-300. doi: 10.5334/pb.491.
9
Italian Age of Acquisition Norms for a Large Set of Words (ItAoA).大量词汇的意大利语习得年龄规范(ItAoA)。
Front Psychol. 2019 Feb 13;10:278. doi: 10.3389/fpsyg.2019.00278. eCollection 2019.
10
A thousand studies for the price of one: Accelerating psychological science with Pushkin.千篇一律不如推陈出新:以普希金加速心理学科学研究
Behav Res Methods. 2019 Aug;51(4):1782-1803. doi: 10.3758/s13428-018-1155-z.