Suppr超能文献

VOC-ADO:一个面向说法语青少年的词汇数据库。

VOC-ADO: A lexical database for French-speaking adolescents.

作者信息

Gimenes Manuel, Lambert Eric, Chaussoy Louise, Wilson Maximiliano A, Quémart Pauline

机构信息

Centre de Recherches sur la Cognition et l'Apprentissage (CeRCA) - UMR CNRS 7295, University of Poitiers, MSHS - Cerca - Bâtiment A5, 5, rue T. Lefebvre, TSA 21103, 86073, Poitiers Cedex 9, France.

Centre interdisciplinaire de recherche en réadaptation et intégration sociale - CIRRIS et École des sciences de la réadaptation, Faculté de médecine, Université Laval, Québec , Québec, Canada.

出版信息

Behav Res Methods. 2025 Apr 2;57(5):137. doi: 10.3758/s13428-025-02656-9.

Abstract

We present VOC-ADO, a database of the written vocabulary of French adolescents between the ages of 11 and 15 (French secondary school students). VOC-ADO provides a wealth of lexical information for 110,338 words listed in school textbooks of all disciplines (i.e., academic vocabulary), as well as novels, comics, and magazines (i.e., non-academic vocabulary). For each word, several indexes of frequency and lexical dispersion are reported, as well as word length, syntactic categories, orthographic neighborhood size, and lemma frequency. Each analysis is presented separately for the Academic and Non-academic subcorpora, as well as for the overall Global corpus. Analyses of the corpora indicate that the Academic subcorpus contains a smaller variety of unique words than the Non-academic subcorpus and exhibits higher lexical sophistication. By contrast, there is a larger proportion of content words in non-academic media than in school textbooks. Finally, VOC-ADO shows a strong frequency correlation with Manulex, a French database of elementary school vocabulary, and Lexique, a lexical database of adult vocabulary. However, many words present in VOC-ADO are not found in elementary school vocabulary. These results underscore the need to examine lexical development beyond elementary school, considering the unique characteristics of the written vocabulary encountered by French-speaking adolescents. In this regard, VOC-ADO provides researchers, educators, and clinicians interested in adolescent literacy with a valuable tool to select and analyze words based on specific characteristics. The database is freely available and can be downloaded by clicking on the following link: VOC-ADO Database link.

摘要

我们展示了VOC-ADO,这是一个关于11至15岁法国青少年(法国中学生)书面词汇的数据库。VOC-ADO为所有学科的学校教科书(即学术词汇)以及小说、漫画和杂志(即非学术词汇)中列出的110338个单词提供了丰富的词汇信息。对于每个单词,报告了几个频率和词汇离散度指标,以及单词长度、句法类别、正字法邻域大小和词元频率。每个分析分别针对学术和非学术子语料库以及整体全球语料库进行呈现。语料库分析表明,学术子语料库包含的独特单词种类比非学术子语料库少,并且词汇复杂度更高。相比之下,非学术媒体中的实词比例高于学校教科书。最后,VOC-ADO与法国小学词汇数据库Manulex和成人词汇数据库Lexique显示出很强的频率相关性。然而,VOC-ADO中存在的许多单词在小学词汇中并未出现。这些结果强调了考虑法语青少年所遇到的书面词汇的独特特征来研究小学以上词汇发展的必要性。在这方面,VOC-ADO为对青少年读写能力感兴趣的研究人员、教育工作者和临床医生提供了一个有价值的工具,可根据特定特征选择和分析单词。该数据库可免费获取,点击以下链接即可下载:VOC-ADO数据库链接。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验