MPRG Reading Education and Development (REaD), Max Planck Institute for Human Development, Lentzeallee 94, 14195, Berlin, Germany.
Digital Dictionary of the German Language Project, Berlin-Brandenburg Academy of Sciences, Berlin, Germany.
Behav Res Methods. 2015 Dec;47(4):1085-1094. doi: 10.3758/s13428-014-0528-1.
This article introduces childLex, an online database of German read by children. childLex is based on a corpus of children's books and comprises 10 million words that were syntactically annotated and lemmatized. childLex reports linguistic norms for lexical, superlexical, and sublexical variables in three different age groups: 6-8 years (grades 1-2), 9-10 years (grades 3-4), and 11-12 years (grades 5-6). Here, we describe how childLex was collected and analyzed. In addition, we provide information about the distributions of word frequency, word length, and orthographic neighborhood size, as well as their intercorrelations. Finally, we explain how childLex can be accessed using a Web interface.