HelexKids: A word frequency database for Greek and Cypriot primary school children.

Authors

Terzopoulos Aris R, Duncan Lynne G, Wilson Mark A J, Niolaki Georgia Z, Masterson Jackie

Affiliation

Psychology, School of Social Sciences, University of Dundee, Nethergate, DD1 4HN, Dundee, UK.

Publication information

Behav Res Methods. 2017 Feb;49(1):83-96. doi: 10.3758/s13428-015-0698-5.

Abstract

In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, and writers of educational books for Greek/Cypriot children. The database is open access and can be searched online at www.helexkids.org.
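Several of the measures named in the abstract can be illustrated with a small sketch. The Python code below is not the authors' pipeline: it assumes a toy corpus in which each "textbook" is simply a list of tokens, uses the common Zipf-scale definition (log10 of frequency per million, plus 3) without any smoothing, and treats contextual diversity as the proportion of documents containing a word. The function names `frequency_measures` and `levenshtein` are hypothetical, and the exact formulas used in HelexKids may differ.

```python
import math
from collections import Counter


def frequency_measures(documents):
    """Compute simple frequency measures from a list of token lists.

    Each inner list is a toy stand-in for one textbook (an assumption;
    the real corpus and tokenisation are not described here).
    """
    counts = Counter()      # raw token counts over the whole corpus
    doc_counts = Counter()  # number of documents each word occurs in
    for tokens in documents:
        counts.update(tokens)
        doc_counts.update(set(tokens))

    total = sum(counts.values())
    measures = {}
    for word, n in counts.items():
        per_million = n / total * 1_000_000
        measures[word] = {
            "count": n,
            "per_million": per_million,
            # Zipf scale as commonly defined: log10(frequency per million) + 3
            "zipf": math.log10(per_million) + 3,
            # Proportion of documents that contain the word at least once
            "contextual_diversity": doc_counts[word] / len(documents),
        }
    return measures


def levenshtein(a, b):
    """Edit distance between two strings (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


if __name__ == "__main__":
    toy_corpus = [["το", "παιδί", "το"], ["το", "σχολείο"]]
    for word, values in frequency_measures(toy_corpus).items():
        print(word, values)
    print(levenshtein("παιδί", "παιδιά"))
```

Under this definition a word occurring once per million tokens has a Zipf value of 3. An orthographic Levenshtein distance measure would typically average `levenshtein` over a word's closest neighbours in the lexicon; the neighbourhood size is left out here because the abstract does not state it.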

Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62c3/5352803/3ecddc9795cf/13428_2015_698_Fig1_HTML.jpg
