Suppr超能文献

HelexKids:一个针对希腊和塞浦路斯小学生的词频数据库。

HelexKids: A word frequency database for Greek and Cypriot primary school children.

作者信息

Terzopoulos Aris R, Duncan Lynne G, Wilson Mark A J, Niolaki Georgia Z, Masterson Jackie

机构信息

Psychology, School of Social Sciences, University of Dundee, Nethergate, DD1 4HN, Dundee, UK.

, Dundee, UK.

出版信息

Behav Res Methods. 2017 Feb;49(1):83-96. doi: 10.3758/s13428-015-0698-5.

Abstract

In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children. The database is open access and can be searched online at www.helexkids.org .

摘要

在本文中,我们介绍了HelexKids,这是一个面向小学教育阶段(1至6年级)说希腊语儿童的在线书面文字数据库。该数据库按年级组织,并通过将一年级与二至六年级相结合进行累积组织。它提供了齐普夫值、每百万词频、离散度、估计每百万词频、标准词频、上下文多样性、正字法莱文斯坦距离和词元频率等数值。这些数值源自希腊和塞浦路斯小学教育中使用的116本教科书,共产生了68,692种不同的词类。HelexKids的开发旨在帮助研究人员研究语言发展,帮助教育工作者选择适合年龄的教学内容,以及帮助希腊/塞浦路斯儿童教育书籍的作者和作家。该数据库开放获取,可在www.helexkids.org在线搜索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/62c3/5352803/3ecddc9795cf/13428_2015_698_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验