ESCOLEX: a grade-level lexical database from European Portuguese elementary to middle school textbooks.

Affiliation

School of Psychology, University of Minho, Minho, Portugal

Publication information

Behav Res Methods. 2014 Mar;46(1):240-53. doi: 10.3758/s13428-013-0350-1.

DOI: 10.3758/s13428-013-0350-1
PMID: 23709164

Abstract

In this article, we introduce ESCOLEX, the first European Portuguese children's lexical database with grade-level-adjusted word frequency statistics. Computed from a 3.2-million-word corpus, ESCOLEX provides 48,381 word forms extracted from 171 elementary and middle school textbooks for 6- to 11-year-old children attending the first six grades in the Portuguese educational system. Like other children's grade-level databases (e.g., Carroll, Davies, & Richman, 1971; Corral, Ferrero, & Goikoetxea, Behavior Research Methods, 41, 1009-1017, 2009; Lété, Sprenger-Charolles, & Colé, Behavior Research Methods, Instruments, & Computers, 36, 156-166, 2004; Zeno, Ivens, Millard, & Duvvuri, 1995), ESCOLEX provides four frequency indices for each grade: overall word frequency (F), index of dispersion across the selected textbooks (D), estimated frequency per million words (U), and standard frequency index (SFI). It also provides a new measure, contextual diversity (CD). In addition, the number of letters in the word and its part(s) of speech, number of syllables, syllable structure, and adult frequencies taken from P-PAL (a European Portuguese corpus-based lexical database; Soares, Comesaña, Iriarte, Almeida, Simões, Costa, …, Machado, 2010; Soares, Iriarte, Almeida, Simões, Costa, França, …, Comesaña, in press) are provided. ESCOLEX will be a useful tool both for researchers interested in language processing and development and for professionals in need of verbal materials adjusted to children's developmental stages. ESCOLEX can be downloaded along with this article or from http://p-pal.di.uminho.pt/about/databases.
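
The abstract names the indices F, D, U, SFI, and CD but does not give their formulas. The sketch below shows how such indices are commonly computed in this family of grade-level databases (the Carroll, Davies, & Richman tradition); it is an assumption that ESCOLEX follows these exact definitions, and its actual computation may differ. The function name `grade_level_indices` and its inputs (per-textbook counts and textbook sizes) are hypothetical, chosen only for illustration. CD is taken here as the number of textbooks in which the word occurs, which is also an assumption.

```python
import math


def grade_level_indices(counts, sizes):
    """Return F, D, U, SFI and CD for one word within one grade.

    counts[i]: occurrences of the word in textbook i (hypothetical input)
    sizes[i]:  total number of word tokens in textbook i (hypothetical input)
    """
    n = len(counts)
    if n < 2 or len(sizes) != n:
        raise ValueError("need matching counts and sizes for at least two textbooks")
    F = sum(counts)  # overall word frequency
    if F == 0:
        raise ValueError("the word must occur at least once")
    N = sum(sizes)   # total tokens in the grade's corpus

    # D: entropy of the word's size-normalised distribution over textbooks,
    # divided by log(n) so that 0 = one textbook only, 1 = perfectly even.
    rel = [c / s for c, s in zip(counts, sizes)]
    total = sum(rel)
    entropy = -sum((r / total) * math.log(r / total) for r in rel if r > 0)
    D = min(1.0, max(0.0, entropy / math.log(n)))  # clamp rounding noise

    # U: frequency per million tokens, adjusted for dispersion.
    f_min = sum(c * s for c, s in zip(counts, sizes)) / N
    U = (1_000_000 / N) * (F * D + (1 - D) * f_min)

    # SFI: logarithmic rescaling of U.
    SFI = 10 * (math.log10(U) + 4)

    # CD (assumed definition): number of textbooks containing the word.
    CD = sum(1 for c in counts if c > 0)

    return {"F": F, "D": D, "U": U, "SFI": SFI, "CD": CD}


evenly_spread = grade_level_indices([10, 10, 10, 10], [1000, 1000, 1000, 1000])
one_textbook = grade_level_indices([40, 0, 0, 0], [1000, 1000, 1000, 1000])
```

Both toy words have the same raw frequency (F = 40), but under these definitions the evenly spread word gets D = 1 and U = 10,000 per million, while the word confined to a single textbook gets D = 0 and U = 2,500, which illustrates why dispersion-adjusted indices are reported alongside raw counts.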

Similar articles

1
ESCOLEX: a grade-level lexical database from European Portuguese elementary to middle school textbooks.
Behav Res Methods. 2014 Mar;46(1):240-53. doi: 10.3758/s13428-013-0350-1.
2
MANULEX: a grade-level lexical database from French elementary school readers.
Behav Res Methods Instrum Comput. 2004 Feb;36(1):156-66. doi: 10.3758/bf03195560.
3
Procura-PALavras (P-PAL): A Web-based interface for a new European Portuguese lexical database.
Behav Res Methods. 2018 Aug;50(4):1461-1481. doi: 10.3758/s13428-018-1058-z.
4
On the advantages of word frequency and contextual diversity measures extracted from subtitles: The case of Portuguese.
Q J Exp Psychol (Hove). 2015;68(4):680-96. doi: 10.1080/17470218.2014.964271. Epub 2014 Nov 7.
5
Free associate norms for 139 European Portuguese words for children from different age groups.
Behav Res Methods. 2014 Jun;46(2):564-74. doi: 10.3758/s13428-013-0388-0.
6
A large-scale database of Chinese characters and words collected from elementary school textbooks.
Behav Res Methods. 2024 Aug;56(5):4732-4757. doi: 10.3758/s13428-023-02214-1. Epub 2023 Aug 24.
7
The Minho Word Pool: Norms for imageability, concreteness, and subjective frequency for 3,800 Portuguese words.
Behav Res Methods. 2017 Jun;49(3):1065-1081. doi: 10.3758/s13428-016-0767-4.
8
Children's printed word database: continuities and changes over time in children's early reading vocabulary.
Br J Psychol. 2010 May;101(Pt 2):221-42. doi: 10.1348/000712608X371744. Epub 2009 Dec 14.
9
CCLOWW: A grade-level Chinese children's lexicon of written words.
Behav Res Methods. 2023 Jun;55(4):1874-1889. doi: 10.3758/s13428-022-01890-9. Epub 2022 Jul 1.
10
LEXIN: a lexical database from Spanish kindergarten and first-grade readers.
Behav Res Methods. 2009 Nov;41(4):1009-17. doi: 10.3758/BRM.41.4.1009.

Cited by

1
-A Dictation Assessment Instrument with Automatic Analysis.
Children (Basel). 2025 Jun 14;12(6):774. doi: 10.3390/children12060774.
2
Integrating Cognitive Factors and Eye Movement Data in Reading Predictive Models for Children with Dyslexia and ADHD-I.
J Eye Mov Res. 2024 Mar 21;16(4). doi: 10.16910/jemr.16.4.6. eCollection 2023.
3
NSP-SCD: A corpus construction protocol for child-directed print in understudied languages.
Behav Res Methods. 2024 Apr;56(4):2751-2764. doi: 10.3758/s13428-024-02339-x. Epub 2024 Feb 15.
4
The Children and Young People's Books Lexicon (CYP-LEX): A large-scale lexical database of books read by children and young people in the United Kingdom.
Q J Exp Psychol (Hove). 2024 Dec;77(12):2418-2438. doi: 10.1177/17470218241229694. Epub 2024 Mar 12.
5
A large-scale database of Chinese characters and words collected from elementary school textbooks.
Behav Res Methods. 2024 Aug;56(5):4732-4757. doi: 10.3758/s13428-023-02214-1. Epub 2023 Aug 24.
6
The Children's Picture Books Lexicon (CPB-LEX): A large-scale lexical database from children's picture books.
Behav Res Methods. 2024 Aug;56(5):4504-4521. doi: 10.3758/s13428-023-02198-y. Epub 2023 Aug 11.
7
Effects of word length and word frequency among dyslexic, ADHD-I and typical readers.
J Eye Mov Res. 2022 Jun 14;15(1). doi: 10.16910/jemr.15.1.1. eCollection 2022.
8
HelexKids: A word frequency database for Greek and Cypriot primary school children.
Behav Res Methods. 2017 Feb;49(1):83-96. doi: 10.3758/s13428-015-0698-5.
9
Tracking the emergence of the consonant bias in visual-word recognition: evidence with developing readers.
PLoS One. 2014 Feb 11;9(2):e88580. doi: 10.1371/journal.pone.0088580. eCollection 2014.