Northwestern University, Evanston, Illinois, USA.
PLoS One. 2012;7(8):e43230. doi: 10.1371/journal.pone.0043230. Epub 2012 Aug 20.
Past research has demonstrated cross-linguistic, cross-modal, and task-dependent differences in neighborhood density effects, indicating a need to control for neighborhood variables when developing and interpreting research on language processing. The goals of the present paper are two-fold: (1) to introduce CLEARPOND (Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities), a centralized database of phonological and orthographic neighborhood information, both within and between languages, for five commonly-studied languages: Dutch, English, French, German, and Spanish; and (2) to show how CLEARPOND can be used to compare general properties of phonological and orthographic neighborhoods across languages. CLEARPOND allows researchers to input a word or list of words and obtain phonological and orthographic neighbors, neighborhood densities, mean neighborhood frequencies, word lengths by number of phonemes and graphemes, and spoken-word frequencies. Neighbors can be defined by substitution, deletion, and/or addition, and the database can be queried separately along each metric or summed across all three. Neighborhood values can be obtained both within and across languages, and outputs can optionally be restricted to neighbors of higher frequency. To enable researchers to more quickly and easily develop stimuli, CLEARPOND can also be searched by features, generating lists of words that meet precise criteria, such as a specific range of neighborhood sizes, lexical frequencies, and/or word lengths. CLEARPOND is freely-available to researchers and the public as a searchable, online database and for download at http://clearpond.northwestern.edu.
过去的研究表明,在语言处理的研究开发和解释中,需要控制词的邻域变量,因为词的邻域效应存在跨语言、跨模态和任务依赖的差异。本文有两个目标:(1)介绍 CLEARPOND(跨语言语音和正字法邻域密度的便捷获取资源库),这是一个集中的语音和正字法邻域信息数据库,包含五种常用语言:荷兰语、英语、法语、德语和西班牙语的本族语和跨语言信息;(2)展示如何使用 CLEARPOND 比较语言之间语音和正字法邻域的一般属性。CLEARPOND 允许研究人员输入一个单词或单词列表,并获得语音和正字法邻居、邻域密度、平均邻域频率、按音素和字母数量划分的单词长度以及口语单词频率。邻居可以通过替换、删除和/或添加来定义,并且可以分别按每个度量标准或按三个标准的总和查询数据库。可以在本族语和跨语言之间获取邻域值,并且输出结果可以选择限制为高频邻居。为了使研究人员能够更快、更容易地开发刺激,CLEARPOND 还可以按特征搜索,根据特定的邻域大小、词汇频率和/或单词长度范围等精确标准生成单词列表。CLEARPOND 是一个可在线搜索的免费数据库,可供研究人员和公众使用,并可在 http://clearpond.northwestern.edu 下载。