Département de Réadaptation, Université Laval, 1050 avenue de la Médecine, Québec, Québec, G1V 0A6, Canada.
Centre de Recherche de l'Institut Universitaire en Santé Mentale de Québec (CRIUSMQ), Québec, Québec, Canada.
Behav Res Methods. 2017 Oct;49(5):1852-1863. doi: 10.3758/s13428-016-0829-7.
Sublexical phonotactic regularities in language have a major impact on language development, as well as on speech processing and production throughout the entire lifespan. To understand the impact of phonotactic regularities on speech and language functions at the behavioral and neural levels, it is essential to have access to oral language corpora to study these complex phenomena in different languages. Yet, probably because of their complexity, oral language corpora remain less common than written language corpora. This article presents the first corpus and database of spoken Quebec French syllables and phones: SyllabO+. This corpus contains phonetic transcriptions of over 300,000 syllables (over 690,000 phones) extracted from recordings of 184 healthy adult native Quebec French speakers, ranging in age from 20 to 97 years. To ensure the representativeness of the corpus, these recordings were made in both formal and familiar communication contexts. Phonotactic distributional statistics (e.g., syllable and co-occurrence frequencies, percentages, percentile ranks, transition probabilities, and pointwise mutual information) were computed from the corpus. An open-access online application to search the database was developed, and is available at www.speechneurolab.ca/syllabo . In this article, we present a brief overview of the corpus, as well as the syllable and phone databases, and we discuss their practical applications in various fields of research, including cognitive neuroscience, psycholinguistics, neurolinguistics, experimental psychology, phonetics, and phonology. Nonacademic practical applications are also discussed, including uses in speech-language pathology.
亚词汇音位规则对语言发展以及整个生命周期中的言语处理和产生都有重大影响。为了在行为和神经水平上理解音位规则对言语和语言功能的影响,必须能够访问口语语言语料库来研究不同语言中的这些复杂现象。然而,可能由于其复杂性,口语语言语料库仍然比书面语言语料库少见。本文介绍了第一个魁北克法语口语音节和音位的语料库和数据库:SyllabO+。该语料库包含从 184 位年龄在 20 至 97 岁的健康成年母语为魁北克法语的说话者的录音中提取的超过 300,000 个音节(超过 690,000 个音素)的语音转录。为确保语料库的代表性,这些录音是在正式和熟悉的交流情境下进行的。从语料库中计算了音位分布统计数据(例如,音节和共现频率、百分比、百分位数等级、转移概率和逐点互信息)。开发了一个用于搜索数据库的开放访问在线应用程序,并可在 www.speechneurolab.ca/syllabo 上获得。在本文中,我们简要介绍了语料库以及音节和音位数据库,并讨论了它们在认知神经科学、心理语言学、神经语言学、实验心理学、语音学和音韵学等各个研究领域的实际应用。还讨论了非学术性的实际应用,包括在言语病理学中的应用。