Department of Hearing and Speech Sciences & Center for Comparative and Evolutionary Biology of Hearing, University of Maryland, College Park, MD, USA.
Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'études cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.
Dev Sci. 2021 Sep;24(5):e13090. doi: 10.1111/desc.13090. Epub 2021 Apr 6.
This study evaluates whether early vocalizations develop in similar ways in children across diverse cultural contexts. We analyze data from daylong audio recordings of 49 children (1-36 months) from five different language/cultural backgrounds. Citizen scientists annotated these recordings to determine whether child vocalizations contained canonical transitions (e.g., "ba" vs. "ee"). Results revealed that the proportion of clips reported to contain canonical transitions increased with age. Furthermore, this proportion exceeded 0.15 by around 7 months, replicating and extending previous findings on canonical vocalization development but using data from the natural environments of a culturally and linguistically diverse sample. This work explores how crowdsourcing can be used to annotate corpora, helping establish developmental milestones relevant to multiple languages and cultures. Lower inter-annotator reliability on the crowdsourcing platform, relative to more traditional in-lab expert annotators, means that a larger number of unique annotators and/or annotations are required, and that crowdsourcing may not be a suitable method for more fine-grained annotation decisions. Audio clips used for this project are compiled into a large-scale infant vocalization corpus that is available for other researchers to use in future work.
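As an illustration only, the per-child proportion of clips with canonical transitions, and its comparison against the 0.15 value mentioned above, could be computed along the following lines. The abstract does not say how the multiple crowdsourced judgments per clip were combined, so the majority-vote rule, the dropping of tied clips, the label strings, and the `(child_id, clip_id, label)` record layout are all assumptions made for this sketch, not the authors' method.

```python
from collections import Counter, defaultdict

# Proportion named in the abstract; how it was applied is not described there.
CANONICAL_THRESHOLD = 0.15


def majority_label(labels):
    """Return the most frequent label for one clip, or None on a tie (assumed rule)."""
    ranked = Counter(labels).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def canonical_proportion(annotations):
    """Compute, per child, the share of clips judged canonical.

    annotations: iterable of (child_id, clip_id, label) tuples, where label is
    "canonical" or "non-canonical" (hypothetical record layout).
    """
    labels_by_clip = defaultdict(list)
    for child_id, clip_id, label in annotations:
        labels_by_clip[(child_id, clip_id)].append(label)

    total_clips = Counter()
    canonical_clips = Counter()
    for (child_id, _clip_id), labels in labels_by_clip.items():
        label = majority_label(labels)
        if label is None:
            continue  # tied clips are skipped in this sketch
        total_clips[child_id] += 1
        if label == "canonical":
            canonical_clips[child_id] += 1

    return {
        child_id: canonical_clips[child_id] / total_clips[child_id]
        for child_id in total_clips
    }


def exceeds_threshold(proportions, threshold=CANONICAL_THRESHOLD):
    """Flag which children have a canonical proportion above the threshold."""
    return {child_id: p > threshold for child_id, p in proportions.items()}
```

Because the abstract reports lower inter-annotator reliability for crowdsourced labels, a sketch like this would in practice need several independent judgments per clip before the majority label is meaningful.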