Department of Psychology, University of Pennsylvania, 425 S University Ave, Philadelphia, PA 19104, USA.
Cognition. 2023 Jun;235:105401. doi: 10.1016/j.cognition.2023.105401. Epub 2023 Feb 12.
Over the first year, infants begin to learn the words of their language. Previous work suggests that certain statistical regularities in speech could help infants segment the speech stream into words, thereby forming a proto-lexicon that could support learning of the eventual vocabulary. However, computational models of word segmentation have typically been tested using language input that is much less variable than actual speech is. We show that using actual, transcribed pronunciations rather than dictionary pronunciations of the same speech leads to worse segmentation performance across models. We also find that phonologically variable input poses serious problems for lexicon building, because even correctly segmented word forms exhibit a complex, many-to-many relationship with speakers' intended words. Many phonologically distinct word forms were actually the same intended word, and many identical transcriptions came from different intended words. The fact that previous models appear to have substantially overestimated the utility of simple statistical heuristics suggests a need to consider the formation of the lexicon in infancy differently.
在第一年中,婴儿开始学习他们的语言的单词。先前的工作表明,语音中的某些统计规律可以帮助婴儿将语音流分割成单词,从而形成一个可能支持最终词汇学习的原始词汇。然而,单词分割的计算模型通常使用比实际语音变化少得多的语言输入进行测试。我们表明,使用实际的转录发音而不是同一语音的字典发音会导致所有模型的分割性能都变差。我们还发现,语音可变输入给词汇构建带来了严重的问题,因为即使是正确分割的单词形式也与说话者的预期单词之间存在复杂的多对一关系。许多语音上不同的单词形式实际上是同一个预期单词,而许多相同的转录来自不同的预期单词。先前的模型似乎大大高估了简单统计启发式的效用,这表明需要以不同的方式考虑婴儿期词汇的形成。