Department of Linguistics, University of Maryland, College Park, MD 20742;
University of Maryland Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742.
Proc Natl Acad Sci U S A. 2021 Feb 9;118(7). doi: 10.1073/pnas.2001844118.
Before they even speak, infants become attuned to the sounds of the language(s) they hear, processing native phonetic contrasts more easily than nonnative ones. For example, between 6 to 8 mo and 10 to 12 mo, infants learning American English get better at distinguishing English and [l], as in "rock" vs. "lock," relative to infants learning Japanese. Influential accounts of this phenomenon initially proposed that infants group sounds into native vowel- and consonant-like phonetic categories-like and [l] in English-through a statistical clustering mechanism dubbed "distributional learning." The feasibility of this mechanism for learning phonetic categories has been challenged, however. Here, we demonstrate that a distributional learning algorithm operating on naturalistic speech can predict early phonetic learning, as observed in Japanese and American English infants, suggesting that infants might learn through distributional learning after all. We further show, however, that, contrary to the original distributional learning proposal, our model learns units too brief and too fine-grained acoustically to correspond to phonetic categories. This challenges the influential idea that what infants learn are phonetic categories. More broadly, our work introduces a approach to the study of early phonetic learning, together with a quantitative modeling framework that can handle realistic input. This allows accounts of early phonetic learning to be linked to concrete, systematic predictions regarding infants' attunement.
在婴儿开口说话之前,他们就已经开始适应他们所听到的语言的声音,从而更容易地处理母语的语音差异,而不是非母语的语音差异。例如,在 6 到 8 个月到 10 到 12 个月之间,学习美式英语的婴儿在区分英语和[l](如“rock”与“lock”)方面比学习日语的婴儿要好。这一现象的最初解释认为,婴儿通过一种被称为“分布学习”的统计聚类机制,将声音分组为母语的元音和辅音样的语音类别,如英语中的[l]。然而,这种通过分布学习来学习语音类别的机制的可行性受到了挑战。在这里,我们证明了一种在自然语言环境中运行的分布学习算法可以预测日语和美式英语婴儿早期的语音学习,这表明婴儿可能确实是通过分布学习来学习的。然而,我们进一步表明,与最初的分布学习假说相反,我们的模型学习的单位在声学上过于短暂和精细,无法对应于语音类别。这对婴儿学习的是语音类别的有影响力的观点提出了挑战。更广泛地说,我们的工作引入了一种研究早期语音学习的方法,以及一个可以处理实际输入的定量建模框架。这使得早期语音学习的解释能够与关于婴儿适应能力的具体、系统的预测联系起来。