Vallabha Gautam K, McClelland James L, Pons Ferran, Werker Janet F, Amano Shigeaki
Department of Psychology, Stanford University, Jordan Hall Building 420, Stanford, CA 94305, USA.
Proc Natl Acad Sci U S A. 2007 Aug 14;104(33):13273-8. doi: 10.1073/pnas.0705369104. Epub 2007 Jul 30.
Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation-Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (/I, i, epsilon, e/ for English, /I, i, e, e/ for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, also was able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.
婴儿能迅速学习其母语的语音类别,即便他们并未接受明确或有针对性的训练。近期研究表明,这种学习归因于婴儿对语音分布的敏感性,且面向婴儿的言语包含形成母语元音类别的分布信息。本文提出一种基于期望最大化的算法,用于从元音样本序列中学习类别,而无需(i)每个元音样本都附带任何类别信息,(ii)预先知道要学习的类别数量,或(iii)获取整个数据集。当该算法接触从英语或日语面向婴儿的言语中提取的元音样本时,它成功发现了特定语言的元音类别(英语为 /I, i, epsilon, e/,日语为 /I, i, e, e/)。该算法的非参数版本与基于拓扑表示和竞争性赫布学习的神经网络模型密切相关,也能够发现元音类别,尽管可靠性稍低。这些结果强化了以下观点:母语语音类别是通过分布学习获得的,且这种学习可能以生物学上合理的方式实现。