Schuler Kathryn D, Reeder Patricia A, Newport Elissa L, Aslin Richard N
Center for Brain Plasticity and Recovery, Department of Neurology, Georgetown University, Washington DC 20057.
Department of Psychological Science, Gustavus Adolphus College, Saint Peter, MN 56082.
Lang Learn Dev. 2017;13(4):357-374. doi: 10.1080/15475441.2016.1263571. Epub 2017 Aug 2.
Successful language acquisition hinges on organizing individual words into grammatical categories and learning the relationships between them, but the method by which children accomplish this task has been debated in the literature. One proposal is that learners use the shared distributional contexts in which words appear as a cue to their underlying category structure. Indeed, recent research using artificial languages has demonstrated that learners can acquire grammatical categories from this type of distributional information. However, artificial languages are typically composed of a small number of equally frequent words, while words in natural languages vary widely in frequency, complicating the distributional information needed to determine categorization. In a series of three experiments we demonstrate that distributional learning is preserved in an artificial language composed of words that vary in frequency as they do in natural language, along a Zipfian distribution. Rather than depending on the absolute frequency of words and their contexts, the conditional probabilities that words will occur in certain contexts (given their base frequency) is a better basis for assigning words to categories; and this appears to be the type of statistic that human learners utilize.
成功的语言习得取决于将单个单词组织成语法类别并学习它们之间的关系,但儿童完成这项任务的方法在文献中一直存在争议。一种观点认为,学习者利用单词出现的共享分布语境作为其潜在类别结构的线索。事实上,最近使用人工语言的研究表明,学习者可以从这种类型的分布信息中获取语法类别。然而,人工语言通常由少量出现频率相同的单词组成,而自然语言中的单词频率差异很大,这使得确定分类所需的分布信息变得复杂。在一系列三个实验中,我们证明了分布学习在一种由频率如自然语言中那样变化的单词组成的人工语言中得以保留,遵循齐普夫分布。将单词分配到类别更好的依据不是取决于单词及其语境的绝对频率,而是单词在特定语境中出现的条件概率(给定其基本频率);这似乎就是人类学习者所利用的那种统计数据。