Department of Cognitive Science & Artificial Intelligence, Tilburg University, The Netherlands.
Language Development Department, Max Planck Institute for Psycholinguistics, The Netherlands.
J Child Lang. 2023 Nov;50(6):1374-1393. doi: 10.1017/S0305000923000302. Epub 2023 Jun 20.
While there are well-known demonstrations that children can use distributional information to acquire multiple components of language, the underpinnings of these achievements are unclear. In the current paper, we investigate the potential pre-requisites for a distributional learning model that can explain how children learn their first words. We review existing literature and then present the results of a series of computational simulations with Vector Space Models, a type of distributional semantic model used in Computational Linguistics, which we evaluate against vocabulary acquisition data from children. We focus on nouns and verbs, and we find that: (i) a model with flexibility to adjust for the frequency of events provides a better fit to the human data, (ii) the influence of context words is very local, especially for nouns, and (iii) words that share more contexts with other words are harder to learn.
虽然有一些著名的研究表明儿童可以利用分布信息来习得语言的多个组成部分,但这些成就的基础还不清楚。在当前的论文中,我们研究了一个分布式学习模型的潜在前提条件,该模型可以解释儿童如何学习他们的第一个单词。我们回顾了现有文献,然后展示了一系列使用向量空间模型(一种用于计算语言学的分布语义模型)的计算模拟结果,我们将这些结果与来自儿童的词汇习得数据进行了比较。我们专注于名词和动词,发现:(i)一个具有灵活性以适应事件频率的模型可以更好地拟合人类数据,(ii)上下文词的影响非常局部化,尤其是对于名词,(iii)与其他词共享更多上下文的词更难学习。