Siskind J M
Department of Electrical Engineering, Technion, Haifa, Israel.
Cognition. 1996 Oct-Nov;61(1-2):39-91. doi: 10.1016/s0010-0277(96)00728-7.
This paper presents a computational study of part of the lexical-acquisition task faced by children, namely the acquisition of word-to-meaning mappings. It first approximates this task as a formal mathematical problem. It then presents an implemented algorithm for solving this problem, illustrating its operation on a small example. This algorithm offers one precise interpretation of the intuitive notions of cross-situational learning and the principle of contrast applied between words in an utterance. It robustly learns a homonymous lexicon despite noisy multi-word input, in the presence of referential uncertainty, with no prior knowledge that is specific to the language being learned. Computational simulations demonstrate the robustness of this algorithm and illustrate how algorithms based on cross-situational learning and the principle of contrast might be able to solve lexical-acquisition problems of the size faced by children, under weak, worst-case assumptions about the type and quantity of data available.
本文展示了一项针对儿童所面临的词汇习得任务的一部分(即单词到意义映射的习得)的计算研究。它首先将此任务近似为一个形式化的数学问题。然后提出了一种用于解决该问题的实现算法,并通过一个小例子说明其操作。该算法为跨情境学习的直观概念以及话语中单词之间应用的对比原则提供了一种精确解释。尽管存在多词输入噪声、指称不确定性,且没有特定于所学语言的先验知识,但它仍能稳健地学习同音异义词词典。计算模拟证明了该算法的稳健性,并说明了基于跨情境学习和对比原则的算法在关于可用数据的类型和数量的弱、最坏情况假设下,如何能够解决儿童所面临规模的词汇习得问题。