Plunkett K, Marchman V
Institute of Psychology, University of Aarhus, Risskov, Denmark.
Cognition. 1991 Jan;38(1):43-102. doi: 10.1016/0010-0277(91)90022-v.
A three-layer back-propagation network is used to implement a pattern association task in which four types of mapping are learned. These mappings, which are considered analogous to those which characterize the relationship between the stem and past tense forms of English verbs, include arbitrary mappings, identity mappings, vowel changes, and additions of a suffix. The degree of correspondence between parallel distributed processing (PDP) models which learn mappings of this sort (e.g., Rumelhart & McClelland, 1986, 1987) and children's acquisition of inflectional morphology has recently been at issue in discussions of the applicability of PDP models to the study of human cognition and language (Pinker & Mehler, 1989; Bever, in press). In this paper, we explore the capacity of a network to learn these types of mappings, focusing on three major issues. First, we compare the performance of a single-layered perceptron similar to the one used by Rumelhart and McClelland with a multi-layered perceptron. The results suggest that it is unlikely that a single-layered perceptron is capable of finding an adequate solution to the problem of mapping stems and past tense forms in input configurations that are sufficiently analogous to English. Second, we explore the input conditions which determine learning in these networks. Several factors that characterize linguistic input are investigated: (a) the nature of the mapping performed by the network (arbitrary, suffixation, identity, and vowel change); (b) the competition effects that arise when the task demands simultaneous learning of distinct mapping types; (c) the role of the type and token frequency of verb stems; and (d) the influence of phonological subregularities in the irregular verbs. Each of these factors is shown to have selective consequences on both successful and erroneous performance in the network. Third, we outline several types of systems which could result in U-shaped acquisition, and discuss the ways in which learning in multi-layered networks can be seen to capture several characteristics of U-shaped learning in children. In general, these models provide information about the role of input in determining the kinds of errors that a network will produce, including the conditions under which rule-like behavior and U-shaped learning will and will not emerge. The results from all simulations are discussed in light of behavioral data on children's acquisition of the past tense and the validity of drawing conclusions about the acquisition of language from models of this sort.
一个三层反向传播网络被用于执行一个模式关联任务,其中学习了四种类型的映射。这些映射被认为类似于那些表征英语动词词干和过去式形式之间关系的映射,包括任意映射、恒等映射、元音变化和后缀添加。学习这类映射的并行分布式处理(PDP)模型(例如,Rumelhart和McClelland,1986年,1987年)与儿童屈折形态习得之间的对应程度,最近在关于PDP模型在人类认知和语言研究中的适用性的讨论中成为了争议点(Pinker和Mehler,1989年;Bever,即将出版)。在本文中,我们探讨了一个网络学习这些类型映射的能力,重点关注三个主要问题。首先,我们将类似于Rumelhart和McClelland使用的单层感知器的性能与多层感知器进行比较。结果表明,单层感知器不太可能在与英语足够相似的输入配置中找到解决映射词干和过去式形式问题的适当方法。其次,我们探索了决定这些网络学习的输入条件。研究了几个表征语言输入的因素:(a)网络执行的映射的性质(任意、后缀化、恒等和元音变化);(b)当任务要求同时学习不同映射类型时出现的竞争效应;(c)动词词干的类型和实例频率的作用;以及(d)不规则动词中语音次规则性的影响。这些因素中的每一个都被证明对网络中的成功和错误表现有选择性的影响。第三,我们概述了几种可能导致U形习得的系统类型,并讨论了多层网络中的学习可以被视为捕捉儿童U形学习的几个特征的方式。一般来说,这些模型提供了关于输入在确定网络将产生的错误类型中的作用的信息,包括规则样行为和U形学习将出现和不会出现的条件。根据关于儿童过去式习得的行为数据以及从这类模型得出关于语言习得结论的有效性,讨论了所有模拟的结果。