Pérez Elvira, Santiago Julio, Palma Alfonso, O'Seaghdha Padraig G
Departamento de Psicología Experimental y Fisiología del Comportamiento, Universidad de Granada, Campus de Cartuja, 18179 Granada, Spain.
J Psycholinguist Res. 2007 May;36(3):207-35. doi: 10.1007/s10936-006-9042-7.
This paper studies the reliability and validity of naturalistic speech errors as a tool for language production research. Possible biases when collecting naturalistic speech errors are identified and specific predictions derived. These patterns are then contrasted with published reports from Germanic languages (English, German and Dutch) and one Romance language (Spanish). Unlike findings in the Germanic languages, Spanish speech errors show many patterns which run contrary to those expected from bias: (1) more phonological errors occur between words than within word; (2) word-initial consonants are less likely to participate in errors than word-medial consonants, (3) errors are equally likely in stressed and in unstressed syllables, (4) perseverations are more frequent than anticipations, and (5) there is no trace of a lexical bias. We present a new corpus of Spanish speech errors collected by many theoretically naïve observers (whereas the only corpus available so far was collected by two highly trained theoretically informed observers), give a general overview of it, and use it to replicate previous reports. In spite of the different susceptibility of these methods to bias, results were remarkably similar in both corpora and again contrary to predictions from bias. As a result, collecting speech errors "in the wild" seems to be free of bias to a reasonable extent even when using a multiple-collector method. The observed contrasting patterns between Spanish and Germanic languages arise as true cross-linguistic differences.
本文研究自然发生的言语错误作为语言生成研究工具的可靠性和有效性。识别了收集自然发生的言语错误时可能存在的偏差,并得出了具体预测。然后将这些模式与日耳曼语系(英语、德语和荷兰语)以及一种罗曼语系(西班牙语)已发表的报告进行对比。与日耳曼语系的研究结果不同,西班牙语的言语错误呈现出许多与偏差预期相反的模式:(1)词间的语音错误比词内的更多;(2)词首辅音比词中辅音参与错误的可能性更小;(3)重读音节和非重读音节中出现错误的可能性相同;(4)重复比提前出现更频繁;(5)不存在词汇偏差迹象。我们展示了一个由许多理论知识欠缺的观察者收集的西班牙语言语错误新语料库(而迄今为止唯一可用的语料库是由两名训练有素、理论知识丰富的观察者收集的),对其进行了总体概述,并使用它来复制先前的报告。尽管这些方法对偏差的敏感度不同,但两个语料库的结果非常相似,且再次与偏差预测相反。因此,即使使用多收集者方法,在自然环境中收集言语错误在合理程度上似乎也没有偏差。西班牙语和日耳曼语系之间观察到的对比模式是真正的跨语言差异。