Kingston John, Kawahara Shigeto, Chambless Della, Mash Daniel, Brenner-Alsop Eve
University of Massachusetts, Amherst.
J Phon. 2009 Jul;37(3):297-320. doi: 10.1016/j.wocn.2009.03.007.
In the experiments reported here, listeners categorized and discriminated speech and non-speech analogue stimuli in which the durations of a vowel and a following consonant, or their analogues, were varied orthogonally. The listeners' native languages differed in how these durations covary in speakers' productions of such sequences. Because auditorist and autonomous models of speech perception hypothesize that the auditory qualities evoked by both kinds of stimuli determine their initial perceptual evaluation, they both predict that listeners from all the languages will respond to non-speech analogues as they do to speech in both tasks. Because neither direct realist nor interactive models hypothesize such a processing stage, they predict instead that the way in which vowel and consonant durations covary in the listeners' native languages will determine how they categorize and discriminate the speech stimuli, and that all listeners will categorize and discriminate the non-speech stimuli differently from the speech stimuli. Listeners' categorization of the speech stimuli did differ as a function of how these durations covary in their native languages, but all listeners discriminated the speech stimuli in the same way, and they all categorized and discriminated the non-speech stimuli in the same way, too. These similarities could arise from listeners adding the durations of the vowel and consonant intervals (or their analogues) in these tasks with these stimuli; they do so when linguistic experience does not influence them to perceive these durations otherwise. These results support an autonomous rather than an interactive model, one in which listeners either add the durations or apply their linguistic experience at a post-perceptual stage of processing. They do not, however, support an auditorist over a direct realist model, because they provide no evidence that the signal's acoustic properties are transformed during the hypothesized prior perceptual stage.