Shofner William P, Yacko Nicole, Bowdrie Kristina
Department of Speech and Hearing Sciences.
J Comp Psychol. 2019 Aug;133(3):326-339. doi: 10.1037/com0000165. Epub 2018 Dec 27.
One characteristic of human speech perception is a remarkable ability to recognize speech when the speech signal is highly degraded. It has been argued that this ability to perceive highly degraded speech reflects speech-specific mechanisms. The present study tested this hypothesis by measuring the ability of chinchillas to recognize noise-vocoded (NV) versions of naturally spoken monosyllabic words using operant conditioning in a stimulus generalization paradigm. Chinchillas do not generalize the vocoded words to be perceptually equivalent to the naturally spoken words. The responses from chinchillas to the vocoded words fall well below their responses to the naturally spoken words. In this case, pitch cues rather than speech cues may be controlling the behavioral responses. To reduce pitch cues, chinchillas were retrained using 64-channel NV words. The responses from chinchillas to the vocoded test words were now similar to those of the 64-channel versions and were similar to those obtained from human listeners. However, responses obtained from chinchillas to time-reversed versions were high and similar to responses obtained to time-normal versions suggesting that the cue controlling behavioral responses was the phonetic structure of the words. These results show that chinchillas used different acoustic cues than human listeners. The ability of chinchillas to recognize NV words as being perceptually equivalent to the naturally spoken versions is inferior compared to that of human listeners. The findings suggest that the ability of human listeners to recognize highly degraded speech is unlikely to be based solely on the general auditory and perceptual mechanisms that are common among mammals. (PsycINFO Database Record (c) 2019 APA, all rights reserved).
人类言语感知的一个特点是,在言语信号严重退化时仍具有非凡的言语识别能力。有人认为,这种感知严重退化言语的能力反映了特定于言语的机制。本研究通过在刺激泛化范式中使用操作性条件反射来测量龙猫识别自然说出的单音节词的噪声声码(NV)版本的能力,对这一假设进行了检验。龙猫不会将声码词泛化为在感知上等同于自然说出的词。龙猫对声码词的反应远低于它们对自然说出的词的反应。在这种情况下,音高线索而非言语线索可能在控制行为反应。为了减少音高线索,使用64通道NV词对龙猫进行了重新训练。现在,龙猫对声码测试词的反应与64通道版本的反应相似,并且与从人类听众那里获得的反应相似。然而,从龙猫对时间反转版本获得的反应很高,并且与对时间正常版本获得的反应相似,这表明控制行为反应的线索是词的语音结构。这些结果表明,龙猫使用的声学线索与人类听众不同。与人类听众相比,龙猫将NV词识别为在感知上等同于自然说出版本的能力较差。这些发现表明,人类听众识别严重退化言语的能力不太可能仅仅基于哺乳动物共有的一般听觉和感知机制。(PsycINFO数据库记录(c)2019美国心理学会,保留所有权利)