Rosen Stuart, Hui Sze Ngar Catherine
UCL Speech, Hearing and Phonetic Sciences, 2 Wakefield Street, London WC1N 1PF, United Kingdom.
J Acoust Soc Am. 2015 Dec;138(6):3698-702. doi: 10.1121/1.4937605.
Sine-wave speech (SWS) is a highly simplified version of speech consisting only of frequency- and amplitude-modulated sinusoids representing the formants. That listeners can successfully understand SWS has led to claims that speech perception must be based on abstract properties of the stimuli far removed from their specific acoustic form. Here it is shown, in bilingual Cantonese/English listeners, that performance with Cantonese SWS is improved by noise vocoding, with no effect on English SWS utterances. This manipulation preserves the abstract informational structure in the signals but changes its surface form. The differential effects of noise vocoding likely arise from the fact that Cantonese is a tonal language and hence more reliant on fundamental frequency (F0) contours for its intelligibility. SWS does not preserve tonal information from the original speech but does have false tonal information signalled by the lowest frequency sinusoid. Noise vocoding SWS appears to minimise the tonal percept, which thus interferes less in the perception of Cantonese. It has no effect in English, which is minimally reliant on F0 variations for intelligibility. Therefore it is not only the informational structure of a sound that is important but also how its acoustic detail interacts with the phonological structure of a given language.
正弦波语音(SWS)是一种高度简化的语音形式,仅由代表共振峰的频率和幅度调制的正弦波组成。听众能够成功理解SWS这一现象引发了这样的观点,即语音感知必须基于与刺激的特定声学形式相去甚远的抽象属性。本文通过粤语/英语双语听众的实验表明,对粤语SWS进行噪声声码转换可提高其可懂度,而对英语SWS话语则无影响。这种操作保留了信号中的抽象信息结构,但改变了其表面形式。噪声声码转换的不同效果可能源于粤语是一种声调语言,因此其可懂度更依赖于基频(F0)轮廓这一事实。SWS并未保留原始语音中的声调信息,但确实有由最低频率正弦波发出的虚假声调信息。对SWS进行噪声声码转换似乎能将声调感知降至最低,从而在粤语感知中产生的干扰较少。在英语中则没有这种效果,因为英语在很大程度上不依赖F0变化来保证可懂度。因此,不仅声音的信息结构很重要,其声学细节与特定语言的音系结构如何相互作用也很重要。