Moore C B, Jongman A
Cornell Phonetics Laboratory, Department of Modern Languages, Cornell University, Ithaca, New York 14853, USA.
J Acoust Soc Am. 1997 Sep;102(3):1864-77. doi: 10.1121/1.420092.
This study investigated speaker normalization in perception of Mandarin tone 2 (midrising) and tone 3 (low-falling-rising) by examining listeners' use of F0 range as a cue to speaker identity. Two speakers were selected such that tone 2 of the low-pitched speaker and tone 3 of the high-pitched speaker occurred at equivalent F0 heights. Production and perception experiments determined that turning point (or inflection point of the tone), and delta F0 (the difference in FO between onset and turning point) distinguished the two tones. Three tone continua varying in either turning point, delta FO, or both acoustic dimensions, were then appended to a natural precursor phrase from each of the two speakers. Results showed identification shifts such that identical stimuli were identified as low tones for the high precursor condition, but as high tones for the low precursor condition. Stimuli varying in turning point showed no significant shift, suggesting that listeners normalize only when the precursor varies in the same dimension as the stimuli. The magnitude of the shift was greater for stimuli varying only in delta FO, as compared to stimuli varying in both turning point and delta FO, indicating that normalization effects are reduced for stimuli more closely matching natural speech.
本研究通过考察听众对作为说话者身份线索的基频范围的使用,来探究普通话第二声(中升调)和第三声(低降升调)感知中的说话者归一化。选择了两位说话者,使得低音调说话者的第二声和高音调说话者的第三声出现在相同的基频高度。生产和感知实验确定,转折点(或声调的拐点)以及基频变化量(起始点和转折点之间的基频差异)区分了这两个声调。然后,将在转折点、基频变化量或两个声学维度上变化的三个声调连续体附加到两位说话者各自的一个自然前置短语上。结果显示出识别偏移,即相同的刺激在高前置条件下被识别为低音调,但在低前置条件下被识别为高音调。在转折点上变化的刺激没有显示出显著偏移,这表明听众仅当前置在与刺激相同的维度上变化时才进行归一化。与在转折点和基频变化量两者上都变化的刺激相比,仅在基频变化量上变化的刺激的偏移幅度更大,这表明对于更接近自然语音的刺激,归一化效果会降低。