van Dommelen W A
University of Kiel, FRG.
Lang Speech. 1990 Jul-Sep;33(Pt 3):259-72. doi: 10.1177/002383099003300302.
Four speaker identification tests were conducted using five female speakers known to the listeners. Starting from acoustic recordings of reiterant "ma" syllables, the perceptual importance of three factors was investigated: F0 height, F0 contour, and speech rhythm. For speakers with typically low or high voices, F0 height turned out to be a highly relevant cue in speaker identification. For all speakers, F0 contour was of secondary importance, whereas speech rhythm had a small but consistent influence on recognition rates. It could be inferred that the remaining factors alone (mainly global spectral information) would yield recognition scores of approximately 50%. Consistent with previous investigations, the relevance of perceptual cues in the recognition of familiar voices was shown not to be hierarchically fixed, but to depend on speaker-specific voice characteristics.