Nawka T, Anders L C, Cebulla M, Zurakowski D
Phoniatric Department, Humboldt-Universität of Berlin, Germany.
J Voice. 1997 Dec;11(4):422-8. doi: 10.1016/s0892-1997(97)80038-0.
Spectral analysis of vowels in connected speech can be performed using the spectral intensity distribution within critical bands, which correspond to a natural scale on the basilar membrane. Normalizing the spectra allows objective comparisons that are independent of the recording level. An increasing envelope peak between 3,150 and 3,700 Hz has been confirmed statistically for a combination of seven vowels in three groups of male speakers with hoarse, normal, and professional voices. Each vowel is also analyzed individually. The local energy maximum is called "the speaker's formant" and can be found in the region of the fourth formant. The steepness of the spectral slope (i.e., the rate of decline) becomes less pronounced when the sonority or the intensity of the voice increases. The speaker's formant is connected with the sonorous quality of the voice. It increases gradually and is approximately 10 dB higher in professional male voices than in normal male voices at neutral loudness (60 dB at 0.3 m). The peak intensity becomes stronger (30 dB above normal voices) when the overall speaking loudness is increased to 80 dB. Shouting increases the spectral energy of the adjacent critical bands but not the speaker's formant itself.
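The critical-band analysis described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the commonly tabulated Zwicker critical-band edges, a Hann window with a single FFT over the whole segment, and normalization to the strongest band, since the abstract does not state which reference level was used. All names (`BAND_EDGES_HZ`, `SPEAKER_FORMANT_BAND`, `critical_band_levels_db`) are hypothetical.

```python
import numpy as np

# Assumed Zwicker critical-band edges in Hz. The 3150-3700 Hz band is the
# region the abstract associates with the speaker's formant.
BAND_EDGES_HZ = np.array([
    20, 100, 200, 300, 400, 510, 630, 770, 920, 1080, 1270, 1480, 1720,
    2000, 2320, 2700, 3150, 3700, 4400, 5300, 6400, 7700, 9500, 12000, 15500,
], dtype=float)

# Index of the 3150-3700 Hz band within the band list above.
SPEAKER_FORMANT_BAND = 16


def critical_band_levels_db(signal, sample_rate):
    """Return band levels in dB relative to the strongest critical band.

    Referencing every band to the strongest one removes any overall gain,
    so the result does not depend on the recording level.
    """
    signal = np.asarray(signal, dtype=float)
    windowed = signal * np.hanning(len(signal))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)

    # Use only the bands that lie completely below the Nyquist frequency.
    n_bands = int(np.sum(BAND_EDGES_HZ[1:] <= sample_rate / 2))
    energies = np.array([
        power[(freqs >= lo) & (freqs < hi)].sum()
        for lo, hi in zip(BAND_EDGES_HZ[:n_bands], BAND_EDGES_HZ[1:n_bands + 1])
    ])

    # Floor empty bands relative to the maximum to avoid log(0) while
    # keeping the result independent of the input scale.
    energies = np.maximum(energies, energies.max() * 1e-12)
    levels = 10.0 * np.log10(energies)
    return levels - levels.max()


if __name__ == "__main__":
    # Synthetic stand-in for a vowel: a strong low component plus a weaker
    # component inside the 3150-3700 Hz band.
    sr = 16000
    t = np.arange(sr) / sr
    demo = np.sin(2 * np.pi * 250 * t) + 0.1 * np.sin(2 * np.pi * 3400 * t)
    levels = critical_band_levels_db(demo, sr)
    print("Relative level in 3150-3700 Hz band (dB):", levels[SPEAKER_FORMANT_BAND])
```

Under these assumptions, comparing `levels[SPEAKER_FORMANT_BAND]` between recordings corresponds to comparing the relative height of the speaker's formant. A real analysis of connected speech would additionally need vowel segmentation and averaging over frames, which this sketch omits.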