Murphy Peter
Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland.
J Voice. 2008 Mar;22(2):125-37. doi: 10.1016/j.jvoice.2006.09.007. Epub 2006 Dec 4.
An investigation of the effect of glottal source aperiodicities (jitter, shimmer, and aspiration noise) on the estimation of fundamental frequency (f0) perturbation and amplitude perturbation, of synthesized, glottal source and voiced speech waveforms, is considered. Firstly, 4, cycle-event f0 estimators are examined: (1) waveform matching of the low-pass filtered waveform, (2) positive peaks (PPs) from the speech waveform, (3) PPs from the low-pass filtered waveform, and (4) positive zero crossings from the low-pass filtered waveform. The analysis shows that f0 perturbation measures taken from the low-pass filtered waveform are affected by both amplitude perturbation and random glottal noise, whereas, f0 perturbation measures taken from the PPs of the original waveform are affected by noise but not by amplitude perturbation. It is shown for the low-pass filter methods that the effects of amplitude perturbation and noise lead to increased errors in the measurement of f0 perturbation for the synthesized speech waveforms when compared with the synthesized glottal waveforms. Shimmer of the synthesized speech waveform is approximately equal to shimmer of the synthesized glottal source. However, noise and jitter affect measures of amplitude perturbation. The estimation of f0 perturbation from the synthesized speech waveform is shown to be nonlinearly related to f0 perturbation estimation from the synthesized glottal waveform as a consequence of the filtering action of the vocal tract. Low-pass filtering the voiced speech waveform is shown to provide a partial solution to this problem.
研究了声门源非周期性(抖动、闪烁和吸气噪声)对合成的声门源和浊音语音波形的基频(f0)扰动估计和幅度扰动估计的影响。首先,研究了4种周期事件f0估计器:(1)低通滤波波形的波形匹配;(2)语音波形的正峰值(PPs);(3)低通滤波波形的PPs;(4)低通滤波波形的正过零点。分析表明,从低通滤波波形中获取的f0扰动测量值受幅度扰动和随机声门噪声的影响,而从原始波形的PPs中获取的f0扰动测量值受噪声影响但不受幅度扰动影响。结果表明,对于低通滤波方法,与合成声门波形相比,幅度扰动和噪声的影响导致合成语音波形的f0扰动测量误差增加。合成语音波形的闪烁近似等于合成声门源的闪烁。然而,噪声和抖动会影响幅度扰动的测量。由于声道的滤波作用,合成语音波形的f0扰动估计与合成声门波形的f0扰动估计呈非线性关系。对浊音语音波形进行低通滤波被证明是解决该问题的一种部分解决方案。