Camacho Arturo, Harris John G
Computational NeuroEngineering Laboratory, University of Florida, Gainesville, Florida 32611, USA.
J Acoust Soc Am. 2008 Sep;124(3):1638-52. doi: 10.1121/1.2951592.
A sawtooth waveform inspired pitch estimator (SWIPE) has been developed for speech and music. SWIPE estimates the pitch as the fundamental frequency of the sawtooth waveform whose spectrum best matches the spectrum of the input signal. The comparison of the spectra is done by computing a normalized inner product between the spectrum of the signal and a modified cosine. The size of the analysis window is chosen appropriately to make the width of the main lobes of the spectrum match the width of the positive lobes of the cosine. SWIPE('), a variation of SWIPE, utilizes only the first and prime harmonics of the signal, which significantly reduces subharmonic errors commonly found in other pitch estimation algorithms. The authors' tests indicate that SWIPE and SWIPE(') performed better on two spoken speech and one disordered voice database and one musical instrument database consisting of single notes performed at a variety of pitches.
一种受锯齿波波形启发的基音估计器(SWIPE)已被开发用于语音和音乐。SWIPE将基音估计为锯齿波波形的基频,该锯齿波波形的频谱与输入信号的频谱最匹配。频谱的比较是通过计算信号频谱与修正余弦之间的归一化内积来完成的。分析窗口的大小经过适当选择,以使频谱主瓣的宽度与余弦正瓣的宽度相匹配。SWIPE(') 是SWIPE的一种变体,它仅利用信号的一次谐波和基谐波,这显著减少了其他基音估计算法中常见的次谐波误差。作者的测试表明,SWIPE和SWIPE(') 在两个口语语音、一个紊乱语音数据库以及一个由各种音高演奏的单音符组成的乐器数据库上表现更好。