Department of Surgery, Division of Otolaryngology - Head and Neck Surgery, Medical Sciences Center, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin 53706-1532, USA.
J Voice. 2011 Jan;25(1):15-20. doi: 10.1016/j.jvoice.2009.08.004. Epub 2010 Mar 25.
OBJECTIVE/HYPOTHESIS: Low-pass filtering is often applied to eliminate effects of environmental noise when preparing voice recordings for acoustic analysis. This study tested the effects of low-pass filter cutoff frequency on the results of acoustic voice analysis, with a particular interest in the effects of low cutoff frequencies on nonlinear dynamic parameters.
A crossover randomized controlled trial was performed using voice recordings of sustained vowel phonation obtained from the Disordered Voice Database.
A second-order Butterworth filter was applied to the voices at cutoff frequencies ranging from 5000 to 40 Hz. Percent jitter, percent shimmer, fundamental frequency (F(0)), signal-to-noise ratio (SNR), correlation dimension (D(2)), and second-order entropy (K(2)) were calculated for each signal.
Traditional acoustic parameters were validly measured at cutoff frequencies as low as 300 Hz. The SNR and percent shimmer were improved by cutoff frequencies of 300 Hz or higher; F(0) and percent jitter were unaffected by filtering at these frequencies. D(2) and K(2) were measured stably for signals filtered at cutoff frequencies as low as 100 Hz.
To ensure accuracy in acoustic voice analysis, setting the cutoff frequency of a low-pass filter at least one octave above the F(0) (minimum of 300 Hz) is recommended. Nonlinear dynamic measures of D(2) and K(2) proved more robust and maintained accuracy at lower frequencies.
目的/假设:在为声学分析准备语音记录时,通常会应用低通滤波来消除环境噪声的影响。本研究测试了低通滤波器截止频率对声学语音分析结果的影响,特别关注低截止频率对非线性动态参数的影响。
使用来自 Disorder Voice Database 的持续元音发声的语音记录,进行了一项交叉随机对照试验。
使用二阶巴特沃斯滤波器对声音进行处理,截止频率范围从 5000 到 40 Hz。为每个信号计算抖动百分比、颤抖百分比、基频 (F(0))、信噪比 (SNR)、关联维数 (D(2)) 和二阶熵 (K(2))。
在截止频率低至 300 Hz 的情况下,可以有效地测量传统声学参数。SNR 和颤抖百分比在 300 Hz 或更高的截止频率下得到改善;在这些频率下滤波对 F(0)和抖动百分比没有影响。在截止频率低至 100 Hz 的情况下,D(2)和 K(2)可以稳定地测量。
为了确保声学语音分析的准确性,建议将低通滤波器的截止频率设置在 F(0)以上一个八度(最低 300 Hz)。非线性动态测量的 D(2)和 K(2)更具鲁棒性,并在较低频率下保持准确性。