Huang Gongping, Chen Jingdong, Benesty Jacob
Center of Intelligent Acoustics and Immersive Communications and School of Marine Science and Technology, Northwestern Polytechnical University, 127 Youyi West, Road, Xi'an 710072, China.
INRS-EMT, University of Quebec, 800 de la Gauchetiere Ouest, Montreal, Quebec H5A 1K6, Canada.
J Acoust Soc Am. 2015 Nov;138(5):3053-60. doi: 10.1121/1.4934954.
This paper presents an approach to the direction-of-arrival (DOA) estimation problem in acoustic environments using microphone arrays. It works in the short-time Fourier transform (STFT) domain. It first transforms the noisy speech signals received at the array into the STFT domain. A Householder transformation is then constructed and applied to the multichannel STFT coefficients in each subband. This transformation converts the multichannel STFT coefficients into two components: one is a single coefficient that is dominated by the signal of interest and the other consists of the M - 1 coefficient that is dominated by noise (or even consists of noise-only if there is no reverberation), where M is the number of sensors. A cost function is then formed from the outputs of the Householder transformation and the DOA information can subsequently be obtained by searching the extremum value of this cost function in the angle range between 0° and 180°. Simulation results are provided to illustrate the performance of this approach.
本文提出了一种利用麦克风阵列解决声学环境中到达方向(DOA)估计问题的方法。该方法在短时傅里叶变换(STFT)域中工作。它首先将阵列接收到的带噪语音信号变换到STFT域。然后构造一个豪斯霍尔德变换并将其应用于每个子带中的多通道STFT系数。该变换将多通道STFT系数转换为两个分量:一个是由感兴趣信号主导的单个系数,另一个由M - 1个由噪声主导的系数组成(如果没有混响,甚至仅由噪声组成),其中M是传感器的数量。然后根据豪斯霍尔德变换的输出形成一个代价函数,随后通过在0°到180°的角度范围内搜索该代价函数的极值来获得DOA信息。提供了仿真结果以说明该方法的性能。