IEEE Trans Cybern. 2016 Jan;46(1):20-6. doi: 10.1109/TCYB.2015.2391252. Epub 2015 Feb 11.
Steered response power phase transform (SRP-PHAT) is a method that is widely used for robust sound source localization (SSL). However, since SRP-PHAT searches over a large number of candidate locations, it is too slow to run in real-time for large-scale microphone array systems. In this paper, we propose a robust two-level search space clustering method to speed-up SRP-PHAT-based SSL. The proposed method divides the candidate locations of the sound source into a set of groups and finds a small number of groups that are likely to contain the maximum power location. By searching within the small number of groups, the computational costs are reduced by 61.8% compared to a previously proposed method without loss of accuracy.
声强相位变换谱(SRP-PHAT)是一种广泛应用于稳健声源定位(SSL)的方法。然而,由于 SRP-PHAT 在大量候选位置上进行搜索,因此对于大规模麦克风阵列系统来说,实时运行速度太慢。在本文中,我们提出了一种稳健的两级搜索空间聚类方法,以加快基于 SRP-PHAT 的 SSL。该方法将声源的候选位置划分为一组组,并找到少量可能包含最大功率位置的组。通过在少量组中搜索,可以将计算成本降低 61.8%,而不会降低准确性。