School of Electrical & Electronic Engineering, Universiti Sains Malaysia, Nibong Tebal 14300, Penang, Malaysia.
Flextronics Systems Sdn. Bhd., Batu Kawan Industrial Park PMT 719 Lingkaran Cassia Selatan, Simpang Ampat 14110, Penang, Malaysia.
Sensors (Basel). 2021 Jan 1;21(1):227. doi: 10.3390/s21010227.
In this work, a binaural model resembling the human auditory system was built using a pair of three-dimensional (3D)-printed ears to localize a sound source in both vertical and horizontal directions. An analysis on the proposed model was firstly conducted to study the correlations between the spatial auditory cues and the 3D polar coordinate of the source. Apart from the estimation techniques via interaural and spectral cues, the property from the combined direct and reverberant energy decay curve is also introduced as part of the localization strategy. The preliminary analysis reveals that the latter provides a much more accurate distance estimation when compared to approximations via sound pressure level approach, but is alone not sufficient to disambiguate the front-rear confusions. For vertical localization, it is also shown that the elevation angle can be robustly encoded through the spectral notches. By analysing the strengths and shortcomings of each estimation method, a new algorithm is formulated to localize the sound source which is also further improved by cross-correlating the interaural and spectral cues. The proposed technique has been validated via a series of experiments where the sound source was randomly placed at 30 different locations in an outdoor environment up to a distance of 19 m. Based on the experimental and numerical evaluations, the localization performance has been significantly improved with an average error of 0.5 m from the distance estimation and a considerable reduction of total ambiguous points to 3.3%.
在这项工作中,使用一对三维(3D)打印耳朵构建了一个类似于人类听觉系统的双耳模型,以在垂直和水平方向上定位声源。首先对所提出的模型进行了分析,以研究空间听觉线索与声源的 3D 极坐标之间的相关性。除了通过耳间和频谱线索进行估计技术外,还引入了来自直接和混响能量衰减曲线组合的特性作为定位策略的一部分。初步分析表明,与通过声压级方法进行的近似相比,后者提供了更准确的距离估计,但单独使用不足以消除前后混淆。对于垂直定位,还表明通过频谱凹口可以稳健地编码仰角。通过分析每种估计方法的优缺点,制定了一种新的声源定位算法,并通过互相关耳间和频谱线索进一步改进了该算法。该技术已通过一系列实验进行了验证,其中声源随机放置在户外环境中的 30 个不同位置,距离可达 19 米。基于实验和数值评估,定位性能得到了显著改善,距离估计的平均误差为 0.5 米,总歧义点数量减少了 3.3%。