Suppr超能文献

用于精确声音定位的子带中头部相关传递函数的高效近似

Efficient Approximation of Head-Related Transfer Functions in Subbands for Accurate Sound Localization.

作者信息

Marelli Damián, Baumgartner Robert, Majdak Piotr

机构信息

School of Electrical Engineering and Computer Science, University of Newcastle, Callaghan, NSW 2308, Australia; Acoustics Research Institute, Austrian Academy of Sciences, Austria (

Acoustics Research Institute, Austrian Academy of Sciences, 1040 Vienna, Austria (

出版信息

IEEE Trans Audio Speech Lang Process. 2015 Jul 1;23(7):1130-1143.

Abstract

Head-related transfer functions (HRTFs) describe the acoustic filtering of incoming sounds by the human morphology and are essential for listeners to localize sound sources in virtual auditory displays. Since rendering complex virtual scenes is computationally demanding, we propose four algorithms for efficiently representing HRTFs in subbands, i.e., as an analysis filterbank (FB) followed by a transfer matrix and a synthesis FB. All four algorithms use sparse approximation procedures to minimize the computational complexity while maintaining perceptually relevant HRTF properties. The first two algorithms separately optimize the complexity of the transfer matrix associated to each HRTF for fixed FBs. The other two algorithms jointly optimize the FBs and transfer matrices for complete HRTF sets by two variants. The first variant aims at minimizing the complexity of the transfer matrices, while the second one does it for the FBs. Numerical experiments investigate the latency-complexity trade-off and show that the proposed methods offer significant computational savings when compared with other available approaches. Psychoacoustic localization experiments were modeled and conducted to find a reasonable approximation tolerance so that no significant localization performance degradation was introduced by the subband representation.

摘要

头部相关传递函数(HRTFs)描述了人体形态对传入声音的声学滤波作用,对于听众在虚拟听觉显示中定位声源至关重要。由于渲染复杂的虚拟场景在计算上要求很高,我们提出了四种算法,用于在子带中高效表示HRTFs,即作为一个分析滤波器组(FB),后跟一个传递矩阵和一个合成FB。所有四种算法都使用稀疏逼近程序来最小化计算复杂度,同时保持与感知相关的HRTF属性。前两种算法针对固定的FB分别优化与每个HRTF相关的传递矩阵的复杂度。另外两种算法通过两个变体为完整的HRTF集联合优化FB和传递矩阵。第一个变体旨在最小化传递矩阵的复杂度,而第二个则针对FB进行此操作。数值实验研究了延迟 - 复杂度权衡,并表明与其他可用方法相比,所提出的方法在计算上有显著节省。对心理声学定位实验进行了建模和实施,以找到合理的近似容差,从而使子带表示不会导致显著的定位性能下降。

相似文献

3
Comparative Analysis of HRTFs Measurement Using In-Ear Microphones.
Sensors (Basel). 2023 Jun 29;23(13):6016. doi: 10.3390/s23136016.
4
A priori mesh grading for the numerical calculation of the head-related transfer functions.
Appl Acoust. 2016 Dec 15;114:99-110. doi: 10.1016/j.apacoust.2016.07.005.
5
Perceptually enhanced spectral distance metric for head-related transfer function quality prediction.
J Acoust Soc Am. 2024 Dec 1;156(6):4133-4152. doi: 10.1121/10.0034632.
9
Sensitivity of human subjects to head-related transfer-function phase spectra.
J Acoust Soc Am. 1999 May;105(5):2821-40. doi: 10.1121/1.426898.
10
Localization using nonindividualized head-related transfer functions.
J Acoust Soc Am. 1993 Jul;94(1):111-23. doi: 10.1121/1.407089.

引用本文的文献

1
Deep Learning: A Rapid and Efficient Route to Automatic Metasurface Design.
Adv Sci (Weinh). 2019 Apr 19;6(12):1900128. doi: 10.1002/advs.201900128. eCollection 2019 Jun 19.

本文引用的文献

1
Modeling sound-source localization in sagittal planes for human listeners.
J Acoust Soc Am. 2014 Aug;136(2):791-802. doi: 10.1121/1.4887447.
2
Acoustic and non-acoustic factors in modeling listener-specific performance of sagittal-plane sound localization.
Front Psychol. 2014 Apr 23;5:319. doi: 10.3389/fpsyg.2014.00319. eCollection 2014.
3
Sound localization in individualized and non-individualized crosstalk cancellation systems.
J Acoust Soc Am. 2013 Apr;133(4):2055-68. doi: 10.1121/1.4792355.
4
3-D localization of virtual sound sources: effects of visual environment, pointing method, and training.
Atten Percept Psychophys. 2010 Feb;72(2):454-69. doi: 10.3758/APP.72.2.454.
5
Interaural fluctuations and the detection of interaural incoherence: bandwidth effects.
J Acoust Soc Am. 2006 Jun;119(6):3971-86. doi: 10.1121/1.2200147.
6
Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited.
J Acoust Soc Am. 2002 May;111(5 Pt 1):2219-36. doi: 10.1121/1.1471898.
7
Virtual localization improved by scaling nonindividualized external-ear transfer functions in frequency.
J Acoust Soc Am. 1999 Sep;106(3 Pt 1):1493-510. doi: 10.1121/1.427147.
8
Difference limens for phase in normal and hearing-impaired subjects.
J Acoust Soc Am. 1989 Oct;86(4):1351-65. doi: 10.1121/1.398695.
9
Derivation of auditory filter shapes from notched-noise data.
Hear Res. 1990 Aug 1;47(1-2):103-38. doi: 10.1016/0378-5955(90)90170-t.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验