Deep learning assisted sound source localization using two orthogonal first-order differential microphone arrays.

Authors

Liu Nian, Chen Huawei, Songgong Kunkun, Li Yanwen

Affiliation

College of Electronic and Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China.

Publication

J Acoust Soc Am. 2021 Feb;149(2):1069. doi: 10.1121/10.0003445.

DOI: 10.1121/10.0003445
PMID: 33639792
Abstract

Sound source localization in noisy and reverberant rooms using microphone arrays remains a challenging task, especially for small-sized arrays. Recent years have seen promising advances on deep learning assisted approaches by reformulating the sound localization problem as a classification one. A key to the deep learning-based approaches lies in extracting sound location features effectively in noisy and reverberant conditions. The popularly adopted features are based on the well-established generalized cross correlation phase transform (GCC-PHAT), which is known to be helpful in combating room reverberation. However, the GCC-PHAT features may not be applicable to small-sized arrays. This paper proposes a deep learning assisted sound localization method using a small-sized microphone array constructed by two orthogonal first-order differential microphone arrays. An improved feature extraction scheme based on sound intensity estimation is also proposed by decoupling the correlation between sound pressure and particle velocity components in the whitening weighting construction to enhance the robustness of the time-frequency bin-wise sound intensity features. Simulation and real-world experimental results show that the proposed deep learning assisted approach can achieve higher spatial resolution and is superior to its state-of-the-art counterparts using the GCC-PHAT or sound intensity features for small-sized arrays in noisy and reverberant environments.
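
For readers unfamiliar with the GCC-PHAT baseline named in the abstract, the following is a minimal NumPy sketch of the textbook formulation; it is not the authors' code. The function names (`gcc_phat`, `estimate_delay`, `max_delay_samples`), the 2 cm spacing, the 16 kHz sampling rate, and the 343 m/s sound speed are illustrative assumptions. The last helper shows one commonly cited reason, not spelled out in the abstract, why delay-based features can struggle on small arrays: the largest possible inter-microphone delay falls below one sample.

```python
import numpy as np


def gcc_phat(x, y, max_lag):
    """PHAT-weighted cross-correlation of x and y (textbook form).

    Returns (lags, cc) for integer lags in [-max_lag, max_lag];
    max_lag must be a positive integer.
    """
    n = len(x) + len(y)                    # zero-pad to limit circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)                 # cross-power spectrum
    cross = cross / (np.abs(cross) + 1e-12)  # PHAT whitening: keep phase only
    cc = np.fft.irfft(cross, n=n)
    # Reorder so the output runs from lag -max_lag to lag +max_lag.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    lags = np.arange(-max_lag, max_lag + 1)
    return lags, cc


def estimate_delay(x, y, max_lag):
    """Integer delay (in samples) by which x lags y; negative if x leads."""
    lags, cc = gcc_phat(x, y, max_lag)
    return int(lags[np.argmax(cc)])


def max_delay_samples(spacing_m, fs_hz, c=343.0):
    """Largest possible inter-microphone delay, in samples (assumed values)."""
    return spacing_m / c * fs_hz


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    s = rng.standard_normal(1024)                 # synthetic white-noise source
    x = np.concatenate((np.zeros(5), s[:-5]))     # copy of s delayed by 5 samples
    print("estimated delay:", estimate_delay(x, s, max_lag=20))
    print("max delay for 2 cm at 16 kHz:", max_delay_samples(0.02, 16000.0))
```

With a peak resolution of one sample, an array whose maximum delay is under one sample leaves the integer-lag GCC-PHAT peak almost no room to distinguish directions, which is the kind of limitation that motivates the sound-intensity features the paper proposes instead.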

