• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于稀疏贝叶斯学习波束形成的声源定位和语音增强。

Sound source localization and speech enhancement with sparse Bayesian learning beamforming.

机构信息

GN Hearing A/S, DK-2750 Ballerup, Denmark.

Audio Analysis Lab, AD:MT, Aalborg University, DK-9000 Aalborg, Denmark.

出版信息

J Acoust Soc Am. 2018 Jun;143(6):3912. doi: 10.1121/1.5042222.

DOI:10.1121/1.5042222
PMID:29960460
Abstract

Speech localization and enhancement involves sound source mapping and reconstruction from noisy recordings of speech mixtures with microphone arrays. Conventional beamforming methods suffer from low resolution, especially with a limited number of microphones. In practice, there are only a few sources compared to the possible directions-of-arrival (DOA). Hence, DOA estimation is formulated as a sparse signal reconstruction problem and solved with sparse Bayesian learning (SBL). SBL uses a hierarchical two-level Bayesian inference to reconstruct sparse estimates from a small set of observations. The first level derives the posterior probability of the complex source amplitudes from the data likelihood and the prior. The second level tunes the prior towards sparse solutions with hyperparameters which maximize the evidence, i.e., the data probability. The adaptive learning of the hyperparameters from the data auto-regularizes the inference problem towards sparse robust estimates. Simulations and experimental data demonstrate that SBL beamforming provides high-resolution DOA maps outperforming traditional methods especially for correlated or non-stationary signals. Specifically for speech signals, the high-resolution SBL reconstruction offers not only speech enhancement but effectively speech separation.

摘要

语音定位和增强涉及到声源映射和重建,从带有麦克风阵列的语音混合噪声记录中进行。传统的波束形成方法分辨率较低,尤其是在麦克风数量有限的情况下。在实际中,与可能的到达方向(DOA)相比,声源数量较少。因此,DOA 估计被表述为稀疏信号重建问题,并通过稀疏贝叶斯学习(SBL)来解决。SBL 使用两级分层贝叶斯推理,从小数据集观测中重建稀疏估计值。第一级从数据似然和先验中得出复源幅度的后验概率。第二级通过调整超参数(最大化证据,即数据概率)使先验朝着稀疏解进行调整。超参数从数据中自适应学习会使推断问题朝着稀疏稳健估计进行自动正则化。模拟和实验数据表明,SBL 波束形成提供了高分辨率的 DOA 图,比传统方法表现更好,特别是对于相关或非平稳信号。对于语音信号,高分辨率 SBL 重建不仅提供了语音增强,还实现了有效的语音分离。

相似文献

1
Sound source localization and speech enhancement with sparse Bayesian learning beamforming.基于稀疏贝叶斯学习波束形成的声源定位和语音增强。
J Acoust Soc Am. 2018 Jun;143(6):3912. doi: 10.1121/1.5042222.
2
Phase-based dual-microphone robust speech enhancement.基于相位的双麦克风鲁棒语音增强
IEEE Trans Syst Man Cybern B Cybern. 2004 Aug;34(4):1763-73. doi: 10.1109/tsmcb.2004.830345.
3
Long short-term memory for speaker generalization in supervised speech separation.用于监督语音分离中说话人泛化的长短期记忆网络
J Acoust Soc Am. 2017 Jun;141(6):4705. doi: 10.1121/1.4986931.
4
Impact of phase estimation on single-channel speech separation based on time-frequency masking.相位估计对基于时频掩蔽的单通道语音分离的影响。
J Acoust Soc Am. 2017 Jun;141(6):4668. doi: 10.1121/1.4986647.
5
Passive Sonar Target Identification Using Multiple-Measurement Sparse Bayesian Learning.基于多测量稀疏贝叶斯学习的被动声纳目标识别。
Sensors (Basel). 2022 Nov 4;22(21):8511. doi: 10.3390/s22218511.
6
Issues in forensic voice.法医语音学中的问题。
J Voice. 2014 Mar;28(2):170-84. doi: 10.1016/j.jvoice.2013.06.011. Epub 2013 Oct 28.
7
Speech intelligibility estimation using multi-resolution spectral features for speakers undergoing cancer treatment.使用多分辨率频谱特征对癌症治疗患者的语音清晰度进行估计。
J Acoust Soc Am. 2014 Oct;136(4):EL315-21. doi: 10.1121/1.4896410.
8
Block-sparse beamforming for spatially extended sources in a Bayesian formulation.贝叶斯公式下用于空间扩展源的块稀疏波束形成
J Acoust Soc Am. 2016 Sep;140(3):1828. doi: 10.1121/1.4962325.
9
The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio.基于信噪比的语音分离最优时频掩蔽比。
J Acoust Soc Am. 2013 Nov;134(5):EL452-8. doi: 10.1121/1.4824632.
10
A Bayesian inference model for speech localization (L).贝叶斯推断模型在语音定位(L)中的应用。
J Acoust Soc Am. 2012 Sep;132(3):1257-60. doi: 10.1121/1.4740489.

引用本文的文献

1
Sound Event Localization and Detection Using Imbalanced Real and Synthetic Data via Multi-Generator.使用多生成器对不平衡真实和合成数据进行声音事件定位和检测。
Sensors (Basel). 2023 Mar 23;23(7):3398. doi: 10.3390/s23073398.
2
Passive Sonar Target Identification Using Multiple-Measurement Sparse Bayesian Learning.基于多测量稀疏贝叶斯学习的被动声纳目标识别。
Sensors (Basel). 2022 Nov 4;22(21):8511. doi: 10.3390/s22218511.
3
Frequency Analysis of Acoustic Data Using Multiple-Measurement Sparse Bayesian Learning.基于多测量稀疏贝叶斯学习的声学数据频率分析
Sensors (Basel). 2021 Aug 30;21(17):5827. doi: 10.3390/s21175827.
4
Parametric Estimations Based on Homomorphic Deconvolution for Time of Flight in Sound Source Localization System.基于同态反卷积的声源定位系统飞行时间参数估计
Sensors (Basel). 2020 Feb 10;20(3):925. doi: 10.3390/s20030925.