• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度滤波器组均衡器的低延迟单声道语音增强。

Low-latency monaural speech enhancement with deep filter-bank equalizer.

机构信息

Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, 100190 Beijing, China.

出版信息

J Acoust Soc Am. 2022 May;151(5):3291. doi: 10.1121/10.0011396.

DOI:10.1121/10.0011396
PMID:35649938
Abstract

It is highly desirable that speech enhancement algorithms can achieve good performance while keeping low latency for many applications, such as digital hearing aids, mobile phones, acoustically transparent hearing devices, and public address systems. To improve the performance of traditional low-latency speech enhancement algorithms, a deep filter-bank equalizer (FBE) framework was proposed that integrated a deep learning-based subband noise reduction network with a deep learning-based shortened digital filter mapping network. In the first network, a deep learning model was trained with a controllable small frame shift to satisfy the low-latency demand, i.e., no greater than 4 ms, so as to obtain (complex) subband gains that could be regarded as an adaptive digital filter in each frame. In the second network, to reduce the latency, this adaptive digital filter was implicitly shortened by a deep learning-based framework and was then applied to noisy speech to reconstruct the enhanced speech without the overlap-add method. Experimental results on the WSJ0-SI84 corpus indicated that the proposed DeepFBE with only 4-ms latency achieved much better performance than traditional low-latency speech enhancement algorithms across several objective metrics. Listening test results further confirmed that our approach achieved higher speech quality than other methods.

摘要

对于许多应用程序,如数字助听器、手机、透明听力设备和公共广播系统,希望语音增强算法能够在保持低延迟的同时实现良好的性能。为了提高传统低延迟语音增强算法的性能,提出了一种深度滤波器组均衡器 (FBE) 框架,该框架集成了基于深度学习的子带降噪网络和基于深度学习的缩短数字滤波器映射网络。在第一个网络中,使用可控制的小帧移训练深度学习模型,以满足低延迟需求,即不大于 4ms,从而获得(复)子带增益,这些增益可以在每帧中被视为自适应数字滤波器。在第二个网络中,为了降低延迟,通过基于深度学习的框架隐式缩短该自适应数字滤波器,然后将其应用于噪声语音,以无需重叠相加方法重建增强语音。在 WSJ0-SI84 语料库上的实验结果表明,与几种客观指标相比,具有仅 4ms 延迟的 DeepFBE 实现了更好的性能。听力测试结果进一步证实了我们的方法比其他方法具有更高的语音质量。

相似文献

1
Low-latency monaural speech enhancement with deep filter-bank equalizer.基于深度滤波器组均衡器的低延迟单声道语音增强。
J Acoust Soc Am. 2022 May;151(5):3291. doi: 10.1121/10.0011396.
2
Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction.基于非负稀疏编码的残余噪声降低驱动的可穿戴听力设备频谱增强
Sensors (Basel). 2020 Oct 10;20(20):5751. doi: 10.3390/s20205751.
3
Sixty Years of Frequency-Domain Monaural Speech Enhancement: From Traditional to Deep Learning Methods.六十年频域单声道语音增强:从传统方法到深度学习方法。
Trends Hear. 2023 Jan-Dec;27:23312165231209913. doi: 10.1177/23312165231209913.
4
CompNet: Complementary network for single-channel speech enhancement.CompNet:用于单通道语音增强的互补网络。
Neural Netw. 2023 Nov;168:508-517. doi: 10.1016/j.neunet.2023.09.041. Epub 2023 Sep 25.
5
Deep Learning Based Real-time Speech Enhancement for Dual-microphone Mobile Phones.基于深度学习的双麦克风手机实时语音增强
IEEE/ACM Trans Audio Speech Lang Process. 2021;29:1853-1863. doi: 10.1109/taslp.2021.3082318. Epub 2021 May 21.
6
A method for enhancing speech and warning signals based on parallel convolutional neural networks in a noisy environment.基于噪声环境下并行卷积神经网络的语音增强和预警信号方法。
Technol Health Care. 2021;29(S1):141-152. doi: 10.3233/THC-218015.
7
Speech quality evaluation of a sparse coding shrinkage noise reduction algorithm with normal hearing and hearing impaired listeners.针对正常听力和听力受损听众的稀疏编码收缩降噪算法的语音质量评估
Hear Res. 2015 Sep;327:175-85. doi: 10.1016/j.heares.2015.07.019. Epub 2015 Jul 29.
8
Evaluation of model-based versus non-parametric monaural noise-reduction approaches for hearing aids.基于模型的与非参数单耳降噪方法在助听器中的评估。
Int J Audiol. 2012 Aug;51(8):627-39. doi: 10.3109/14992027.2012.684405. Epub 2012 May 30.
9
A Supervised Speech Enhancement Method for Smartphone-Based Binaural Hearing Aids.基于智能手机的双耳助听的监督式语音增强方法。
IEEE Trans Biomed Circuits Syst. 2020 Oct;14(5):951-960. doi: 10.1109/TBCAS.2020.2988121. Epub 2020 Apr 20.
10
Improving the performance of hearing aids in noisy environments based on deep learning technology.基于深度学习技术提高助听器在嘈杂环境中的性能。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:404-408. doi: 10.1109/EMBC.2018.8512277.

引用本文的文献

1
Neural-WDRC: A Deep Learning Wide Dynamic Range Compression Method Combined With Controllable Noise Reduction for Hearing Aids.神经宽动态范围压缩:一种结合可控降噪的深度学习宽动态范围压缩方法用于助听器。
Trends Hear. 2025 Jan-Dec;29:23312165241309301. doi: 10.1177/23312165241309301.
2
Sixty Years of Frequency-Domain Monaural Speech Enhancement: From Traditional to Deep Learning Methods.六十年频域单声道语音增强:从传统方法到深度学习方法。
Trends Hear. 2023 Jan-Dec;27:23312165231209913. doi: 10.1177/23312165231209913.