• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用神经图估计先验信噪比以进行语音增强。

Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement.

作者信息

Jassim Wissam A, Harte Naomi

机构信息

Sigmedia Group, ADAPT Centre, School of Engineering, Trinity College Dublin, Ireland.

出版信息

J Acoust Soc Am. 2020 Jun;147(6):3830. doi: 10.1121/10.0001324.

DOI:10.1121/10.0001324
PMID:32611151
Abstract

In statistical-based speech enhancement algorithms, the a priori signal-to-noise ratio (SNR) must be estimated to calculate the required spectral gain function. This paper proposes a method to improve this estimation using features derived from the neural responses of the auditory-nerve (AN) system. The neural responses, interpreted as a neurogram (NG), are simulated for noisy speech using a computational model of the AN system with a range of characteristic frequencies (CFs). Two machine learning algorithms were explored to train the estimation model based on NG features: support vector regression and a convolutional neural network. The proposed estimator was placed in a common speech enhancement system, and three conventional spectral gain functions were employed to estimate the enhanced signal. The proposed method was tested using the NOIZEUS database at different SNR levels, and various speech quality and intelligibility measures were employed for performance evaluation. The a priori SNR estimated from NG features achieved better quality and intelligibility scores than that of recent estimators, especially for highly distorted speech and low SNR values.

摘要

在基于统计的语音增强算法中,必须估计先验信噪比(SNR)以计算所需的频谱增益函数。本文提出了一种利用从听觉神经(AN)系统的神经反应中提取的特征来改进这种估计的方法。使用具有一系列特征频率(CFs)的AN系统计算模型,对噪声语音的神经反应进行模拟,将其解释为神经图(NG)。探索了两种机器学习算法,基于NG特征训练估计模型:支持向量回归和卷积神经网络。将所提出的估计器置于通用语音增强系统中,并采用三种传统频谱增益函数来估计增强信号。使用NOIZEUS数据库在不同SNR水平下对所提出的方法进行测试,并采用各种语音质量和可懂度度量进行性能评估。从NG特征估计的先验SNR比最近的估计器获得了更好的质量和可懂度分数,特别是对于高度失真的语音和低SNR值。

相似文献

1
Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement.使用神经图估计先验信噪比以进行语音增强。
J Acoust Soc Am. 2020 Jun;147(6):3830. doi: 10.1121/10.0001324.
2
On training targets for deep learning approaches to clean speech magnitude spectrum estimation.深度学习方法在语音谱估计中对训练目标的研究。
J Acoust Soc Am. 2021 May;149(5):3273. doi: 10.1121/10.0004823.
3
Evaluation of the sparse coding shrinkage noise reduction algorithm in normal hearing and hearing impaired listeners.正常听力和听力受损听众中稀疏编码收缩降噪算法的评估
Hear Res. 2014 Apr;310:36-47. doi: 10.1016/j.heares.2014.01.006. Epub 2014 Feb 2.
4
Reference-Free Assessment of Speech Intelligibility Using Bispectrum of an Auditory Neurogram.基于听觉神经图双谱估计的免参考语音可懂度评估
PLoS One. 2016 Mar 11;11(3):e0150415. doi: 10.1371/journal.pone.0150415. eCollection 2016.
5
A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery.一种利用听觉外周模型响应的稳健说话人识别系统。
PLoS One. 2016 Jul 8;11(7):e0158520. doi: 10.1371/journal.pone.0158520. eCollection 2016.
6
Influence of MVDR beamformer on a Speech Enhancement based Smartphone application for Hearing Aids.最小方差无失真响应(MVDR)波束形成器对基于语音增强的助听器智能手机应用的影响。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:417-420. doi: 10.1109/EMBC.2018.8512369.
7
Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction.基于非负稀疏编码的残余噪声降低驱动的可穿戴听力设备频谱增强
Sensors (Basel). 2020 Oct 10;20(20):5751. doi: 10.3390/s20205751.
8
Speech onset enhancement improves intelligibility in adverse listening conditions for cochlear implant users.语音起始增强可改善人工耳蜗使用者在不利聆听条件下的言语可懂度。
Hear Res. 2016 Dec;342:13-22. doi: 10.1016/j.heares.2016.09.002. Epub 2016 Sep 30.
9
Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners.受听觉启发的机器学习技术可以提高听力受损听众的语音清晰度和质量。
J Acoust Soc Am. 2017 Mar;141(3):1985. doi: 10.1121/1.4977197.
10
Towards real-world objective speech quality and intelligibility assessment using speech-enhancement residuals and convolutional long short-term memory networks.利用语音增强残差和卷积长短期记忆网络进行真实客观的语音质量和可懂度评估。
J Acoust Soc Am. 2020 Nov;148(5):3348. doi: 10.1121/10.0002702.

引用本文的文献

1
Texture synthesis of ecological plant protection image based on convolution neural network.基于卷积神经网络的生态植物保护图像纹理合成
Front Plant Sci. 2022 Oct 18;13:1035077. doi: 10.3389/fpls.2022.1035077. eCollection 2022.