• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

噪声语音的时频掩蔽降噪的感知效果。

Perceptual effects of noise reduction by time-frequency masking of noisy speech.

机构信息

Academic Medical Center, Clinical and Experimental Audiology, Meibergdreef 9, 1105 AZ, Amsterdam, The Netherlands.

出版信息

J Acoust Soc Am. 2012 Oct;132(4):2690-9. doi: 10.1121/1.4747006.

DOI:10.1121/1.4747006
PMID:23039461
Abstract

Time-frequency masking is a method for noise reduction that is based on the time-frequency representation of a speech in noise signal. Depending on the estimated signal-to-noise ratio (SNR), each time-frequency unit is either attenuated or not. A special type of a time-frequency mask is the ideal binary mask (IBM), which has access to the real SNR (ideal). The IBM either retains or removes each time-frequency unit (binary mask). The IBM provides large improvements in speech intelligibility and is a valuable tool for investigating how different factors influence intelligibility. This study extends the standard outcome measure (speech intelligibility) with additional perceptual measures relevant for noise reduction: listening effort, noise annoyance, speech naturalness, and overall preference. Four types of time-frequency masking were evaluated: the original IBM, a tempered version of the IBM (called ITM) which applies limited and non-binary attenuation, and non-ideal masking (also tempered) with two different types of noise-estimation algorithms. The results from ideal masking imply that there is a trade-off between intelligibility and sound quality, which depends on the attenuation strength. Additionally, the results for non-ideal masking suggest that subjective measures can show effects of noise reduction even if noise reduction does not lead to differences in intelligibility.

摘要

时频掩蔽是一种基于噪声信号中语音的时频表示的降噪方法。根据估计的信噪比(SNR),每个时频单元要么被衰减,要么不被衰减。一种特殊类型的时频掩蔽是理想二进制掩蔽(IBM),它可以访问真实的 SNR(理想)。IBM 要么保留要么去除每个时频单元(二进制掩蔽)。IBM 可以显著提高语音可懂度,是研究不同因素如何影响可懂度的有价值的工具。本研究通过附加与降噪相关的额外感知测量来扩展标准结果测量(语音可懂度):听力努力、噪声烦恼、语音自然度和整体偏好。评估了四种类型的时频掩蔽:原始 IBM、IBM 的温和版本(称为 ITM),它应用有限和非二进制衰减,以及具有两种不同噪声估计算法的非理想掩蔽(也温和)。理想掩蔽的结果表明,可懂度和音质之间存在权衡,这取决于衰减强度。此外,非理想掩蔽的结果表明,即使降噪不会导致可懂度的差异,主观测量也可以显示降噪的效果。

相似文献

1
Perceptual effects of noise reduction by time-frequency masking of noisy speech.噪声语音的时频掩蔽降噪的感知效果。
J Acoust Soc Am. 2012 Oct;132(4):2690-9. doi: 10.1121/1.4747006.
2
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.
3
Speech intelligibility in reverberation with ideal binary masking: effects of early reflections and signal-to-noise ratio threshold.混响环境下理想二值掩蔽对言语可懂度的影响:早期反射声和信噪比阈的作用。
J Acoust Soc Am. 2013 Mar;133(3):1707-17. doi: 10.1121/1.4789895.
4
The Influence of Noise Reduction on Speech Intelligibility, Response Times to Speech, and Perceived Listening Effort in Normal-Hearing Listeners.噪声降低对正常听力者言语可懂度、言语反应时间和感知聆听努力的影响。
Trends Hear. 2017 Jan-Dec;21:2331216517716844. doi: 10.1177/2331216517716844.
5
Intelligibility of reverberant noisy speech with ideal binary masking.用理想二值掩蔽评估混响噪声语音的可懂度。
J Acoust Soc Am. 2011 Oct;130(4):2153-61. doi: 10.1121/1.3631668.
6
Improving word recognition in noise among hearing-impaired subjects with a single-channel cochlear noise-reduction algorithm.提高单通道耳蜗降噪算法助听受试者噪声中单词识别能力。
J Acoust Soc Am. 2012 Sep;132(3):1718-31. doi: 10.1121/1.4739441.
7
Speech intelligibility in background noise with ideal binary time-frequency masking.基于理想二元时频掩蔽的背景噪声下语音清晰度
J Acoust Soc Am. 2009 Apr;125(4):2336-47. doi: 10.1121/1.3083233.
8
An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech.基于时频加权噪声语音可懂度预测的客观测量评估。
J Acoust Soc Am. 2011 Nov;130(5):3013-27. doi: 10.1121/1.3641373.
9
Speech perception of noise with binary gains.具有二元增益的噪声语音感知
J Acoust Soc Am. 2008 Oct;124(4):2303-7. doi: 10.1121/1.2967865.
10
Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.添加背景噪声可提高理想二值掩蔽噪声语音的可懂度。
J Acoust Soc Am. 2011 Apr;129(4):2227-36. doi: 10.1121/1.3559707.

引用本文的文献

1
Neural-WDRC: A Deep Learning Wide Dynamic Range Compression Method Combined With Controllable Noise Reduction for Hearing Aids.神经宽动态范围压缩:一种结合可控降噪的深度学习宽动态范围压缩方法用于助听器。
Trends Hear. 2025 Jan-Dec;29:23312165241309301. doi: 10.1177/23312165241309301.
2
An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognitiona).一种理想的压缩式面罩,用于在不牺牲环境声音识别能力的情况下提高语音清晰度a)。
J Acoust Soc Am. 2024 Dec 1;156(6):3958-3969. doi: 10.1121/10.0034599.
3
Listening to Music Through Hearing Aids: Potential Lessons for Cochlear Implants.
通过助听器听音乐:对人工耳蜗植入的潜在启示。
Trends Hear. 2022 Jan-Dec;26:23312165211072969. doi: 10.1177/23312165211072969.
4
An ideal quantized mask to increase intelligibility and quality of speech in noise.一种理想的量化掩蔽,可提高噪声中的语音可懂度和质量。
J Acoust Soc Am. 2018 Sep;144(3):1392. doi: 10.1121/1.5053115.
5
Efficacy of a Hearing Aid Noise Reduction Function.助听器降噪功能的效果。
Trends Hear. 2018 Jan-Dec;22:2331216518782839. doi: 10.1177/2331216518782839.
6
The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility.将深度神经网络架构与理想比率掩蔽估计相结合,以提高语音可懂度,从而在计算语音分离中获益。
PLoS One. 2018 May 15;13(5):e0196924. doi: 10.1371/journal.pone.0196924. eCollection 2018.
7
The Benefits of Bimodal Aiding on Extended Dimensions of Speech Perception: Intelligibility, Listening Effort, and Sound Quality.双模式辅助对言语感知扩展维度的益处:可懂度、聆听努力程度和声音质量。
Trends Hear. 2017 Jan-Dec;21:2331216517727900. doi: 10.1177/2331216517727900.
8
The Influence of Noise Reduction on Speech Intelligibility, Response Times to Speech, and Perceived Listening Effort in Normal-Hearing Listeners.噪声降低对正常听力者言语可懂度、言语反应时间和感知聆听努力的影响。
Trends Hear. 2017 Jan-Dec;21:2331216517716844. doi: 10.1177/2331216517716844.
9
Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users.基于神经网络的语音增强技术可提高人工耳蜗使用者在噪声环境中的语音清晰度。
Hear Res. 2017 Feb;344:183-194. doi: 10.1016/j.heares.2016.11.012. Epub 2016 Nov 30.
10
Relationship Among Signal Fidelity, Hearing Loss, and Working Memory for Digital Noise Suppression.数字噪声抑制中信号保真度、听力损失与工作记忆之间的关系。
Ear Hear. 2015 Sep-Oct;36(5):505-16. doi: 10.1097/AUD.0000000000000173.