Speech intelligibility in background noise with ideal binary time-frequency masking.

Authors

Wang DeLiang, Kjems Ulrik, Pedersen Michael S, Boldt Jesper B, Lunner Thomas

Affiliation

Department of Computer Science & Engineering and Center for Cognitive Science, The Ohio State University, Columbus, Ohio 43210, USA.

Publication

J Acoust Soc Am. 2009 Apr;125(4):2336-47. doi: 10.1121/1.3083233.

DOI: 10.1121/1.3083233
PMID: 19354408

Abstract

Ideal binary time-frequency masking is a signal separation technique that retains mixture energy in time-frequency units where local signal-to-noise ratio exceeds a certain threshold and rejects mixture energy in other time-frequency units. Two experiments were designed to assess the effects of ideal binary masking on speech intelligibility of both normal-hearing (NH) and hearing-impaired (HI) listeners in different kinds of background interference. The results from Experiment 1 demonstrate that ideal binary masking leads to substantial reductions in speech-reception threshold for both NH and HI listeners, and the reduction is greater in a cafeteria background than in a speech-shaped noise. Furthermore, listeners with hearing loss benefit more than listeners with normal hearing, particularly for cafeteria noise, and ideal masking nearly equalizes the speech intelligibility performances of NH and HI listeners in noisy backgrounds. The results from Experiment 2 suggest that ideal binary masking in the low-frequency range yields larger intelligibility improvements than in the high-frequency range, especially for listeners with hearing loss. The findings from the two experiments have major implications for understanding speech perception in noise, computational auditory scene analysis, speech enhancement, and hearing aid design.
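
The masking rule described in the abstract can be illustrated with a minimal sketch. It assumes the target and the interference are available separately as power values on a shared time-frequency grid (which is what makes the mask "ideal"). The function names, the `lc_db` threshold parameter, its default of 0 dB, and the toy numbers are illustrative assumptions, not values or code taken from the paper.

```python
import numpy as np

def ideal_binary_mask(target_power, noise_power, lc_db=0.0):
    """Return 1.0 where the local SNR in dB exceeds lc_db, else 0.0.

    target_power, noise_power: arrays of the same shape holding the
    premixed target and interference power per time-frequency unit.
    lc_db: hypothetical name for the local SNR threshold.
    """
    eps = np.finfo(float).tiny  # avoids log(0) and division by zero
    local_snr_db = 10.0 * np.log10((target_power + eps) / (noise_power + eps))
    return (local_snr_db > lc_db).astype(float)

def apply_mask(mixture_tf, mask):
    """Keep mixture energy where mask is 1 and reject it where mask is 0."""
    return mixture_tf * mask

# Toy data: 2 frequency channels x 3 time frames of power values.
target_power = np.array([[4.0, 1.0, 0.0],
                         [1.0, 9.0, 2.0]])
noise_power = np.array([[1.0, 1.0, 1.0],
                        [4.0, 1.0, 2.0]])

mask = ideal_binary_mask(target_power, noise_power, lc_db=0.0)

# For this toy case the mixture power is approximated as the sum of the two
# powers (an assumption that holds only for uncorrelated signals).
mixture_power = target_power + noise_power
masked_power = apply_mask(mixture_power, mask)
```

In this sketch a unit whose local SNR equals the threshold exactly is rejected, because the comparison is strict. Lowering `lc_db` retains more units, so the threshold controls how much of the mixture passes through the mask.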

Similar articles

1. Speech intelligibility in background noise with ideal binary time-frequency masking.
J Acoust Soc Am. 2009 Apr;125(4):2336-47. doi: 10.1121/1.3083233.
2. Intelligibility of reverberant noisy speech with ideal binary masking.
J Acoust Soc Am. 2011 Oct;130(4):2153-61. doi: 10.1121/1.3631668.
3. Speech perception of noise with binary gains.
J Acoust Soc Am. 2008 Oct;124(4):2303-7. doi: 10.1121/1.2967865.
4. Role of mask pattern in intelligibility of ideal binary-masked noisy speech.
J Acoust Soc Am. 2009 Sep;126(3):1415-26. doi: 10.1121/1.3179673.
5. Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.
J Acoust Soc Am. 2011 Apr;129(4):2227-36. doi: 10.1121/1.3559707.
6. Speech intelligibility in reverberation with ideal binary masking: effects of early reflections and signal-to-noise ratio threshold.
J Acoust Soc Am. 2013 Mar;133(3):1707-17. doi: 10.1121/1.4789895.
7. Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners.
J Acoust Soc Am. 2009 May;125(5):3358-72. doi: 10.1121/1.3110132.
8. Relations between frequency selectivity, temporal fine-structure processing, and speech reception in impaired hearing.
J Acoust Soc Am. 2009 May;125(5):3328-45. doi: 10.1121/1.3097469.
9. Intelligibility of speech in noise at high presentation levels: effects of hearing loss and frequency region.
J Acoust Soc Am. 2007 Aug;122(2):1130-7. doi: 10.1121/1.2751251.
10. Perceptual effects of noise reduction by time-frequency masking of noisy speech.
J Acoust Soc Am. 2012 Oct;132(4):2690-9. doi: 10.1121/1.4747006.

Cited by

1. An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognition.
J Acoust Soc Am. 2024 Dec 1;156(6):3958-3969. doi: 10.1121/10.0034599.
2. Parameter tuning of time-frequency masking algorithms for reverberant artifact removal within the cochlear implant stimulus.
Cochlear Implants Int. 2022 Nov;23(6):309-316. doi: 10.1080/14670100.2022.2096182. Epub 2022 Jul 23.
3. Measuring the Influence of Noise Reduction on Listening Effort in Hearing-Impaired Listeners Using Response Times to an Arithmetic Task in Noise.
Trends Hear. 2021 Jan-Dec;25:23312165211014437. doi: 10.1177/23312165211014437.
4. Supervised Speech Separation Based on Deep Learning: An Overview.
IEEE/ACM Trans Audio Speech Lang Process. 2018 Oct;26(10):1702-1726. doi: 10.1109/TASLP.2018.2842159. Epub 2018 May 30.
5. A Competing Voices Test for Hearing-Impaired Listeners Applied to Spatial Separation and Ideal Time-Frequency Masks.
Trends Hear. 2019 Jan-Dec;23:2331216519848288. doi: 10.1177/2331216519848288.
6. A Tutorial on Auditory Attention Identification Methods.
Front Neurosci. 2019 Mar 19;13:153. doi: 10.3389/fnins.2019.00153. eCollection 2019.
7. Autoscore: An open-source automated tool for scoring listener perception of speech.
J Acoust Soc Am. 2019 Jan;145(1):392. doi: 10.1121/1.5087276.
8. An ideal quantized mask to increase intelligibility and quality of speech in noise.
J Acoust Soc Am. 2018 Sep;144(3):1392. doi: 10.1121/1.5053115.
9. A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions.
J Acoust Soc Am. 2018 Sep;144(3):1627. doi: 10.1121/1.5055562.
10. Developmental Effects in Masking Release for Speech-in-Speech Perception Due to a Target/Masker Sex Mismatch.
Ear Hear. 2018 Sep/Oct;39(5):935-945. doi: 10.1097/AUD.0000000000000554.