• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

混响环境下理想二值掩蔽对言语可懂度的影响:早期反射声和信噪比阈的作用。

Speech intelligibility in reverberation with ideal binary masking: effects of early reflections and signal-to-noise ratio threshold.

机构信息

Department of Computer Science and Engineering, The Ohio State University at Lima, Lima, Ohio 45804, USA.

出版信息

J Acoust Soc Am. 2013 Mar;133(3):1707-17. doi: 10.1121/1.4789895.

DOI:10.1121/1.4789895
PMID:23464040
Abstract

Ideal binary masking is a signal processing technique that separates a desired signal from a mixture by retaining only the time-frequency units where the signal-to-noise ratio (SNR) exceeds a predetermined threshold. In reverberant conditions there are multiple possible definitions of the ideal binary mask in that one may choose to treat the target early reflections as either desired signal or noise. The ideal binary mask may therefore be parameterized by the reflection boundary, a predetermined division point between early and late reflections. Another important parameter is the local SNR threshold used in labeling the time-frequency units as either target or background. Two experiments were designed to assess the impact of these two parameters on speech intelligibility with ideal binary masking for normal-hearing listeners in reverberant conditions. Experiment 1 shows that in order to achieve intelligibility improvements only the early reflections should be preserved by the binary mask. Moreover, it shows that the effective SNR should be accounted for when deciding the local threshold optimal range. Experiment 2 shows that with long reverberation times, intelligibility improvements are only obtained when the reflection boundary is 100 ms or less. Also, the experiment suggests that binary masking can be used for dereverberation.

摘要

理想二值掩蔽是一种信号处理技术,通过仅保留信噪比(SNR)超过预定阈值的时频单元,从混合信号中分离出期望信号。在混响条件下,存在多种理想二值掩蔽的可能定义,因为可以选择将目标早期反射视为期望信号或噪声。因此,理想二值掩蔽可以通过反射边界进行参数化,反射边界是早期和晚期反射之间的预定划分点。另一个重要参数是用于将时频单元标记为目标或背景的局部 SNR 阈值。设计了两个实验来评估这两个参数对正常听力受试者在混响条件下使用理想二值掩蔽的言语可懂度的影响。实验 1 表明,为了实现可懂度的提高,二进制掩蔽只应保留早期反射。此外,实验表明,在决定局部阈值最佳范围时,应考虑有效 SNR。实验 2 表明,在较长的混响时间下,只有当反射边界为 100ms 或更短时,才会获得可懂度的提高。此外,该实验表明,二值掩蔽可用于去混响。

相似文献

1
Speech intelligibility in reverberation with ideal binary masking: effects of early reflections and signal-to-noise ratio threshold.混响环境下理想二值掩蔽对言语可懂度的影响:早期反射声和信噪比阈的作用。
J Acoust Soc Am. 2013 Mar;133(3):1707-17. doi: 10.1121/1.4789895.
2
Intelligibility of reverberant noisy speech with ideal binary masking.用理想二值掩蔽评估混响噪声语音的可懂度。
J Acoust Soc Am. 2011 Oct;130(4):2153-61. doi: 10.1121/1.3631668.
3
Speech intelligibility in background noise with ideal binary time-frequency masking.基于理想二元时频掩蔽的背景噪声下语音清晰度
J Acoust Soc Am. 2009 Apr;125(4):2336-47. doi: 10.1121/1.3083233.
4
Perceptual effects of noise reduction by time-frequency masking of noisy speech.噪声语音的时频掩蔽降噪的感知效果。
J Acoust Soc Am. 2012 Oct;132(4):2690-9. doi: 10.1121/1.4747006.
5
Effect of the division between early and late reflections on intelligibility of ideal binary-masked speech.早期反射与晚期反射之间的划分对理想二元掩蔽语音可懂度的影响。
J Acoust Soc Am. 2015 May;137(5):2801-10. doi: 10.1121/1.4919287.
6
Role of mask pattern in intelligibility of ideal binary-masked noisy speech.掩码模式在理想二元掩码噪声语音可懂度中的作用。
J Acoust Soc Am. 2009 Sep;126(3):1415-26. doi: 10.1121/1.3179673.
7
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.
8
Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.添加背景噪声可提高理想二值掩蔽噪声语音的可懂度。
J Acoust Soc Am. 2011 Apr;129(4):2227-36. doi: 10.1121/1.3559707.
9
Effects of room acoustics on the intelligibility of speech in classrooms for young children.室内声学对幼儿教室中语音清晰度的影响。
J Acoust Soc Am. 2009 Feb;125(2):922-33. doi: 10.1121/1.3058900.
10
The combined effects of reverberation and nonstationary noise on sentence intelligibility.混响和非平稳噪声对句子可懂度的综合影响。
J Acoust Soc Am. 2008 Aug;124(2):1269-77. doi: 10.1121/1.2945153.

引用本文的文献

1
Objective intelligibility measurement of reverberant vocoded speech for normal-hearing listeners: Towards facilitating the development of speech enhancement algorithms for cochlear implants.为正常听力听众测量混响语音编码语音的客观可懂度:促进人工耳蜗语音增强算法的发展。
J Acoust Soc Am. 2024 Mar 1;155(3):2151-2168. doi: 10.1121/10.0025285.
2
Parameter tuning of time-frequency masking algorithms for reverberant artifact removal within the cochlear implant stimulus.参数调整的时频掩蔽算法的混响伪影去除在耳蜗植入刺激。
Cochlear Implants Int. 2022 Nov;23(6):309-316. doi: 10.1080/14670100.2022.2096182. Epub 2022 Jul 23.
3
The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study.
从语音中去除噪声的最佳阈值在正常和听力障碍人群中相似——时频掩蔽研究。
J Acoust Soc Am. 2019 Jun;145(6):EL581. doi: 10.1121/1.5112828.
4
A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation.一种深度学习算法,用于在存在竞争说话者和混响的情况下提高听力障碍者的可理解度。
J Acoust Soc Am. 2019 Mar;145(3):1378. doi: 10.1121/1.5093547.
5
A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions.基于深度学习的分割算法,可提高在混响噪声环境下听力障碍者的语音可懂度。
J Acoust Soc Am. 2018 Sep;144(3):1627. doi: 10.1121/1.5055562.
6
Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising.复域中的时频掩蔽用于语音去混响和降噪
IEEE/ACM Trans Audio Speech Lang Process. 2017 Jul;25(7):1492-1501. doi: 10.1109/TASLP.2017.2696307. Epub 2017 Apr 20.
7
Comparison of a target-equalization-cancellation approach and a localization approach to source separation.目标均衡抵消法与源分离定位法的比较。
J Acoust Soc Am. 2017 Nov;142(5):2933. doi: 10.1121/1.5009763.
8
Effects of early and late reflections on intelligibility of reverberated speech by cochlear implant listeners.早期和晚期反射对人工耳蜗聆听者混响语音可懂度的影响。
J Acoust Soc Am. 2014 Jan;135(1):EL22-8. doi: 10.1121/1.4834455.