• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用时频掩蔽后噪声中的语音识别:频率和阈值参数的依赖性。

Recognition of speech in noise after application of time-frequency masks: dependence on frequency and threshold parameters.

机构信息

Department of Psychology, Utah State University, 2810 Old Main Hill, Logan, Utah 84322-2810, USA.

出版信息

J Acoust Soc Am. 2013 Apr;133(4):2390-6. doi: 10.1121/1.4792143.

DOI:10.1121/1.4792143
PMID:23556604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3631261/
Abstract

Binary time-frequency (TF) masks can be applied to separate speech from noise. Previous studies have shown that with appropriate parameters, ideal TF masks can extract highly intelligible speech even at very low speech-to-noise ratios (SNRs). Two psychophysical experiments provided additional information about the dependence of intelligibility on the frequency resolution and threshold criteria that define the ideal TF mask. Listeners identified AzBio Sentences in noise, before and after application of TF masks. Masks generated with 8 or 16 frequency bands per octave supported nearly-perfect identification. Word recognition accuracy was slightly lower and more variable with 4 bands per octave. When TF masks were generated with a local threshold criterion of 0 dB SNR, the mean speech reception threshold was -9.5 dB SNR, compared to -5.7 dB for unprocessed sentences in noise. Speech reception thresholds decreased by about 1 dB per dB of additional decrease in the local threshold criterion. Information reported here about the dependence of speech intelligibility on frequency and level parameters has relevance for the development of non-ideal TF masks for clinical applications such as speech processing for hearing aids.

摘要

二进制时频 (TF) 掩码可用于分离语音和噪声。先前的研究表明,使用适当的参数,理想的 TF 掩码即使在非常低的语音噪声比 (SNR) 下也可以提取出高度可理解的语音。两项心理物理实验提供了有关可懂度对频率分辨率和定义理想 TF 掩码的阈值标准的依赖性的更多信息。在应用 TF 掩码之前和之后,听众在噪声中识别了 AzBio 句子。每八度 8 或 16 个频带生成的掩码支持近乎完美的识别。每八度 4 个频带的识别准确率略低,且变化更大。当使用 0 dB SNR 的局部阈值标准生成 TF 掩码时,平均言语接受阈值为-9.5 dB SNR,而未经处理的噪声中的句子为-5.7 dB SNR。随着局部阈值标准额外降低 1 dB,言语接受阈值降低约 1 dB。这里报告的关于语音可懂度对频率和水平参数的依赖性的信息对于为助听器等临床应用开发非理想 TF 掩码具有重要意义。

相似文献

1
Recognition of speech in noise after application of time-frequency masks: dependence on frequency and threshold parameters.应用时频掩蔽后噪声中的语音识别:频率和阈值参数的依赖性。
J Acoust Soc Am. 2013 Apr;133(4):2390-6. doi: 10.1121/1.4792143.
2
Perceptual learning for speech in noise after application of binary time-frequency masks.应用二值时频掩蔽后语音在噪声中的感知学习。
J Acoust Soc Am. 2013 Mar;133(3):1687-92. doi: 10.1121/1.4789896.
3
The effects of selective consonant amplification on sentence recognition in noise by hearing-impaired listeners.选择性辅音增强对听力障碍者在噪声中句子识别的影响。
J Acoust Soc Am. 2011 Nov;130(5):3028-37. doi: 10.1121/1.3641407.
4
An algorithm to improve speech recognition in noise for hearing-impaired listeners.一种用于改善听力障碍者在噪声环境下语音识别的算法。
J Acoust Soc Am. 2013 Oct;134(4):3029-38. doi: 10.1121/1.4820893.
5
Intelligibility of reverberant noisy speech with ideal binary masking.用理想二值掩蔽评估混响噪声语音的可懂度。
J Acoust Soc Am. 2011 Oct;130(4):2153-61. doi: 10.1121/1.3631668.
6
Adaptation to Noise in Spectrotemporal Modulation Detection and Word Recognition.声谱时变调制检测和单词识别中的噪声适应。
Trends Hear. 2024 Jan-Dec;28:23312165241266322. doi: 10.1177/23312165241266322.
7
Spectrotemporal modulation sensitivity for hearing-impaired listeners: dependence on carrier center frequency and the relationship to speech intelligibility.听力受损听众的频谱时间调制敏感性:对载波中心频率的依赖性以及与言语可懂度的关系。
J Acoust Soc Am. 2014 Jul;136(1):301-16. doi: 10.1121/1.4881918.
8
Improving word recognition in noise among hearing-impaired subjects with a single-channel cochlear noise-reduction algorithm.提高单通道耳蜗降噪算法助听受试者噪声中单词识别能力。
J Acoust Soc Am. 2012 Sep;132(3):1718-31. doi: 10.1121/1.4739441.
9
Impact of SNR, masker type and noise reduction processing on sentence recognition performance and listening effort as indicated by the pupil dilation response.信噪比、掩蔽类型和降噪处理对瞳孔扩张反应所指示的句子识别性能和听力努力的影响。
Hear Res. 2018 Aug;365:90-99. doi: 10.1016/j.heares.2018.05.003. Epub 2018 May 6.
10
The Influence of Noise Reduction on Speech Intelligibility, Response Times to Speech, and Perceived Listening Effort in Normal-Hearing Listeners.噪声降低对正常听力者言语可懂度、言语反应时间和感知聆听努力的影响。
Trends Hear. 2017 Jan-Dec;21:2331216517716844. doi: 10.1177/2331216517716844.

引用本文的文献

1
An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognitiona).一种理想的压缩式面罩,用于在不牺牲环境声音识别能力的情况下提高语音清晰度a)。
J Acoust Soc Am. 2024 Dec 1;156(6):3958-3969. doi: 10.1121/10.0034599.
2
The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise.时频掩蔽在背景噪声下改善构音障碍语音可懂度的应用。
J Speech Lang Hear Res. 2023 May 9;66(5):1853-1866. doi: 10.1044/2023_JSLHR-22-00558. Epub 2023 Mar 21.
3
The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study.从语音中去除噪声的最佳阈值在正常和听力障碍人群中相似——时频掩蔽研究。
J Acoust Soc Am. 2019 Jun;145(6):EL581. doi: 10.1121/1.5112828.
4
An ideal quantized mask to increase intelligibility and quality of speech in noise.一种理想的量化掩蔽,可提高噪声中的语音可懂度和质量。
J Acoust Soc Am. 2018 Sep;144(3):1392. doi: 10.1121/1.5053115.
5
Speech-cue transmission by an algorithm to increase consonant recognition in noise for hearing-impaired listeners.一种通过算法进行语音提示传输以提高听力受损听众在噪声环境中辅音识别能力的方法。
J Acoust Soc Am. 2014 Dec;136(6):3325. doi: 10.1121/1.4901712.
6
Masked sentence recognition assessed at ascending target-to-masker ratios: modest effects of repeating stimuli.在目标-掩蔽比上升时评估的掩蔽句子识别:重复刺激的适度影响。
Ear Hear. 2015 Mar-Apr;36(2):e14-22. doi: 10.1097/AUD.0000000000000113.

本文引用的文献

1
Development and validation of the AzBio sentence lists.发展和验证 AzBio 句子列表。
Ear Hear. 2012 Jan-Feb;33(1):112-7. doi: 10.1097/AUD.0b013e31822c2549.
2
An algorithm that improves speech intelligibility in noise for normal-hearing listeners.一种可提高听力正常的听众在噪声环境中语音清晰度的算法。
J Acoust Soc Am. 2009 Sep;126(3):1486-94. doi: 10.1121/1.3184603.
3
Role of mask pattern in intelligibility of ideal binary-masked noisy speech.掩码模式在理想二元掩码噪声语音可懂度中的作用。
J Acoust Soc Am. 2009 Sep;126(3):1415-26. doi: 10.1121/1.3179673.
4
Objective measures of listening effort: effects of background noise and noise reduction.听力努力的客观测量:背景噪声和降噪的影响
J Speech Lang Hear Res. 2009 Oct;52(5):1230-40. doi: 10.1044/1092-4388(2009/08-0111). Epub 2009 Apr 20.
5
Speech intelligibility in background noise with ideal binary time-frequency masking.基于理想二元时频掩蔽的背景噪声下语音清晰度
J Acoust Soc Am. 2009 Apr;125(4):2336-47. doi: 10.1121/1.3083233.
6
Speech perception of noise with binary gains.具有二元增益的噪声语音感知
J Acoust Soc Am. 2008 Oct;124(4):2303-7. doi: 10.1121/1.2967865.
7
Time-frequency masking for speech separation and its potential for hearing aid design.用于语音分离的时频掩蔽及其在助听器设计中的潜力。
Trends Amplif. 2008 Dec;12(4):332-53. doi: 10.1177/1084713808326455. Epub 2008 Oct 30.
8
Effect of spectral resolution on the intelligibility of ideal binary masked speech.光谱分辨率对理想二元掩蔽语音可懂度的影响。
J Acoust Soc Am. 2008 Apr;123(4):EL59-64. doi: 10.1121/1.2884086.
9
Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.影响理想二元掩蔽语音可懂度的因素:对降噪的启示
J Acoust Soc Am. 2008 Mar;123(3):1673-82. doi: 10.1121/1.2832617.
10
Speech recognition materials and ceiling effects: considerations for cochlear implant programs.语音识别材料与天花板效应:人工耳蜗植入项目的考量因素
Audiol Neurootol. 2008;13(3):193-205. doi: 10.1159/000113510. Epub 2008 Jan 22.