Evaluation of the importance of time-frequency contributions to speech intelligibility in noise.

Authors

Yu Chengzhu, Wójcicki Kamil K, Loizou Philipos C, Hansen John H L, Johnson Michael T

Affiliations

Department of Electrical Engineering, Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, Texas 75083.

Speech and Signal Processing Laboratory, Marquette University, 1515 West Wisconsin Avenue, Milwaukee, Wisconsin 53201-1881.

Publication information

J Acoust Soc Am. 2014 May;135(5):3007-16. doi: 10.1121/1.4869088.

DOI: 10.1121/1.4869088
PMID: 24815280
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4032418/

Abstract

Recent studies on binary masking techniques make the assumption that each time-frequency (T-F) unit contributes an equal amount to the overall intelligibility of speech. The present study demonstrated that the importance of each T-F unit to speech intelligibility varies in accordance with speech content. Specifically, T-F units are categorized into two classes, speech-present T-F units and speech-absent T-F units. Results indicate that the importance of each speech-present T-F unit to speech intelligibility is highly related to the loudness of its target component, while the importance of each speech-absent T-F unit varies according to the loudness of its masker component. Two types of mask errors are also considered, which include miss and false alarm errors. Consistent with previous work, false alarm errors are shown to be more harmful to speech intelligibility than miss errors when the mixture signal-to-noise ratio (SNR) is below 0 dB. However, the relative importance between the two types of error is conditioned on the SNR level of the input speech signal. Based on these observations, a mask-based objective measure, the loudness weighted hit-false, is proposed for predicting speech intelligibility. The proposed objective measure shows significantly higher correlation with intelligibility compared to two existing mask-based objective measures.
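
The abstract does not give the formula for the loudness weighted hit-false measure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the usual definition of HIT-FA for binary masks (hit rate over speech-present units minus false-alarm rate over speech-absent units) and lets each unit carry a weight. The function names, the 0 dB local criterion, and the example weights are all hypothetical; in particular, the weights are plain numbers standing in for a real loudness model.

```python
from typing import List, Optional, Sequence, Tuple

Grid = Sequence[Sequence[float]]  # rows: frequency channels, columns: time frames
Mask = Sequence[Sequence[int]]    # 1 = speech-present unit, 0 = speech-absent unit


def ideal_binary_mask(target_db: Grid, masker_db: Grid, lc_db: float = 0.0) -> List[List[int]]:
    """Mark a T-F unit as speech-present when its local SNR exceeds lc_db.

    lc_db (the local criterion) defaults to 0 dB here as an assumption.
    """
    return [
        [1 if t - m > lc_db else 0 for t, m in zip(t_row, m_row)]
        for t_row, m_row in zip(target_db, masker_db)
    ]


def weighted_hit_fa(
    ideal: Mask,
    estimated: Mask,
    present_w: Optional[Grid] = None,
    absent_w: Optional[Grid] = None,
) -> Tuple[float, float, float]:
    """Return (hit rate, false-alarm rate, hit rate - false-alarm rate).

    Speech-present units are weighted by present_w and speech-absent units
    by absent_w. With both weights omitted, every unit counts equally and
    the result is the plain HIT-FA score.
    """
    hit_num = hit_den = fa_num = fa_den = 0.0
    for r, (i_row, e_row) in enumerate(zip(ideal, estimated)):
        for c, (i, e) in enumerate(zip(i_row, e_row)):
            if i == 1:
                # Retained unit counts as a hit; a discarded one is a miss.
                w = 1.0 if present_w is None else present_w[r][c]
                hit_den += w
                hit_num += w * e
            else:
                # Retained speech-absent unit counts as a false alarm.
                w = 1.0 if absent_w is None else absent_w[r][c]
                fa_den += w
                fa_num += w * e
    hit = hit_num / hit_den if hit_den else 0.0
    fa = fa_num / fa_den if fa_den else 0.0
    return hit, fa, hit - fa


# Toy example: 2 channels x 3 frames, levels in dB (made-up numbers).
target_db = [[60.0, 40.0, 20.0], [55.0, 30.0, 10.0]]
masker_db = [[50.0, 50.0, 50.0], [50.0, 50.0, 50.0]]
ibm = ideal_binary_mask(target_db, masker_db)

# An estimated mask with one miss (channel 1, frame 0)
# and one false alarm (channel 0, frame 2).
estimated = [[1, 0, 1], [0, 0, 0]]

plain = weighted_hit_fa(ibm, estimated)

# Hypothetical weights: the correctly retained speech-present unit is
# treated as four times as important as the missed one.
present_w = [[4.0, 1.0, 1.0], [1.0, 1.0, 1.0]]
weighted = weighted_hit_fa(ibm, estimated, present_w=present_w)

print(plain, weighted)
```

In this toy case the miss falls on a low-weight unit, so the weighted score is higher than the unweighted one even though the two masks and the error counts are identical. That is the kind of difference an equal-contribution assumption cannot capture.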
