• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

噪声抑制对可懂度的影响。II:验证物理指标的尝试。

Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics.

作者信息

Hilkhuysen Gaston, Gaubitch Nikolay, Brookes Mike, Huckvale Mark

机构信息

Department of Speech, Language and Hearing Sciences, University College London, 2 Wakefield Street, London WC1N 1PF, United Kingdom.

Electrical and Electronic Engineering Department, Imperial College, Exhibition Road, London SW7 2BT, United Kingdom.

出版信息

J Acoust Soc Am. 2014 Jan;135(1):439-50. doi: 10.1121/1.4837238.

DOI:10.1121/1.4837238
PMID:24437784
Abstract

Using the data presented in the accompanying paper [Hilkhuysen et al., J. Acoust. Soc. Am. 131, 531-539 (2012)], the ability of six metrics to predict intelligibility of speech in noise before and after noise suppression was studied. The metrics considered were the Speech Intelligibility Index (SII), the fractional Articulation Index (fAI), the coherence intelligibility index based on the mid-levels in speech (CSIImid), an extension of the Normalized Coherence Metric (NCM+), a part of the speech-based envelope power model (pre-sEPSM), and the Short Term Objective Intelligibility measure (STOI). Three of the measures, SII, CSIImid, and NCM+, overpredicted intelligibility after noise reduction, whereas fAI underpredicted these intelligibilities. The pre-sEPSM metric worked well for speech in babble but failed with car noise. STOI gave the best predictions, but overall the size of intelligibility prediction errors were greater than the change in intelligibility caused by noise suppression. Suggestions for improvements of the metrics are discussed.

摘要

利用随附论文[希尔库伊森等人,《美国声学学会杂志》131,531 - 539(2012年)]中给出的数据,研究了六种指标在噪声抑制前后预测噪声中语音可懂度的能力。所考虑的指标有语音可懂度指数(SII)、分数清晰度指数(fAI)、基于语音中值电平的相干可懂度指数(CSIImid)、归一化相干度量的扩展(NCM +)、基于语音的包络功率模型的一部分(预sEPSM)以及短期客观可懂度度量(STOI)。其中三种指标,即SII、CSIImid和NCM +,在降噪后对可懂度的预测过高,而fAI则对这些可懂度的预测过低。预sEPSM指标在嘈杂语音中表现良好,但在汽车噪声环境下失效。STOI给出了最佳预测,但总体而言,可懂度预测误差的大小大于噪声抑制所导致的可懂度变化。文中讨论了对这些指标进行改进的建议。

相似文献

1
Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics.噪声抑制对可懂度的影响。II:验证物理指标的尝试。
J Acoust Soc Am. 2014 Jan;135(1):439-50. doi: 10.1121/1.4837238.
2
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.
3
The role of short-time intensity and envelope power for speech intelligibility and psychoacoustic masking.短时强度和包络功率对语音清晰度及心理声学掩蔽的作用。
J Acoust Soc Am. 2017 Aug;142(2):1098. doi: 10.1121/1.4999059.
4
Modifying the normalized covariance metric measure to account for nonlinear distortions introduced by noise-reduction algorithms.修改归一化协方差度量标准以考虑降噪算法引入的非线性失真。
J Acoust Soc Am. 2013 May;133(5):EL405-11. doi: 10.1121/1.4800189.
5
Modelling speech intelligibility in adverse conditions.在不利条件下的言语可懂度建模。
Adv Exp Med Biol. 2013;787:343-51. doi: 10.1007/978-1-4614-1590-9_38.
6
Information-bearing acoustic change outperforms duration in predicting intelligibility of full-spectrum and noise-vocoded sentences.带信息的声学变化在预测全谱和噪声编码句子的可理解度方面优于时长。
J Acoust Soc Am. 2014 Mar;135(3):1518-29. doi: 10.1121/1.4863267.
7
Outcome measures based on classification performance fail to predict the intelligibility of binary-masked speech.基于分类性能的结果指标无法预测二元掩蔽语音的可懂度。
J Acoust Soc Am. 2016 Jun;139(6):3033. doi: 10.1121/1.4952439.
8
Modeling the effects of a single reflection on binaural speech intelligibility.对单耳反射对双耳语音可懂度影响的建模。
J Acoust Soc Am. 2014 Mar;135(3):1556-67. doi: 10.1121/1.4863197.
9
A multi-resolution envelope-power based model for speech intelligibility.基于多分辨率包络功率的语音可懂度模型。
J Acoust Soc Am. 2013 Jul;134(1):436-46. doi: 10.1121/1.4807563.
10
Intelligibility prediction for speech mixed with white Gaussian noise at low signal-to-noise ratios.低信噪比下混入白噪声语音的可懂度预测。
J Acoust Soc Am. 2021 Feb;149(2):1346. doi: 10.1121/10.0003557.

引用本文的文献

1
En route to sound coding strategies for optical cochlear implants.通往光学人工耳蜗声音编码策略之路。
iScience. 2023 Aug 25;26(10):107725. doi: 10.1016/j.isci.2023.107725. eCollection 2023 Oct 20.