• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测周期性对掩蔽语音可懂度的影响:不同建模方法的评估及其局限性。

Predicting the effects of periodicity on the intelligibility of masked speech: An evaluation of different modelling approaches and their limitations.

机构信息

Speech, Hearing and Phonetic Sciences, University College London, Chandler House, 2 Wakefield Street, London WC1N 1PF, United Kingdom.

Hearing Systems Section, Department of Health Technology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.

出版信息

J Acoust Soc Am. 2019 Oct;146(4):2562. doi: 10.1121/1.5129050.

DOI:10.1121/1.5129050
PMID:31671986
Abstract

Four existing speech intelligibility models with different theoretical assumptions were used to predict previously published behavioural data. Those data showed that complex tones with pitch-related periodicity are far less effective maskers of speech than aperiodic noise. This so-called masker-periodicity benefit (MPB) far exceeded the fluctuating-masker benefit (FMB) obtained from slow masker envelope fluctuations. In contrast, the normal-hearing listeners hardly benefitted from periodicity in the target speech. All tested models consistently underestimated MPB and FMB, while most of them also overestimated the intelligibility of vocoded speech. To understand these shortcomings, the internal signal representations of the models were analysed in detail. The best-performing model, the correlation-based version of the speech-based envelope power spectrum model (sEPSM), combined an auditory processing front end with a modulation filterbank and a correlation-based back end. This model was then modified to further improve the predictions. The resulting second version of the sEPSM outperformed the original model with all tested maskers and accounted for about half the MPB, which can be attributed to reduced modulation masking caused by the periodic maskers. However, as the sEPSM2 failed to account for the other half of the MPB, the results also indicate that future models should consider the contribution of pitch-related effects, such as enhanced stream segregation, to further improve their predictive power.

摘要

四种具有不同理论假设的现有语音可懂度模型被用于预测先前发表的行为数据。这些数据表明,具有音高相关周期性的复合音作为语音掩蔽的效果远不如非周期性噪声。这种所谓的掩蔽器周期性优势(MPB)远远超过了从慢掩蔽器包络波动中获得的波动掩蔽器优势(FMB)。相比之下,正常听力的听众几乎没有受益于目标语音的周期性。所有测试的模型都一致低估了 MPB 和 FMB,而大多数模型也高估了声码化语音的可懂度。为了理解这些缺点,详细分析了模型的内部信号表示。表现最好的模型是基于语音包络功率谱模型(sEPSM)的相关版本,它将听觉处理前端与调制滤波器组和基于相关的后端相结合。然后对该模型进行了修改,以进一步提高预测能力。改进后的 sEPSM 第二代模型在所有测试掩蔽器上的表现均优于原始模型,并且可以解释约一半的 MPB,这归因于周期性掩蔽器引起的调制掩蔽减少。然而,由于 sEPSM2 无法解释 MPB 的另一半,结果还表明,未来的模型应该考虑与音高相关的效应(例如增强的流分离)的贡献,以进一步提高其预测能力。

相似文献

1
Predicting the effects of periodicity on the intelligibility of masked speech: An evaluation of different modelling approaches and their limitations.预测周期性对掩蔽语音可懂度的影响:不同建模方法的评估及其局限性。
J Acoust Soc Am. 2019 Oct;146(4):2562. doi: 10.1121/1.5129050.
2
Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners.正常听力和听力受损听众在波动掩蔽声中语音的听觉及视听清晰度
J Acoust Soc Am. 2009 May;125(5):3358-72. doi: 10.1121/1.3110132.
3
A multi-resolution envelope-power based model for speech intelligibility.基于多分辨率包络功率的语音可懂度模型。
J Acoust Soc Am. 2013 Jul;134(1):436-46. doi: 10.1121/1.4807563.
4
The role of periodicity in perceiving speech in quiet and in background noise.周期性在安静环境和背景噪声中语音感知中的作用。
J Acoust Soc Am. 2015 Dec;138(6):3586-99. doi: 10.1121/1.4936945.
5
Predicting the intelligibility of vocoded speech.语音编码语音可懂度预测。
Ear Hear. 2011 May-Jun;32(3):331-8. doi: 10.1097/AUD.0b013e3181ff3515.
6
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.
7
Effects of periodic masker interruption on the intelligibility of interrupted speech.周期性掩蔽中断对中断语音可懂度的影响。
J Acoust Soc Am. 2007 Sep;122(3):1693. doi: 10.1121/1.2756177.
8
Envelope and intensity based prediction of psychoacoustic masking and speech intelligibility.基于包络和强度的心理声学掩蔽及语音可懂度预测。
J Acoust Soc Am. 2016 Aug;140(2):1023. doi: 10.1121/1.4960574.
9
The role of envelope periodicity in the perception of masked speech with simulated and real cochlear implants.掩蔽语音中,模拟和真实人工耳蜗中声窗周期对语音感知的作用。
J Acoust Soc Am. 2018 Aug;144(2):885. doi: 10.1121/1.5049584.
10
Predicting binaural speech intelligibility using the signal-to-noise ratio in the envelope power spectrum domain.在包络功率谱域中利用信噪比预测双耳言语可懂度。
J Acoust Soc Am. 2016 Jul;140(1):192. doi: 10.1121/1.4954254.

引用本文的文献

1
Web-based psychoacoustics: Hearing screening, infrastructure, and validation.基于网络的心理声学:听力筛查、基础设施和验证。
Behav Res Methods. 2024 Mar;56(3):1433-1448. doi: 10.3758/s13428-023-02101-9. Epub 2023 Jun 8.
2
Spectral weighting for sentence recognition in steady-state and amplitude-modulated noise.在稳态噪声和调幅噪声中用于句子识别的频谱加权。
JASA Express Lett. 2023 May 1;3(5). doi: 10.1121/10.0017934.
3
A comparative study of eight human auditory models of monaural processing.八种单耳听觉处理的人体听觉模型的比较研究。
Acta Acust (2020). 2022;6. doi: 10.1051/aacus/2022008. Epub 2022 May 4.
4
An overview of the HASPI and HASQI metrics for predicting speech intelligibility and speech quality for normal hearing, hearing loss, and hearing aids.HASPI 和 HASQI 指标概述,用于预测正常听力、听力损失和助听器的语音可懂度和语音质量。
Hear Res. 2022 Dec;426:108608. doi: 10.1016/j.heares.2022.108608. Epub 2022 Sep 13.
5
Investigating the role of harmonic cancellation in speech-on-speech masking.研究谐波抵消在语音掩蔽中的作用。
Hear Res. 2022 Dec;426:108562. doi: 10.1016/j.heares.2022.108562. Epub 2022 Jun 17.
6
Predicting speech intelligibility in hearing-impaired listeners using a physiologically inspired auditory model.利用生理启发式听觉模型预测听力受损者的言语可懂度。
Hear Res. 2022 Dec;426:108553. doi: 10.1016/j.heares.2022.108553. Epub 2022 Jun 9.
7
Speech Categorization Reveals the Role of Early-Stage Temporal-Coherence Processing in Auditory Scene Analysis.言语分类揭示了早期时间相干性处理在听觉场景分析中的作用。
J Neurosci. 2022 Jan 12;42(2):240-254. doi: 10.1523/JNEUROSCI.1610-21.2021. Epub 2021 Nov 11.
8
Modulation masking and fine structure shape neural envelope coding to predict speech intelligibility across diverse listening conditions.调制掩蔽和精细结构形状神经包络编码可预测各种听力条件下的言语可懂度。
J Acoust Soc Am. 2021 Sep;150(3):2230. doi: 10.1121/10.0006385.
9
Speech Intelligibility Prediction using Spectro-Temporal Modulation Analysis.基于频谱-时间调制分析的语音可懂度预测
IEEE/ACM Trans Audio Speech Lang Process. 2021;29:210-225. doi: 10.1109/taslp.2020.3039929. Epub 2020 Nov 24.
10
A harmonic-cancellation-based model to predict speech intelligibility against a harmonic masker.一种基于谐波抵消的模型,用于预测针对谐波掩蔽的语音可懂度。
J Acoust Soc Am. 2020 Nov;148(5):3246. doi: 10.1121/10.0002492.