Monaural speech intelligibility and detection in maskers with varying amounts of spectro-temporal speech features.

Authors

Schubotz Wiebke, Brand Thomas, Kollmeier Birger, Ewert Stephan D

Affiliation

Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, D-26111 Oldenburg, Germany.

Publication

J Acoust Soc Am. 2016 Jul;140(1):524. doi: 10.1121/1.4955079.

DOI: 10.1121/1.4955079
PMID: 27475175

Abstract

Speech intelligibility is strongly affected by the presence of maskers. Depending on the spectro-temporal structure of the masker and its similarity to the target speech, different masking aspects can occur, which are typically referred to as energetic, amplitude modulation, and informational masking. In this study, speech intelligibility and speech detection were measured in maskers that vary systematically in the time-frequency domain from steady-state noise to a single interfering talker. Male and female target speech was used in combination with maskers based on speech of the same or a different gender. Observed data were compared to predictions of the speech intelligibility index, the extended speech intelligibility index, the multi-resolution speech-based envelope-power-spectrum model, and the short-time objective intelligibility measure. The different models served as analysis tools to help distinguish between the different masking aspects. The comparison shows that overall masking can to a large extent be explained by short-term energetic masking. However, the other masking aspects (amplitude modulation and informational masking) influence speech intelligibility as well. Additionally, all models showed considerable deviations from the data. Therefore, the current study provides a benchmark for further evaluation of speech prediction models.
