• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有理想时频分离的多说话者语音感知:嗓音特征和说话者数量的影响。

Multitalker speech perception with ideal time-frequency segregation: effects of voice characteristics and number of talkers.

作者信息

Brungart Douglas S, Chang Peter S, Simpson Brian D, Wang DeLiang

机构信息

Air Force Research Laboratory, Human Effectiveness Directorate, Wright-Patterson AFB, Ohio 45433, USA.

出版信息

J Acoust Soc Am. 2009 Jun;125(6):4006-22. doi: 10.1121/1.3117686.

DOI:10.1121/1.3117686
PMID:19507982
Abstract

When a target voice is masked by an increasingly similar masker voice, increases in energetic masking are likely to occur due to increased spectro-temporal overlap in the competing speech waveforms. However, the impact of this increase may be obscured by informational masking effects related to the increased confusability of the target and masking utterances. In this study, the effects of target-masker similarity and the number of competing talkers on the energetic component of speech-on-speech masking were measured with an ideal time-frequency segregation (ITFS) technique that retained all the target-dominated time-frequency regions of a multitalker mixture but eliminated all the time-frequency regions dominated by the maskers. The results show that target-masker similarity has a small but systematic impact on energetic masking, with roughly a 1 dB release from masking for same-sex maskers versus same-talker maskers and roughly an additional 1 dB release from masking for different-sex masking voices. The results of a second experiment measuring ITFS performance with up to 18 interfering talkers indicate that energetic masking increased systematically with the number of competing talkers. These results suggest that energetic masking differences related to target-masker similarity have a much smaller impact on multitalker listening performance than energetic masking effects related to the number of competing talkers in the stimulus and non-energetic masking effects related to the confusability of the target and masking voices.

摘要

当目标语音被越来越相似的掩蔽语音掩盖时,由于竞争语音波形中频谱-时间重叠增加,能量掩蔽很可能会增强。然而,这种增强的影响可能会被与目标语音和掩蔽语音可混淆性增加相关的信息掩蔽效应所掩盖。在本研究中,采用理想时频分离(ITFS)技术测量了目标-掩蔽语音相似度和竞争说话者数量对语音对语音掩蔽能量成分的影响,该技术保留了多说话者混合语音中所有以目标语音为主导的时频区域,但消除了所有以掩蔽语音为主导的时频区域。结果表明,目标-掩蔽语音相似度对能量掩蔽有微小但系统的影响,同性掩蔽语音与同一说话者掩蔽语音相比,掩蔽解除约1 dB,不同性别的掩蔽语音相比,掩蔽解除约额外增加1 dB。第二个实验测量了多达18个干扰说话者的ITFS性能,结果表明能量掩蔽随着竞争说话者数量的增加而系统性增强。这些结果表明,与目标-掩蔽语音相似度相关的能量掩蔽差异对多说话者听力性能的影响,远小于与刺激中竞争说话者数量相关的能量掩蔽效应以及与目标语音和掩蔽语音可混淆性相关的非能量掩蔽效应。

相似文献

1
Multitalker speech perception with ideal time-frequency segregation: effects of voice characteristics and number of talkers.具有理想时频分离的多说话者语音感知:嗓音特征和说话者数量的影响。
J Acoust Soc Am. 2009 Jun;125(6):4006-22. doi: 10.1121/1.3117686.
2
Effects of target-masker contextual similarity on the multimasker penalty in a three-talker diotic listening task.在三说话者双耳聆听任务中,目标-掩蔽声上下文相似性对多掩蔽声惩罚的影响。
J Acoust Soc Am. 2010 Nov;128(5):2998-10. doi: 10.1121/1.3479547.
3
A model for multitalker speech perception.一种多说话者语音感知模型。
J Acoust Soc Am. 2008 Nov;124(5):3213-24. doi: 10.1121/1.2982413.
4
Effect of target-masker similarity on across-ear interference in a dichotic cocktail-party listening task.目标-掩蔽声相似性对双耳分听鸡尾酒会聆听任务中跨耳干扰的影响。
J Acoust Soc Am. 2007 Sep;122(3):1724. doi: 10.1121/1.2756797.
5
Spectral and temporal changes to speech produced in the presence of energetic and informational maskers.在能量掩蔽和信息掩蔽存在的情况下产生的语音的光谱和时间变化。
J Acoust Soc Am. 2010 Oct;128(4):2059-69. doi: 10.1121/1.3478775.
6
Informational and energetic masking effects in the perception of multiple simultaneous talkers.多个同时说话者感知中的信息性和能量掩蔽效应。
J Acoust Soc Am. 2001 Nov;110(5 Pt 1):2527-38. doi: 10.1121/1.1408946.
7
The foreign language cocktail party problem: Energetic and informational masking effects in non-native speech perception.外语鸡尾酒会问题:非母语语音感知中的能量掩蔽效应和信息掩蔽效应
J Acoust Soc Am. 2008 Jan;123(1):414-27. doi: 10.1121/1.2804952.
8
Informational and energetic masking effects in the perception of two simultaneous talkers.同时感知两个说话者时的信息性和能量性掩蔽效应。
J Acoust Soc Am. 2001 Mar;109(3):1101-9. doi: 10.1121/1.1345696.
9
Contributions of talker characteristics and spatial location to auditory streaming.说话者特征和空间位置对听觉流的影响。
J Acoust Soc Am. 2008 Mar;123(3):1562-70. doi: 10.1121/1.2831774.
10
Speech recognition with varying numbers and types of competing talkers by normal-hearing, cochlear-implant, and implant simulation subjects.听力正常者、人工耳蜗植入者及植入模拟受试者在存在不同数量和类型的竞争说话者情况下的语音识别。
J Acoust Soc Am. 2008 Jan;123(1):450-61. doi: 10.1121/1.2805617.

引用本文的文献

1
Evaluation of Speaker-Conditioned Target Speaker Extraction Algorithms for Hearing-Impaired Listeners.针对听力受损听众的说话者条件目标说话者提取算法评估
Trends Hear. 2025 Jan-Dec;29:23312165251365802. doi: 10.1177/23312165251365802. Epub 2025 Aug 11.
2
Factors underlying masking release by voice-gender differences and spatial separation cues in multi-talker listening environments in listeners with and without hearing loss.听力正常和听力损失听众在多说话者聆听环境中,语音性别差异和空间分离线索导致掩蔽解除的潜在因素。
Front Neurosci. 2022 Nov 23;16:1059639. doi: 10.3389/fnins.2022.1059639. eCollection 2022.
3
Speech perception in noise: Masking and unmasking.
噪声中的语音感知:掩蔽与解掩蔽
J Otol. 2021 Apr;16(2):109-119. doi: 10.1016/j.joto.2020.12.001. Epub 2020 Dec 11.
4
Spectro-temporal glimpsing of speech in noise: Regularity and coherence of masking patterns reduces uncertainty and increases intelligibility.语音在噪声中的时频窥探:掩蔽模式的规律性和连贯性降低了不确定性,提高了可懂度。
J Acoust Soc Am. 2020 Sep;148(3):1552. doi: 10.1121/10.0001971.
5
Paying attention to speech: The role of working memory capacity and professional experience.关注言语:工作记忆容量与专业经验的作用。
Atten Percept Psychophys. 2020 Oct;82(7):3594-3605. doi: 10.3758/s13414-020-02091-2.
6
The importance of processing resolution in "ideal time-frequency segregation" of masked speech and the implications for predicting speech intelligibility.处理分辨率在掩蔽语音“理想时频分离”中的重要性及其对预测语音可懂度的影响。
J Acoust Soc Am. 2020 Mar;147(3):1648. doi: 10.1121/10.0000893.
7
Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort.双耳语音可懂度和感知聆听努力中语音掩蔽的能量和信息成分。
Trends Hear. 2019 Jan-Dec;23:2331216519854597. doi: 10.1177/2331216519854597.
8
Error patterns of native and non-native listeners' perception of speech in noise.母语和非母语听者在噪声中感知言语的错误模式。
J Acoust Soc Am. 2019 Feb;145(2):EL129. doi: 10.1121/1.5087271.
9
Determining the energetic and informational components of speech-on-speech masking in listeners with sensorineural hearing loss.测定感音神经性听力损失患者言语-言语掩蔽中的能量和信息成分。
J Acoust Soc Am. 2019 Jan;145(1):440. doi: 10.1121/1.5087555.
10
Developmental Effects in Masking Release for Speech-in-Speech Perception Due to a Target/Masker Sex Mismatch.由于目标/掩蔽者性别不匹配导致语音感知掩蔽释放中的发育效应。
Ear Hear. 2018 Sep/Oct;39(5):935-945. doi: 10.1097/AUD.0000000000000554.