• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在存在扩展高频线索的情况下,语音内语音识别的声道重要性。

Band importance for speech-in-speech recognition in the presence of extended high-frequency cues.

机构信息

Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, USA.

Department of Otolaryngology/HNS, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA.

出版信息

J Acoust Soc Am. 2024 Aug 1;156(2):1202-1213. doi: 10.1121/10.0028269.

DOI:10.1121/10.0028269
PMID:39158325
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11335358/
Abstract

Band importance functions for speech-in-noise recognition, typically determined in the presence of steady background noise, indicate a negligible role for extended high frequencies (EHFs; 8-20 kHz). However, recent findings indicate that EHF cues support speech recognition in multi-talker environments, particularly when the masker has reduced EHF levels relative to the target. This scenario can occur in natural auditory scenes when the target talker is facing the listener, but the maskers are not. In this study, we measured the importance of five bands from 40 to 20 000 Hz for speech-in-speech recognition by notch-filtering the bands individually. Stimuli consisted of a female target talker recorded from 0° and a spatially co-located two-talker female masker recorded either from 0° or 56.25°, simulating a masker either facing the listener or facing away, respectively. Results indicated peak band importance in the 0.4-1.3 kHz band and a negligible effect of removing the EHF band in the facing-masker condition. However, in the non-facing condition, the peak was broader and EHF importance was higher and comparable to that of the 3.3-8.3 kHz band in the facing-masker condition. These findings suggest that EHFs contain important cues for speech recognition in listening conditions with mismatched talker head orientations.

摘要

用于语音噪声识别的频带重要性函数,通常在稳态背景噪声存在的情况下确定,表明扩展高频(EHF;8-20 kHz)的作用可以忽略不计。然而,最近的研究结果表明,EHF 线索在多说话人环境中支持语音识别,特别是当掩蔽器的 EHF 水平相对于目标降低时。当目标说话人面对听众,但掩蔽器不面对听众时,这种情况可能会出现在自然听觉场景中。在这项研究中,我们通过单独对五个从 40 到 20000 Hz 的频带进行带阻滤波,测量了这些频带在语音内语音识别中的重要性。刺激由来自 0°的女性目标说话人录制,以及来自 0°或 56.25°的空间共定位的两个女性掩蔽器录制,分别模拟掩蔽器面向听众或背向听众。结果表明,在面对掩蔽器的情况下,峰值频带重要性在 0.4-1.3 kHz 频带,去除 EHF 频带的影响可以忽略不计。然而,在非面对情况下,峰值更宽,EHF 的重要性更高,与面对掩蔽器情况下的 3.3-8.3 kHz 频带相当。这些发现表明,在说话人头部方向不匹配的聆听条件下,EHF 包含了语音识别的重要线索。

相似文献

1
Band importance for speech-in-speech recognition in the presence of extended high-frequency cues.在存在扩展高频线索的情况下,语音内语音识别的声道重要性。
J Acoust Soc Am. 2024 Aug 1;156(2):1202-1213. doi: 10.1121/10.0028269.
2
Spectral weights for localization and speech-in-speech recognition with spatial separation of talkers on the horizontal plane.用于在水平面上对说话者进行空间分离的定位和语音中语音识别的谱权重。
J Acoust Soc Am. 2025 Jul 1;158(1):186-200. doi: 10.1121/10.0037072.
3
On the cocktail-party problem: Do children use their exquisite hearing at frequencies above 8 kHz?关于鸡尾酒会问题:儿童是否会利用其在8千赫兹以上频率的敏锐听力?
Hear Res. 2025 Aug;464:109327. doi: 10.1016/j.heares.2025.109327. Epub 2025 Jun 10.
4
Testing the role of temporal coherence on speech intelligibility with noise and single-talker maskers.测试时间相干性在噪声和单说话人掩蔽下语音可懂度中的作用。
J Acoust Soc Am. 2024 Nov 1;156(5):3285-3297. doi: 10.1121/10.0034420.
5
Frequency importance for sentence recognition in co-located noise, co-located speech, and spatially separated speech.在噪声环境、共位语音和空间分离语音中,句子识别的频率重要性。
J Acoust Soc Am. 2024 Nov 1;156(5):3275-3284. doi: 10.1121/10.0034412.
6
Effects of Masker Intelligibility and Talker Sex on Speech-in-Speech Recognition by Mandarin Speakers Across the Lifespan.掩蔽音清晰度和说话者性别对各年龄段普通话使用者的语音中语音识别的影响。
Ear Hear. 2025;46(4):1085-1094. doi: 10.1097/AUD.0000000000001655. Epub 2025 Mar 18.
7
Effect of Masker Head Orientation, Listener Age, and Extended High-Frequency Sensitivity on Speech Recognition in Spatially Separated Speech.掩蔽头方向、听众年龄和高频扩展灵敏度对空间分离语音中的语音识别的影响。
Ear Hear. 2022 Jan/Feb;43(1):90-100. doi: 10.1097/AUD.0000000000001081.
8
High-arousal emotional speech enhances speech intelligibility and emotion recognition in noise.高唤醒度情感语音可提高噪声环境下的语音清晰度和情感识别能力。
J Acoust Soc Am. 2025 Jun 1;157(6):4085-4096. doi: 10.1121/10.0036812.
9
Degradation in Binaural and Spatial Hearing, and Auditory Temporal Processing Abilities, as a Function of Aging.双耳及空间听觉以及听觉时间处理能力随衰老的退化
bioRxiv. 2025 Feb 20:2024.07.08.602575. doi: 10.1101/2024.07.08.602575.
10
Attenuation and distortion components of age-related hearing loss: Contributions to recognizing temporal-envelope filtered speech in modulated noise.年龄相关听力损失的衰减和失真成分:对在调制噪声中识别时域包络滤波语音的贡献。
J Acoust Soc Am. 2024 Jul 1;156(1):93-106. doi: 10.1121/10.0026450.

本文引用的文献

1
Differential benefits of unmasking extended high-frequency content of target or background speech.掩蔽目标或背景语音的扩展高频内容的差异收益。
J Acoust Soc Am. 2023 Jul 1;154(1):454-462. doi: 10.1121/10.0020175.
2
Spectral weighting for sentence recognition in steady-state and amplitude-modulated noise.在稳态噪声和调幅噪声中用于句子识别的频谱加权。
JASA Express Lett. 2023 May 1;3(5). doi: 10.1121/10.0017934.
3
Extending the High-Frequency Bandwidth and Predicting Speech-in-Noise Recognition: Building on the Work of Pat Stelmachowicz.扩展高频带宽并预测噪声中的语音识别:基于帕特·斯特尔马乔维茨的工作
Semin Hear. 2023 Mar 1;44(Suppl 1):S64-S74. doi: 10.1055/s-0043-1764133. eCollection 2023 Feb.
4
On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments.关于在语音感知实验中使用 TIMIT、QuickSIN、NU-6 和其他广泛使用的带限语音材料。
J Acoust Soc Am. 2022 Sep;152(3):1639. doi: 10.1121/10.0013993.
5
Extended high-frequency audiometry in research and clinical practice.扩展高频测听在研究和临床实践中的应用。
J Acoust Soc Am. 2022 Mar;151(3):1944. doi: 10.1121/10.0009766.
6
The Importance of Extended High-Frequency Speech Information in the Recognition of Digits, Words, and Sentences in Quiet and Noise.扩展高频语音信息在安静和噪声环境中对数字、单词及句子识别的重要性
Ear Hear. 2022 May/Jun;43(3):913-920. doi: 10.1097/AUD.0000000000001142.
7
Extended High-frequency Hearing Impairment Despite a Normal Audiogram: Relation to Early Aging, Speech-in-noise Perception, Cochlear Function, and Routine Earphone Use.尽管听力图正常,但高频听力仍受损:与早期衰老、噪声下言语感知、耳蜗功能和常规耳机使用的关系。
Ear Hear. 2022 May/Jun;43(3):822-835. doi: 10.1097/AUD.0000000000001140.
8
Band importance for speech-in-speech recognition.语音中语音识别的频段重要性。
JASA Express Lett. 2021 Aug;1(8):084402. doi: 10.1121/10.0005762. Epub 2021 Aug 2.
9
Effect of Masker Head Orientation, Listener Age, and Extended High-Frequency Sensitivity on Speech Recognition in Spatially Separated Speech.掩蔽头方向、听众年龄和高频扩展灵敏度对空间分离语音中的语音识别的影响。
Ear Hear. 2022 Jan/Feb;43(1):90-100. doi: 10.1097/AUD.0000000000001081.
10
Extended high-frequency hearing and head orientation cues benefit children during speech-in-speech recognition.扩展高频听力和头部方向线索有助于儿童在语音干扰环境下进行语音识别。
Hear Res. 2021 Jul;406:108230. doi: 10.1016/j.heares.2021.108230. Epub 2021 Apr 8.