Role of Binaural Temporal Fine Structure and Envelope Cues in Cocktail-Party Listening.

Authors

Swaminathan Jayaganesh, Mason Christine R, Streeter Timothy M, Best Virginia, Roverud Elin, Kidd Gerald

Affiliation

Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215.

Publication information

J Neurosci. 2016 Aug 3;36(31):8250-7. doi: 10.1523/JNEUROSCI.4421-15.2016.

PMID: 27488643
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4971368/

Abstract

While conversing in a crowded social setting, a listener is often required to follow a target speech signal amid multiple competing speech signals (the so-called "cocktail party" problem). In such situations, separation of the target speech signal in azimuth from the interfering masker signals can lead to an improvement in target intelligibility, an effect known as spatial release from masking (SRM). This study assessed the contributions of two stimulus properties that vary with separation of sound sources, binaural envelope (ENV) and temporal fine structure (TFS), to SRM in normal-hearing (NH) human listeners. Target speech was presented from the front and speech maskers were either colocated with or symmetrically separated from the target in azimuth. The target and maskers were presented either as natural speech or as "noise-vocoded" speech in which the intelligibility was conveyed only by the speech ENVs from several frequency bands; the speech TFS within each band was replaced with noise carriers. The experiments were designed to preserve the spatial cues in the speech ENVs while retaining/eliminating them from the TFS. This was achieved by using the same/different noise carriers in the two ears. A phenomenological auditory-nerve model was used to verify that the interaural correlations in TFS differed across conditions, whereas the ENVs retained a high degree of correlation, as intended. Overall, the results from this study revealed that binaural TFS cues, especially for frequency regions below 1500 Hz, are critical for achieving SRM in NH listeners. Potential implications for studying SRM in hearing-impaired listeners are discussed.
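The same/different carrier manipulation can be illustrated with a toy single-band example. The sketch below is not the authors' stimulus code: the sinusoidal envelope, the sampling rate, the seeds, and the helper names (`envelope`, `noise`, `vocoded_pair`, `correlation`, `rectify`) are all assumptions made for illustration. It multiplies one shared envelope by either the same or independent Gaussian noise carriers for the two ears, then compares the interaural correlation of the waveforms with that of a crude envelope proxy (the rectified waveform).

```python
import math
import random


def envelope(n, rate_hz=4.0, fs=8000):
    """Slow raised sinusoid standing in for one band's speech envelope (assumed)."""
    return [0.5 * (1.0 + math.sin(2.0 * math.pi * rate_hz * i / fs)) for i in range(n)]


def noise(n, seed):
    """Gaussian noise carrier; a fixed seed makes the example repeatable."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def correlation(x, y):
    """Pearson correlation at zero lag."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den


def vocoded_pair(n, same_carrier):
    """Return (left, right) signals that share one envelope.

    same_carrier=True reuses the left-ear carrier in the right ear;
    same_carrier=False draws an independent carrier for the right ear.
    """
    env = envelope(n)
    left_carrier = noise(n, seed=1)
    right_carrier = left_carrier if same_carrier else noise(n, seed=2)
    left = [e * c for e, c in zip(env, left_carrier)]
    right = [e * c for e, c in zip(env, right_carrier)]
    return left, right


def rectify(x):
    """Full-wave rectification, used here as a rough envelope proxy."""
    return [abs(v) for v in x]


same_l, same_r = vocoded_pair(8000, same_carrier=True)
diff_l, diff_r = vocoded_pair(8000, same_carrier=False)

waveform_corr_same = correlation(same_l, same_r)
waveform_corr_diff = correlation(diff_l, diff_r)
envelope_corr_diff = correlation(rectify(diff_l), rectify(diff_r))
```

With the same carrier the two ears receive identical waveforms, so the waveform correlation is 1. With independent carriers it falls close to 0, while the rectified signals remain positively correlated because they share the envelope. The study itself verified its conditions with an auditory-nerve model, which this toy comparison does not attempt to reproduce.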

SIGNIFICANCE STATEMENT

Acoustic signals received by the auditory system pass first through an array of physiologically based band-pass filters. Conceptually, at the output of each filter, there are two principal forms of temporal information: slowly varying fluctuations in the envelope (ENV) and rapidly varying fluctuations in the temporal fine structure (TFS). The importance of these two types of information in everyday listening (e.g., conversing in a noisy social situation; the "cocktail-party" problem) has not been established. This study assessed the contributions of binaural ENV and TFS cues for understanding speech in multiple-talker situations. Results suggest that, whereas the ENV cues are important for speech intelligibility, binaural TFS cues are critical for perceptually segregating the different talkers and thus for solving the cocktail party problem.
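The ENV/TFS split at the output of a single band-pass filter is often formalized with the Hilbert analytic signal. The abstract does not say which decomposition the authors used, so the following is a generic sketch under that assumption. The function names (`dft`, `analytic_signal`, `env_and_tfs`) and the test signal are hypothetical, and a naive O(n^2) DFT is used only to keep the example dependency-free.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (adequate for short demo signals)."""
    n = len(x)
    sign = 1.0 if inverse else -1.0
    out = []
    for k in range(n):
        total = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(total / n if inverse else total)
    return out


def analytic_signal(x):
    """Zero the negative frequencies and double the positive ones."""
    n = len(x)
    if n % 2:
        raise ValueError("this sketch assumes an even number of samples")
    spectrum = dft(x)
    half = n // 2
    gains = [1.0 if k in (0, half) else 2.0 if k < half else 0.0 for k in range(n)]
    return dft([g * s for g, s in zip(gains, spectrum)], inverse=True)


def env_and_tfs(x):
    """ENV is the magnitude of the analytic signal; TFS is the cosine of its phase."""
    z = analytic_signal(x)
    env = [abs(v) for v in z]
    tfs = [math.cos(cmath.phase(v)) for v in z]
    return env, tfs


# Amplitude-modulated tone: a slow envelope multiplying a fast carrier.
N, CARRIER_BIN, MOD_BIN = 128, 16, 2
true_env = [1.0 + 0.5 * math.cos(2.0 * math.pi * MOD_BIN * t / N) for t in range(N)]
true_tfs = [math.cos(2.0 * math.pi * CARRIER_BIN * t / N) for t in range(N)]
signal = [e * c for e, c in zip(true_env, true_tfs)]

env, tfs = env_and_tfs(signal)
max_env_error = max(abs(a - b) for a, b in zip(env, true_env))
max_tfs_error = max(abs(a - b) for a, b in zip(tfs, true_tfs))
```

For this idealized signal, whose envelope varies much more slowly than its carrier, the decomposition recovers both components to within floating-point error. Real speech bands are less well behaved, so the recovered ENV and TFS there are approximations.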
