Department of Linguistics, The Australian Hearing Hub, Macquarie University, Sydney, Australia.
Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois, United States of America.
PLoS Biol. 2021 Oct 20;19(10):e3001439. doi: 10.1371/journal.pbio.3001439. eCollection 2021 Oct.
The ability to navigate "cocktail party" situations by focusing on sounds of interest over irrelevant, background sounds is often considered in terms of cortical mechanisms. However, subcortical circuits such as the pathway underlying the medial olivocochlear (MOC) reflex modulate the activity of the inner ear itself, supporting the extraction of salient features from auditory scene prior to any cortical processing. To understand the contribution of auditory subcortical nuclei and the cochlea in complex listening tasks, we made physiological recordings along the auditory pathway while listeners engaged in detecting non(sense) words in lists of words. Both naturally spoken and intrinsically noisy, vocoded speech-filtering that mimics processing by a cochlear implant (CI)-significantly activated the MOC reflex, but this was not the case for speech in background noise, which more engaged midbrain and cortical resources. A model of the initial stages of auditory processing reproduced specific effects of each form of speech degradation, providing a rationale for goal-directed gating of the MOC reflex based on enhancing the representation of the energy envelope of the acoustic waveform. Our data reveal the coexistence of 2 strategies in the auditory system that may facilitate speech understanding in situations where the signal is either intrinsically degraded or masked by extrinsic acoustic energy. Whereas intrinsically degraded streams recruit the MOC reflex to improve representation of speech cues peripherally, extrinsically masked streams rely more on higher auditory centres to denoise signals.
人们常从皮质机制的角度来理解在“鸡尾酒会”等复杂环境中专注于感兴趣的声音、忽略不相关背景声音的能力。然而,内侧橄榄耳蜗(MOC)反射的亚皮质回路等可以调节内耳的活动,在皮质处理之前支持从听觉场景中提取显著特征。为了理解听觉皮质下核和耳蜗在复杂听力任务中的贡献,我们在听众参与检测单词列表中的非(无)义词时,沿着听觉通路进行了生理记录。自然说话和内在嘈杂的、模仿人工耳蜗(CI)处理的语音滤波都显著激活了 MOC 反射,但背景噪声中的语音则不然,背景噪声更多地利用了中脑和皮质资源。听觉处理初始阶段的模型再现了每种语音退化形式的特定效果,为基于增强声信号能量包络的表示来有针对性地控制 MOC 反射提供了依据。我们的数据揭示了听觉系统中 2 种策略的共存,这可能有助于在信号本身退化或被外部声能掩蔽的情况下理解言语。内在退化的信号流通过 MOC 反射来改善言语线索的外周表示,而外在掩蔽的信号流则更多地依赖于更高的听觉中枢来对信号进行去噪。