McHaney Jacie R, Hancock Kenneth E, Polley Daniel B, Parthasarathy Aravindakshan
Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA.
Deparment of Otolaryngology - Head and Neck Surgery, Harvard Medical School, Boston, MA.
bioRxiv. 2023 Aug 15:2023.08.13.553131. doi: 10.1101/2023.08.13.553131.
Optimal speech perception in noise requires successful separation of the target speech stream from multiple competing background speech streams. The ability to segregate these competing speech streams depends on the fidelity of bottom-up neural representations of sensory information in the auditory system and top-down influences of effortful listening. Here, we use objective neurophysiological measures of bottom-up temporal processing using envelope-following responses (EFRs) to amplitude modulated tones and investigate their interactions with pupil-indexed listening effort, as it relates to performance on the Quick speech in noise (QuickSIN) test in young adult listeners with clinically normal hearing thresholds. We developed an approach using ear-canal electrodes and adjusting electrode montages for modulation rate ranges, which extended the rage of reliable EFR measurements as high as 1024Hz. Pupillary responses revealed changes in listening effort at the two most difficult signal-to-noise ratios (SNR), but behavioral deficits at the hardest SNR only. Neither pupil-indexed listening effort nor the slope of the EFR decay function independently related to QuickSIN performance. However, a linear model using the combination of EFRs and pupil metrics significantly explained variance in QuickSIN performance. These results suggest a synergistic interaction between bottom-up sensory coding and top-down measures of listening effort as it relates to speech perception in noise. These findings can inform the development of next-generation tests for hearing deficits in listeners with normal-hearing thresholds that incorporates a multi-dimensional approach to understanding speech intelligibility deficits.
在噪声环境中实现最佳言语感知需要成功地将目标言语流与多个相互竞争的背景言语流分离。分离这些相互竞争的言语流的能力取决于听觉系统中感觉信息的自下而上神经表征的保真度以及努力聆听的自上而下的影响。在这里,我们使用基于包络跟随反应(EFRs)对调幅音进行自下而上时间处理的客观神经生理学测量方法,并研究它们与瞳孔索引的聆听努力之间的相互作用,因为这与听力阈值临床正常的年轻成年听众在噪声中的快速言语(QuickSIN)测试中的表现有关。我们开发了一种使用耳道电极并针对调制率范围调整电极蒙太奇的方法,该方法将可靠的EFR测量范围扩展到高达1024Hz。瞳孔反应揭示了在两个最困难的信噪比(SNR)下聆听努力的变化,但仅在最难的信噪比下出现行为缺陷。瞳孔索引的聆听努力和EFR衰减函数的斜率均与QuickSIN表现无关。然而,一个使用EFR和瞳孔指标组合的线性模型显著解释了QuickSIN表现中的方差。这些结果表明,自下而上的感觉编码与自上而下的聆听努力测量之间存在协同相互作用,这与噪声中的言语感知有关。这些发现可为开发下一代针对听力阈值正常的听众听力缺陷的测试提供参考,该测试采用多维方法来理解言语可懂度缺陷。