Henry Kenneth S, Abrams Kristina S, Forst Johanna, Mender Matthew J, Neilans Erikson G, Idrobo Fabio, Carney Laurel H
Department of Biomedical Engineering, University of Rochester, Rochester, NY, 14642, USA.
Department of Neuroscience, University of Rochester, Rochester, NY, 14642, USA.
J Assoc Res Otolaryngol. 2017 Feb;18(1):165-181. doi: 10.1007/s10162-016-0594-4. Epub 2016 Oct 20.
Vowels make a strong contribution to speech perception under natural conditions. Vowels are encoded in the auditory nerve primarily through neural synchrony to temporal fine structure and to envelope fluctuations rather than through average discharge rate. Neural synchrony is thought to contribute less to vowel coding in central auditory nuclei, consistent with more limited synchronization to fine structure and the emergence of average-rate coding of envelope fluctuations. However, this hypothesis is largely unexplored, especially in background noise. The present study examined coding mechanisms at the level of the midbrain that support behavioral sensitivity to simple vowel-like sounds using neurophysiological recordings and matched behavioral experiments in the budgerigar. Stimuli were harmonic tone complexes with energy concentrated at one spectral peak, or formant frequency, presented in quiet and in noise. Behavioral thresholds for formant-frequency discrimination decreased with increasing amplitude of stimulus envelope fluctuations, increased in noise, and were similar between budgerigars and humans. Multiunit recordings in awake birds showed that the midbrain encodes vowel-like sounds both through response synchrony to envelope structure and through average rate. Whereas neural discrimination thresholds based on either coding scheme were sufficient to support behavioral thresholds in quiet, only synchrony-based neural thresholds could account for behavioral thresholds in background noise. These results reveal an incomplete transformation to average-rate coding of vowel-like sounds in the midbrain. Model simulations suggest that this transformation emerges due to modulation tuning, which is shared between birds and mammals. Furthermore, the results underscore the behavioral relevance of envelope synchrony in the midbrain for detection of small differences in vowel formant frequency under real-world listening conditions.
在自然条件下,元音对语音感知有重要贡献。元音在听神经中主要通过与时间精细结构和包络波动的神经同步进行编码,而不是通过平均放电率。人们认为神经同步对中枢听觉核中的元音编码贡献较小,这与对精细结构的同步性更有限以及包络波动的平均速率编码的出现相一致。然而,这一假设在很大程度上尚未得到探索,尤其是在背景噪声中。本研究利用虎皮鹦鹉的神经生理学记录和匹配的行为实验,研究了中脑水平上支持对简单元音样声音行为敏感性的编码机制。刺激是能量集中在一个频谱峰值或共振峰频率处的谐波复音,分别在安静和有噪声的环境中呈现。共振峰频率辨别行为阈值随刺激包络波动幅度的增加而降低,在噪声环境中升高,并且在虎皮鹦鹉和人类之间相似。对清醒鸟类的多单元记录表明,中脑通过对包络结构的反应同步和平均放电率来编码元音样声音。虽然基于这两种编码方案的神经辨别阈值在安静环境中足以支持行为阈值,但只有基于同步的神经阈值能够解释背景噪声中的行为阈值。这些结果揭示了中脑对元音样声音的平均速率编码的不完全转变。模型模拟表明,这种转变是由于调制调谐而出现的,鸟类和哺乳动物都有这种调制调谐。此外,研究结果强调了中脑包络同步在现实听力条件下检测元音共振峰频率微小差异方面的行为相关性。