Department of Speech, Language, and Hearing Sciences, Indiana University, Bloomington, IN, United States.
Program in Neuroscience, Indiana University, Bloomington, IN, United States.
Cereb Cortex. 2023 Sep 9;33(18):10076-10086. doi: 10.1093/cercor/bhad266.
So-called duplex speech stimuli with perceptually ambiguous spectral cues to one ear and isolated low- versus high-frequency third formant "chirp" to the opposite ear yield a coherent percept supporting their phonetic categorization. Critically, such dichotic sounds are only perceived categorically upon binaural integration. Here, we used frequency-following responses (FFRs), scalp-recorded potentials reflecting phase-locked subcortical activity, to investigate brainstem responses to fused speech percepts and to determine whether FFRs reflect binaurally integrated category-level representations. We recorded FFRs to diotic and dichotic stop-consonants (/da/, /ga/) that either did or did not require binaural fusion to properly label along with perceptually ambiguous sounds without clear phonetic identity. Behaviorally, listeners showed clear categorization of dichotic speech tokens confirming they were heard with a fused, phonetic percept. Neurally, we found FFRs were stronger for categorically perceived speech relative to category-ambiguous tokens but also differentiated phonetic categories for both diotically and dichotically presented speech sounds. Correlations between neural and behavioral data further showed FFR latency predicted the degree to which listeners labeled tokens as "da" versus "ga." The presence of binaurally integrated, category-level information in FFRs suggests human brainstem processing reflects a surprisingly abstract level of the speech code typically circumscribed to much later cortical processing.
所谓的双重语音刺激,其一只耳朵的频谱线索具有感知上的歧义,而另一只耳朵的第三共振峰则是孤立的低-高频率“啁啾”,这会产生一个连贯的感知,支持它们的语音分类。关键是,只有在双耳整合的情况下,这种双声道声音才能被感知为分类。在这里,我们使用频率跟随反应(FFR),头皮记录的电位反映了相位锁定的皮质下活动,来研究大脑对融合语音感知的反应,并确定 FFR 是否反映了双耳整合的类别水平表示。我们记录了对单声道和双声道停止辅音(/da/,/ga/)的 FFR,这些辅音要么不需要双耳融合才能正确标记,要么与没有明确语音身份的感知上歧义的声音一起。行为上,听众对双声道语音令牌进行了清晰的分类,这证实了他们听到的是融合的、语音感知。在神经上,我们发现 FFR 对于分类感知的语音相对于类别模糊的语音更强,但也区分了双声道和单声道呈现的语音的语音类别。神经和行为数据之间的相关性进一步表明,FFR 潜伏期预测了听众将令牌标记为“da”与“ga”的程度。FFR 中存在双耳整合的类别水平信息表明,人类脑干处理反映了语音代码的一个惊人的抽象水平,通常局限于更晚的皮质处理。