Cheng Fan-Yin, Xu Can, Gold Lisa, Smith Spencer
Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX, United States.
Front Neurosci. 2021 Dec 20;15:747303. doi: 10.3389/fnins.2021.747303. eCollection 2021.
The efferent auditory nervous system may be a potent force in shaping how the brain responds to behaviorally significant sounds. Previous human experiments using the frequency following response (FFR) have shown efferent-induced modulation of subcortical auditory function online and over short- and long-term time scales; however, a contemporary understanding of FFR generation presents new questions about whether previous effects were constrained solely to the auditory subcortex. The present experiment used sine-wave speech (SWS), an acoustically-sparse stimulus in which dynamic pure tones represent speech formant contours, to evoke FFR. Due to the higher stimulus frequencies used in SWS, this approach biased neural responses toward brainstem generators and allowed for three stimuli (/bɔ/, /bu/, and /bo/) to be used to evoke FFR and listeners in a training group were made aware that they were hearing a degraded speech stimulus. All SWS stimuli were rapidly perceived as speech when presented with a SWS carrier phrase, and average token identification reached ceiling performance during a perceptual training phase. Compared to a control group which remained naïve throughout the experiment, training group FFR amplitudes were enhanced post-training for each stimulus. Further, linear support vector machine classification of training group FFR significantly improved post-training compared to the control group, indicating that training-induced neural enhancements were sufficient to bolster machine learning classification accuracy. These results suggest that the efferent auditory system may rapidly modulate auditory brainstem representation of sounds depending on their context and perception as non-speech or speech.
传出听觉神经系统可能是塑造大脑对行为上重要声音反应方式的一股强大力量。先前使用频率跟随反应(FFR)的人体实验表明,传出神经在短期和长期时间尺度上对皮层下听觉功能进行在线调制;然而,对FFR产生的当代理解提出了新的问题,即先前的效应是否仅局限于听觉皮层下区域。本实验使用正弦波语音(SWS),这是一种声学上稀疏的刺激,其中动态纯音代表语音共振峰轮廓,以诱发FFR。由于SWS中使用的刺激频率较高,这种方法使神经反应偏向脑干发生器,并允许使用三种刺激(/bɔ/、/bu/和/bo/)来诱发FFR,并且训练组的听众知道他们听到的是一种退化的语音刺激。当与SWS载波短语一起呈现时,所有SWS刺激都能迅速被感知为语音,并且在感知训练阶段平均令牌识别达到了最高性能。与在整个实验中保持不知情的对照组相比,训练组FFR的振幅在每次刺激训练后都有所增强。此外,与对照组相比,训练组FFR的线性支持向量机分类在训练后显著改善,这表明训练引起的神经增强足以提高机器学习分类的准确性。这些结果表明,传出听觉系统可能会根据声音的背景以及它们被感知为非语音或语音的情况,迅速调节听觉脑干对声音的表征。