Berezutskaya Julia, Freudenburg Zachary V, Güçlü Umut, van Gerven Marcel A J, Ramsey Nick F
Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands, and
Radboud University, Donders Institute for Brain, Cognition and Behaviour, 6525 HR Nijmegen, The Netherlands.
J Neurosci. 2017 Aug 16;37(33):7906-7920. doi: 10.1523/JNEUROSCI.0238-17.2017. Epub 2017 Jul 17.
Despite a large body of research, we continue to lack a detailed account of how auditory processing of continuous speech unfolds in the human brain. Previous research showed the propagation of low-level acoustic features of speech from posterior superior temporal gyrus toward anterior superior temporal gyrus in the human brain (Hullett et al., 2016). In this study, we investigate what happens to these neural representations past the superior temporal gyrus and how they engage higher-level language processing areas such as inferior frontal gyrus. We used low-level sound features to model neural responses to speech outside of the primary auditory cortex. Two complementary imaging techniques were used with human participants (both males and females): electrocorticography (ECoG) and fMRI. Both imaging techniques showed tuning of the perisylvian cortex to low-level speech features. With ECoG, we found evidence of propagation of the temporal features of speech sounds along the ventral pathway of language processing in the brain toward inferior frontal gyrus. Increasingly coarse temporal features of speech spreading from posterior superior temporal cortex toward inferior frontal gyrus were associated with linguistic features such as voice onset time, duration of the formant transitions, and phoneme, syllable, and word boundaries. The present findings provide the groundwork for a comprehensive bottom-up account of speech comprehension in the human brain. We know that, during natural speech comprehension, a broad network of perisylvian cortical regions is involved in sound and language processing. Here, we investigated the tuning to low-level sound features within these regions using neural responses to a short feature film. We also looked at whether the tuning organization along these brain regions showed any parallel to the hierarchy of language structures in continuous speech. Our results show that low-level speech features propagate throughout the perisylvian cortex and potentially contribute to the emergence of "coarse" speech representations in inferior frontal gyrus typically associated with high-level language processing. These findings add to the previous work on auditory processing and underline a distinctive role of inferior frontal gyrus in natural speech comprehension.
尽管已有大量研究,但我们仍缺乏对人类大脑中连续语音的听觉处理过程如何展开的详细描述。先前的研究表明,语音的低级声学特征在人类大脑中从后颞上回向前颞上回传播(Hullett等人,2016年)。在本研究中,我们调查了这些神经表征在颞上回之后会发生什么,以及它们如何与诸如额下回等高级语言处理区域相互作用。我们使用低级声音特征来模拟初级听觉皮层之外对语音的神经反应。对人类参与者(包括男性和女性)使用了两种互补的成像技术:脑皮层电图(ECoG)和功能磁共振成像(fMRI)。两种成像技术均显示了外侧裂周围皮层对低级语音特征的调谐。通过ECoG,我们发现了语音声音的时间特征沿大脑语言处理的腹侧通路向下传播至额下回的证据。从后颞上皮质向前额下回传播的语音时间特征越来越粗糙,这与诸如语音起始时间、共振峰过渡持续时间以及音素、音节和单词边界等语言特征相关。本研究结果为全面自下而上地解释人类大脑中的言语理解奠定了基础。我们知道,在自然言语理解过程中,广泛的外侧裂周围皮层区域网络参与声音和语言处理。在这里,我们使用对一部短故事片的神经反应来研究这些区域内对低级声音特征的调谐情况。我们还研究了沿这些脑区的调谐组织是否与连续语音中语言结构的层次有任何平行关系。我们的结果表明,低级语音特征在整个外侧裂周围皮层中传播,并可能促成通常与高级语言处理相关的额下回中“粗糙”语音表征的出现。这些发现补充了先前关于听觉处理的研究,并强调了额下回在自然言语理解中的独特作用。