State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing 100875, People's Republic of China.
Department of Psychology, Sun Yat-sen University, Guangzhou 510006, People's Republic of China.
Cereb Cortex. 2020 Mar 14;30(3):942-951. doi: 10.1093/cercor/bhz138.
Whether auditory processing of speech relies on reference to the speaker's articulatory motor information remains unresolved. Here, we addressed this issue under a two-brain framework. We used functional magnetic resonance imaging to record the brain activity of speakers as they told real-life stories and, later, of listeners as they listened to audio recordings of these stories. Using between-brain seed-to-voxel correlation analyses, we found that neural dynamics in the listeners' auditory temporal cortex are temporally coupled with the dynamics in the speaker's larynx/phonation area. Moreover, the coupling response in the listeners' left auditory temporal cortex follows the hierarchical organization for speech processing, with response lags increasing linearly from A1+ to STG/STS to MTG. Further, listeners showing greater coupling responses understand the speech better. When comprehension fails, this interbrain auditory-articulation coupling largely vanishes. These findings suggest that a listener's auditory system and a speaker's articulatory system are inherently aligned during naturalistic verbal interaction, and that this alignment is associated with high-level information transfer from the speaker to the listener. Our study provides reliable evidence that reference to the speaker's articulatory motor information facilitates speech comprehension in naturalistic settings.
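The between-brain seed-to-voxel correlation with response lags mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not describe preprocessing, lag range, or statistical testing, so the function name `lagged_seed_to_voxel`, the lag range, and the synthetic data below are all assumptions made for illustration. The idea is to correlate one speaker seed time series with each listener voxel time series at several listener-behind-speaker lags and take the lag with the highest correlation as that voxel's response lag.

```python
import numpy as np


def lagged_seed_to_voxel(seed, voxels, max_lag):
    """Pearson correlation between a speaker seed and listener voxels at several lags.

    seed:    shape (T,), speaker seed time series (hypothetical input).
    voxels:  shape (T, V), listener voxel time series (hypothetical input).
    max_lag: largest lag in samples; lag k means the listener trails the speaker by k.
    Returns an array of shape (max_lag + 1, V).
    """
    seed = np.asarray(seed, dtype=float)
    voxels = np.asarray(voxels, dtype=float)
    n_samples = seed.shape[0]
    corr = np.empty((max_lag + 1, voxels.shape[1]))
    for k in range(max_lag + 1):
        # Align speaker sample t with listener sample t + k.
        s = seed[: n_samples - k]
        v = voxels[k:]
        # z-score both sides, then average the products to get Pearson r.
        s = (s - s.mean()) / s.std()
        v = (v - v.mean(axis=0)) / v.std(axis=0)
        corr[k] = (s[:, None] * v).mean(axis=0)
    return corr


# Synthetic demonstration (made-up data, not from the study):
# two "voxels" that follow the seed with delays of 2 and 4 samples.
rng = np.random.default_rng(0)
seed_ts = rng.standard_normal(200)
voxel_ts = np.column_stack([
    np.roll(seed_ts, 2) + 0.1 * rng.standard_normal(200),
    np.roll(seed_ts, 4) + 0.1 * rng.standard_normal(200),
])
corr_by_lag = lagged_seed_to_voxel(seed_ts, voxel_ts, max_lag=6)
peak_lags = corr_by_lag.argmax(axis=0)  # per-voxel lag with the strongest coupling
print(peak_lags)
```

In a real analysis the lag would be expressed in repetition times (TRs), and the correlations would need group-level statistics and correction for multiple comparisons, none of which this sketch attempts.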