Department of Neurosurgery, Baylor College of Medicine, Houston, United States.
eLife. 2019 Aug 8;8:e48116. doi: 10.7554/eLife.48116.
Visual information about speech content from the talker's mouth is often available before auditory information from the talker's voice. Here we examined perceptual and neural responses to words with and without this visual head start. For both types of words, perception was enhanced by viewing the talker's face, but the enhancement was significantly greater for words with a head start. Neural responses were measured from electrodes implanted over auditory association cortex in the posterior superior temporal gyrus (pSTG) of epileptic patients. The presence of visual speech suppressed responses to auditory speech, more so for words with a visual head start. We suggest that the head start inhibits representations of incompatible auditory phonemes, increasing perceptual accuracy and decreasing total neural responses. Together with previous work showing visual cortex modulation (Ozker et al., 2018b), these results from pSTG demonstrate that multisensory interactions are a powerful modulator of activity throughout the speech perception network.