Vanderbilt University.
J Cogn Neurosci. 2018 Mar;30(3):319-337. doi: 10.1162/jocn_a_01205. Epub 2017 Nov 13.
Multisensory integration of visual mouth movements with auditory speech is known to offer substantial perceptual benefits, particularly under challenging (i.e., noisy) acoustic conditions. Previous work characterizing this process has found that ERPs to auditory speech are of shorter latency and smaller magnitude in the presence of visual speech. We sought to determine the dependency of these effects on the temporal relationship between the auditory and visual speech streams using EEG. We found that reductions in ERP latency and suppression of ERP amplitude are maximal when the visual signal precedes the auditory signal by a small interval and that increasing amounts of asynchrony reduce these effects in a continuous manner. Time-frequency analysis revealed that these effects are found primarily in the theta (4-8 Hz) and alpha (8-12 Hz) bands, with a central topography consistent with auditory generators. Theta effects also persisted in the lower portion of the band (3.5-5 Hz), and this late activity was more frontally distributed. Importantly, the magnitude of these late theta oscillations not only differed with the temporal characteristics of the stimuli but also served to predict participants' task performance. Our analysis thus reveals that suppression of single-trial brain responses by visual speech depends strongly on the temporal concordance of the auditory and visual inputs. It further illustrates that processes in the lower theta band, which we suggest as an index of incongruity processing, might serve to reflect the neural correlates of individual differences in multisensory temporal perception.