Department of Psychology and Program in Neuroscience, Bucknell University Lewisburg, PA, USA.
Department of Psychology, Cornell University Ithaca, NY, USA ; Department of Language and Communication, University of Southern Denmark Odense, Denmark ; Haskins Laboratories, New Haven CT, USA.
Front Psychol. 2014 May 16;5:407. doi: 10.3389/fpsyg.2014.00407. eCollection 2014.
Recent advances in the field of statistical learning have established that learners are able to track regularities of multimodal stimuli, yet it is unknown whether the statistical computations are performed on integrated representations or on separate, unimodal representations. In the present study, we investigated the ability of adults to integrate audio and visual input during statistical learning. We presented learners with a speech stream synchronized with a video of a speaker's face. In the critical condition, the visual (e.g., /gi/) and auditory (e.g., /mi/) signals were occasionally incongruent, which we predicted would produce the McGurk illusion, resulting in the perception of an audiovisual syllable (e.g., /ni/). In this way, we used the McGurk illusion to manipulate the underlying statistical structure of the speech streams, such that perception of these illusory syllables facilitated participants' ability to segment the speech stream. Our results therefore demonstrate that participants can integrate audio and visual input to perceive the McGurk illusion during statistical learning. We interpret our findings as support for modality-interactive accounts of statistical learning.
最近在统计学习领域的进展已经确立,学习者能够跟踪多模态刺激的规律,但尚不清楚统计计算是在综合表示还是在单独的、单模态表示上进行的。在本研究中,我们研究了成年人在统计学习期间整合音频和视觉输入的能力。我们向学习者呈现与说话者面部视频同步的语音流。在关键条件下,视觉(例如,/gi/)和听觉(例如,/mi/)信号偶尔会不一致,我们预计这会产生麦格克错觉,从而导致感知到视听音节(例如,/ni/)。通过这种方式,我们使用麦格克错觉来操纵语音流的基础统计结构,使得对这些幻觉音节的感知促进了参与者分割语音流的能力。因此,我们的结果表明,参与者可以整合音频和视觉输入,在统计学习期间感知麦格克错觉。我们将我们的发现解释为对统计学习的模态交互解释的支持。