Green K P, Kuhl P K, Meltzoff A N, Stevens E B
University of Arizona, Tucson 85721.
Percept Psychophys. 1991 Dec;50(6):524-36. doi: 10.3758/bf03207536.
Studies of the McGurk effect have shown that when discrepant phonetic information is delivered to the auditory and visual modalities, the information is combined into a new percept not originally presented to either modality. In typical experiments, the auditory and visual speech signals are generated by the same talker. The present experiment examined whether a discrepancy in the gender of the talker between the auditory and visual signals would influence the magnitude of the McGurk effect. A male talker's voice was dubbed onto a videotape containing a female talker's face, and vice versa. The gender-incongruent videotapes were compared with gender-congruent videotapes, in which a male talker's voice was dubbed onto a male face and a female talker's voice was dubbed onto a female face. Even though there was a clear incompatibility in talker characteristics between the auditory and visual signals on the incongruent videotapes, the resulting magnitude of the McGurk effect was not significantly different for the incongruent as opposed to the congruent videotapes. The results indicate that the mechanism for integrating speech information from the auditory and the visual modalities is not disrupted by a gender incompatibility even when it is perceptually apparent. The findings are compatible with the theoretical notion that information about voice characteristics of the talker is extracted and used to normalize the speech signal at an early stage of phonetic processing, prior to the integration of the auditory and the visual information.
对麦格克效应的研究表明,当不一致的语音信息分别通过听觉和视觉通道呈现时,这些信息会被整合为一种新的感知,而这种感知并非最初单独通过任何一个通道呈现的。在典型实验中,听觉和视觉语音信号由同一名说话者发出。本实验探究了听觉和视觉信号中说话者性别不一致是否会影响麦格克效应的程度。将男性说话者的声音配音到包含女性说话者面部的录像带上,反之亦然。将性别不一致的录像带与性别一致的录像带进行比较,在性别一致的录像带中,男性说话者的声音配音到男性面部,女性说话者的声音配音到女性面部。尽管在不一致的录像带上,听觉和视觉信号之间说话者特征存在明显不匹配,但与一致的录像带相比,不一致的录像带所产生的麦格克效应程度并无显著差异。结果表明,即使在感知上很明显存在性别不匹配,整合来自听觉和视觉通道的语音信息的机制也不会受到干扰。这些发现与以下理论观点相符:在语音处理的早期阶段,即在整合听觉和视觉信息之前,关于说话者声音特征的信息就已被提取并用于对语音信号进行归一化处理。