Integrating speech information across talkers, gender, and sensory modality: female faces and male voices in the McGurk effect.

Author information

Green K P, Kuhl P K, Meltzoff A N, Stevens E B

Affiliation

University of Arizona, Tucson 85721.

Publication information

Percept Psychophys. 1991 Dec;50(6):524-36. doi: 10.3758/bf03207536.

DOI:10.3758/bf03207536

PMID:1780200

Abstract

Studies of the McGurk effect have shown that when discrepant phonetic information is delivered to the auditory and visual modalities, the information is combined into a new percept not originally presented to either modality. In typical experiments, the auditory and visual speech signals are generated by the same talker. The present experiment examined whether a discrepancy in the gender of the talker between the auditory and visual signals would influence the magnitude of the McGurk effect. A male talker's voice was dubbed onto a videotape containing a female talker's face, and vice versa. The gender-incongruent videotapes were compared with gender-congruent videotapes, in which a male talker's voice was dubbed onto a male face and a female talker's voice was dubbed onto a female face. Even though there was a clear incompatibility in talker characteristics between the auditory and visual signals on the incongruent videotapes, the resulting magnitude of the McGurk effect was not significantly different for the incongruent as opposed to the congruent videotapes. The results indicate that the mechanism for integrating speech information from the auditory and the visual modalities is not disrupted by a gender incompatibility even when it is perceptually apparent. The findings are compatible with the theoretical notion that information about voice characteristics of the talker is extracted and used to normalize the speech signal at an early stage of phonetic processing, prior to the integration of the auditory and the visual information.

Similar articles

Integrating speech information across talkers, gender, and sensory modality: female faces and male voices in the McGurk effect.

Percept Psychophys. 1991 Dec;50(6):524-36. doi: 10.3758/bf03207536.

A Causal Inference Model Explains Perception of the McGurk Effect and Other Incongruent Audiovisual Speech.

PLoS Comput Biol. 2017 Feb 16;13(2):e1005229. doi: 10.1371/journal.pcbi.1005229. eCollection 2017 Feb.

The role of visual information in the processing of place and manner features in speech perception.

Percept Psychophys. 1989 Jan;45(1):34-42. doi: 10.3758/bf03208030.

Stimulus variability and processing dependencies in speech perception.

Percept Psychophys. 1990 Apr;47(4):379-90. doi: 10.3758/bf03210878.

Facial identity and facial speech processing: familiar faces and voices in the McGurk effect.

Percept Psychophys. 1995 Nov;57(8):1124-33. doi: 10.3758/bf03208369.

McGurk effect in non-English listeners: few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility.

J Acoust Soc Am. 1991 Oct;90(4 Pt 1):1797-805. doi: 10.1121/1.401660.

Processing of changes in visual speech in the human auditory cortex.

Brain Res Cogn Brain Res. 2002 May;13(3):417-25. doi: 10.1016/s0926-6410(02)00053-8.

Listener sensitivity to individual talker differences in voice-onset-time.

J Acoust Soc Am. 2004 Jun;115(6):3171-83. doi: 10.1121/1.1701898.

Gaze behavior in audiovisual speech perception: the influence of ocular fixations on the McGurk effect.

Percept Psychophys. 2003 May;65(4):553-67. doi: 10.3758/bf03194582.

The unity hypothesis revisited: can the male/female incongruent McGurk effect be disrupted by familiarization and priming?

Front Psychol. 2023 Aug 29;14:1106562. doi: 10.3389/fpsyg.2023.1106562. eCollection 2023.

Cited by

Compensation for coarticulation despite a midway speaker change: Reassessing effects and implications.

PLoS One. 2024 Jan 12;19(1):e0291992. doi: 10.1371/journal.pone.0291992. eCollection 2024.

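The unity hypothesis revisited: can the male/female incongruent McGurk effect be disrupted by familiarization and priming?

Front Psychol. 2023 Aug 29;14:1106562. doi: 10.3389/fpsyg.2023.1106562. eCollection 2023.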

Audiovisual speech perception: Moving beyond McGurk.

J Acoust Soc Am. 2022 Dec;152(6):3216. doi: 10.1121/10.0015262.

The other-race effect on the McGurk effect in infancy.

Atten Percept Psychophys. 2021 Oct;83(7):2924-2936. doi: 10.3758/s13414-021-02342-w. Epub 2021 Aug 13.

Cross-modal effects in speech perception.

Annu Rev Linguist. 2019 Jan;5(1):49-66. doi: 10.1146/annurev-linguistics-011718-012353. Epub 2018 Aug 1.

Visual Influences on Auditory Behavioral, Neural, and Perceptual Processes: A Review.

J Assoc Res Otolaryngol. 2021 Jul;22(4):365-386. doi: 10.1007/s10162-021-00789-0. Epub 2021 May 20.

Effects of age and left hemisphere lesions on audiovisual integration of speech.

Brain Lang. 2020 Jul;206:104812. doi: 10.1016/j.bandl.2020.104812. Epub 2020 May 21.

No "Self" Advantage for Audiovisual Speech Aftereffects.

Front Psychol. 2019 Mar 22;10:658. doi: 10.3389/fpsyg.2019.00658. eCollection 2019.

A causal inference explanation for enhancement of multisensory integration by co-articulation.

Sci Rep. 2018 Dec 21;8(1):18032. doi: 10.1038/s41598-018-36772-8.

What accounts for individual differences in susceptibility to the McGurk effect?

PLoS One. 2018 Nov 12;13(11):e0207160. doi: 10.1371/journal.pone.0207160. eCollection 2018.

References

Phoneme perception in lipreading.

J Speech Hear Res. 1960 Sep;3:212-22. doi: 10.1044/jshr.0303.212.

The role of visual-auditory "compellingness" in the ventriloquism effect: implications for transitivity among the spatial senses.

Percept Psychophys. 1981 Dec;30(6):557-64. doi: 10.3758/bf03202010.

Audiovisual presentation demonstrates that selective adaptation in speech perception is purely auditory.

Percept Psychophys. 1981 Oct;30(4):309-14. doi: 10.3758/bf03206144.

Duplex perception of cues for stop consonants: evidence for a phonetic mode.

Percept Psychophys. 1981 Aug;30(2):133-43. doi: 10.3758/bf03204471.

The detection of auditory visual desynchrony.

Perception. 1980;9(6):719-21. doi: 10.1068/p090719.

Coarticulation effects in lipreading.

J Speech Hear Res. 1982 Dec;25(4):600-7. doi: 10.1044/jshr.2504.600.

The bimodal perception of speech in infancy.

Science. 1982 Dec 10;218(4577):1138-41. doi: 10.1126/science.7146899.

Phonetic prototypes.

Percept Psychophys. 1982 Apr;31(4):307-14. doi: 10.3758/bf03202653.

Immediate perceptual response to intersensory discrepancy.

Psychol Bull. 1980 Nov;88(3):638-67.

A possible auditory basis for internal structure of phonetic categories.

J Acoust Soc Am. 1983 Jun;73(6):2124-33. doi: 10.1121/1.389455.
