
Cross-modal interactions during perception of audiovisual speech and nonspeech signals: an fMRI study.

Affiliation

Department of General Neurology, University of Tübingen, Tübingen, Germany.

Publication

J Cogn Neurosci. 2011 Jan;23(1):221-37. doi: 10.1162/jocn.2010.21421.

Abstract

During speech communication, visual information may interact with the auditory system at various processing stages. Most noteworthy, recent magnetoencephalography (MEG) data provided first evidence for early and preattentive phonetic/phonological encoding of the visual data stream, prior to its fusion with auditory phonological features [Hertrich, I., Mathiak, K., Lutzenberger, W., & Ackermann, H. Time course of early audiovisual interactions during speech and non-speech central-auditory processing: An MEG study. Journal of Cognitive Neuroscience, 21, 259-274, 2009]. Using functional magnetic resonance imaging, the present follow-up study aims to further elucidate the topographic distribution of visual-phonological operations and audiovisual (AV) interactions during speech perception. Ambiguous acoustic syllables, disambiguated to /pa/ or /ta/ by the visual channel (speaking face), served as test materials, concomitant with various control conditions (nonspeech AV signals, visual-only and acoustic-only speech, and nonspeech stimuli). (i) Visual speech yielded an AV-subadditive activation of primary auditory cortex and the anterior superior temporal gyrus (STG), whereas the posterior STG responded both to speech and nonspeech motion. (ii) The inferior frontal and the fusiform gyrus of the right hemisphere showed a strong phonetic/phonological impact (differential effects of visual /pa/ vs. /ta/) upon hemodynamic activation during presentation of speaking faces. Taken together with the previous MEG data, these results point to a dual-pathway model of visual speech information processing: On the one hand, access to the auditory system via the anterior supratemporal "what" path may give rise to direct activation of "auditory objects." On the other hand, visual speech information seems to be represented in a right-hemisphere visual working memory, providing a potential basis for later interactions with auditory information such as the McGurk effect.

