Correlation between audio-visual enhancement of speech in different noise environments and SNR: a combined behavioral and electrophysiological study.

Affiliation

School of Computer Science and Technology, Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin University, Tianjin 300072, PR China.

Publication information

Neuroscience. 2013 Sep 5;247:145-51. doi: 10.1016/j.neuroscience.2013.05.007. Epub 2013 May 11.

DOI:10.1016/j.neuroscience.2013.05.007

PMID:23673276

Abstract

In the present study, we quantified multisensory gain in two ways across different noise environments: behaviorally, as the difference in speech recognition accuracy between the audio-visual (AV) and auditory-only (A) conditions, and electrophysiologically, as the difference between the event-related potentials (ERPs) evoked under the AV condition and the sum of the ERPs evoked under the A and visual-only (V) conditions. The stimuli were videos of a female speaker articulating Chinese monosyllabic words, accompanied by pink noise at different levels. The selected signal-to-noise ratios (SNRs) were -16, -12, -8, -4 and 0 dB. Speech recognition accuracy was measured, and the evoked ERPs were analyzed, under the A, V and AV conditions. The behavioral results showed that the gain, measured as the difference in speech recognition accuracy between the AV and A conditions, was greatest at the -12 dB SNR. The ERP results showed that, in the 130-200 ms time window over the frontal-to-central region, the multisensory gain (the AV ERP minus the sum of the A and V ERPs) was significantly higher at the -12 dB SNR than at the other SNRs. The multisensory gains in audio-visual speech recognition at different SNRs were not fully consistent with the principle of inverse effectiveness, but conformed to cross-modal stochastic resonance.
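The two gain measures defined in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' analysis pipeline: the function names, the averaging of the ERP difference over the time window, and every number below are hypothetical. The accuracy values are invented and were chosen only so that the peak falls at an intermediate SNR, as the abstract reports.

```python
from statistics import mean


def behavioral_gain(acc_av, acc_a):
    """Behavioral gain per SNR: AV accuracy minus A accuracy."""
    return {snr: acc_av[snr] - acc_a[snr] for snr in acc_av}


def erp_gain(erp_av, erp_a, erp_v, times_ms, window=(130, 200)):
    """ERP gain: mean of AV - (A + V) over samples inside the time window.

    Averaging over the window is an assumption of this sketch; the
    abstract only names the 130-200 ms window, not the summary statistic.
    """
    diffs = [
        av - (a + v)
        for t, av, a, v in zip(times_ms, erp_av, erp_a, erp_v)
        if window[0] <= t <= window[1]
    ]
    return mean(diffs)


# Invented accuracies in percent correct (NOT the study's data).
acc_a = {-16: 10, -12: 20, -8: 45, -4: 70, 0: 90}
acc_av = {-16: 30, -12: 55, -8: 70, -4: 85, 0: 95}

gains = behavioral_gain(acc_av, acc_a)
best_snr = max(gains, key=gains.get)  # SNR with the largest AV - A gain

# Invented single-channel ERP samples in microvolts (NOT the study's data).
times_ms = [100, 150, 200, 250]
toy_gain = erp_gain(
    erp_av=[1.0, 3.0, 2.0, 0.5],
    erp_a=[0.5, 1.0, 1.0, 0.25],
    erp_v=[0.5, 1.0, 0.5, 0.25],
    times_ms=times_ms,
)

print(gains, best_snr, toy_gain)
```

With real data, `erp_gain` would be evaluated separately for each SNR and for each frontal-to-central electrode, and the resulting values compared across SNRs; a nonzero AV - (A + V) difference is what the additive criterion treats as evidence of multisensory interaction.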

Similar articles

Effects of spatial congruity on audio-visual multimodal integration.

J Cogn Neurosci. 2005 Sep;17(9):1396-409. doi: 10.1162/0898929054985383.

Neural correlates of multisensory integration of ecologically valid audiovisual events.

J Cogn Neurosci. 2007 Dec;19(12):1964-73. doi: 10.1162/jocn.2007.19.12.1964.

Using EEG and stimulus context to probe the modelling of auditory-visual speech.

Cortex. 2016 Feb;75:220-230. doi: 10.1016/j.cortex.2015.03.010. Epub 2015 Apr 17.

Eye Can Hear Clearly Now: Inverse Effectiveness in Natural Audiovisual Speech Processing Relies on Long-Term Crossmodal Temporal Integration.

J Neurosci. 2016 Sep 21;36(38):9888-95. doi: 10.1523/JNEUROSCI.1396-16.2016.

The Effect of Signal to Noise Ratio on Cortical Auditory-Evoked Potentials Elicited to Speech Stimuli in Infants and Adults With Normal Hearing.

Ear Hear. 2018 Mar/Apr;39(2):305-317. doi: 10.1097/AUD.0000000000000487.

Inverse effectiveness and multisensory interactions in visual event-related potentials with audiovisual speech.

Brain Topogr. 2012 Jul;25(3):308-26. doi: 10.1007/s10548-012-0220-7. Epub 2012 Feb 25.

Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments.

Cereb Cortex. 2007 May;17(5):1147-53. doi: 10.1093/cercor/bhl024. Epub 2006 Jun 19.

Haptic and visual information speed up the neural processing of auditory speech in live dyadic interactions.

Neuropsychologia. 2014 May;57:71-7. doi: 10.1016/j.neuropsychologia.2014.02.004. Epub 2014 Feb 11.

ERP evidence that auditory-visual speech facilitates working memory in younger and older adults.

Psychol Aging. 2013 Jun;28(2):481-94. doi: 10.1037/a0031243. Epub 2013 Feb 18.

Cited by

Prior multisensory learning can facilitate auditory-only voice-identity and speech recognition in noise.

Q J Exp Psychol (Hove). 2024 Sep 20;78(7):17470218241278649. doi: 10.1177/17470218241278649.

Hearing, seeing, and feeling speech: the neurophysiological correlates of trimodal speech perception.

Front Hum Neurosci. 2023 Aug 29;17:1225976. doi: 10.3389/fnhum.2023.1225976. eCollection 2023.

Deficient Audiovisual Speech Perception in Schizophrenia: An ERP Study.

Brain Sci. 2023 Jun 19;13(6):970. doi: 10.3390/brainsci13060970.

Effect of face masks on speech perception in noise of individuals with hearing aids.

Front Neurosci. 2022 Dec 1;16:1036767. doi: 10.3389/fnins.2022.1036767. eCollection 2022.

Random noise stimulation in the treatment of patients with neurological disorders.

Neural Regen Res. 2022 Dec;17(12):2557-2562. doi: 10.4103/1673-5374.339474.

The Impact of Temporally Coherent Visual Cues on Speech Perception in Complex Auditory Environments.

Front Neurosci. 2021 Jun 7;15:678029. doi: 10.3389/fnins.2021.678029. eCollection 2021.

Stimulus intensity modulates multisensory temporal processing.

Neuropsychologia. 2016 Jul 29;88:92-100. doi: 10.1016/j.neuropsychologia.2016.02.016. Epub 2016 Feb 23.

EEG gamma-band activity during audiovisual speech comprehension in different noise environments.

Cogn Neurodyn. 2015 Aug;9(4):389-98. doi: 10.1007/s11571-015-9333-5. Epub 2015 Feb 22.

Effect of mechanical tactile noise on amplitude of visual evoked potentials: multisensory stochastic resonance.

J Neurophysiol. 2015 Oct;114(4):2132-43. doi: 10.1152/jn.00457.2015. Epub 2015 Jul 8.

Neural dynamics of audiovisual speech integration under variable listening conditions: an individual participant analysis.

Front Psychol. 2013 Sep 10;4:615. doi: 10.3389/fpsyg.2013.00615. eCollection 2013.
