Multisensory benefits for speech recognition in noisy environments.

Authors

Oh Yonghee, Schwalm Meg, Kalpin Nicole

Affiliations

Department of Otolaryngology-Head and Neck Surgery and Communicative Disorders, University of Louisville, Louisville, KY, United States.

Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville, FL, United States.

Publication information

Front Neurosci. 2022 Oct 20;16:1031424. doi: 10.3389/fnins.2022.1031424. eCollection 2022.

DOI:10.3389/fnins.2022.1031424
PMID:36340778
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9630463/
Abstract

A series of our previous studies explored the use of an abstract visual representation of the amplitude envelope cues from target sentences to benefit speech perception in complex listening environments. The purpose of this study was to expand this auditory-visual speech perception to the tactile domain. Twenty adults participated in speech recognition measurements in four different sensory modalities (AO, auditory-only; AV, auditory-visual; AT, auditory-tactile; AVT, auditory-visual-tactile). The target sentences were fixed at 65 dB sound pressure level and embedded within a simultaneous speech-shaped noise masker of varying degrees of signal-to-noise ratios (-7, -5, -3, -1, and 1 dB SNR). The amplitudes of both abstract visual and vibrotactile stimuli were temporally synchronized with the target speech envelope for comparison. Average results showed that adding temporally-synchronized multimodal cues to the auditory signal did provide significant improvements in word recognition performance across all three multimodal stimulus conditions (AV, AT, and AVT), especially at the lower SNR levels of -7, -5, and -3 dB for both male (8-20% improvement) and female (5-25% improvement) talkers. The greatest improvement in word recognition performance (15-19% improvement for males and 14-25% improvement for females) was observed when both visual and tactile cues were integrated (AVT). Another interesting finding in this study is that temporally synchronized abstract visual and vibrotactile stimuli additively stack in their influence on speech recognition performance. Our findings suggest that a multisensory integration process in speech perception requires salient temporal cues to enhance speech recognition ability in noisy environments.
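The stimulus arrangement described in the abstract (a fixed-level target, a masker scaled to set the SNR, and a cue that follows the target's amplitude envelope) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the toy amplitude-modulated tone, the white Gaussian noise (which is not speech-shaped), the moving-average envelope, and the names `scale_masker`, `snr_db`, and `envelope` are all assumptions introduced here.

```python
import math
import random

SNRS_DB = [-7, -5, -3, -1, 1]  # SNR levels listed in the abstract


def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def scale_masker(target, masker, snr):
    """Scale the masker so that 20*log10(rms(target)/rms(masker)) equals snr.

    The target is left untouched, mirroring the fixed target level in the abstract.
    """
    gain = rms(target) / (rms(masker) * 10 ** (snr / 20))
    return [gain * m for m in masker]


def snr_db(target, masker):
    """Measured SNR in dB between a target and a masker."""
    return 20 * math.log10(rms(target) / rms(masker))


def envelope(samples, window):
    """Moving average of the rectified signal (an assumed, simple envelope estimate)."""
    rectified = [abs(s) for s in samples]
    out, running = [], 0.0
    for i, r in enumerate(rectified):
        running += r
        if i >= window:
            running -= rectified[i - window]
        out.append(running / min(i + 1, window))
    return out


# Toy target (assumption): a 200 Hz tone with 4 Hz amplitude modulation, 1 s at 8 kHz.
fs = 8000
target = [
    (0.5 + 0.5 * math.sin(2 * math.pi * 4 * n / fs)) * math.sin(2 * math.pi * 200 * n / fs)
    for n in range(fs)
]

# Stand-in masker (assumption): white Gaussian noise, not speech-shaped noise.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(fs)]

# One mixture per SNR; only the masker gain changes between conditions.
mixtures = {}
for level in SNRS_DB:
    masker = scale_masker(target, noise, level)
    mixtures[level] = [t + m for t, m in zip(target, masker)]

# Envelope of the clean target; a visual or vibrotactile cue would track this curve.
target_envelope = envelope(target, window=fs // 50)  # 20 ms window (assumption)
```

Scaling the masker rather than the target keeps the target level constant across conditions, so differences between SNR conditions come only from the noise level.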


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3532/9630463/3c6a4fc054da/fnins-16-1031424-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3532/9630463/2d872e9bab56/fnins-16-1031424-g001.jpg
