Effects of Visual Speech Envelope on Audiovisual Speech Perception in Multitalker Listening Environments.

Affiliation

Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville.

Publication information

J Speech Lang Hear Res. 2021 Jul 16;64(7):2845-2853. doi: 10.1044/2021_JSLHR-20-00688. Epub 2021 Jun 8.

DOI: 10.1044/2021_JSLHR-20-00688
PMID: 34100628
Abstract

Purpose: This study investigated the effects of visually presented speech envelope information with various modulation rates and depths on audiovisual speech perception in noise.

Method: Forty adults (21.25 ± 1.45 years) participated in audiovisual sentence recognition measurements in noise. Target speech sentences were presented auditorily in multitalker babble noise at a -3 dB SNR. Acoustic amplitude envelopes of the target signals were extracted through low-pass filters with different cutoff frequencies (4, 10, and 30 Hz) and a fixed modulation depth of 100% (Experiment 1), or extracted with various modulation depths (0%, 25%, 50%, 75%, and 100%) and a fixed 10-Hz modulation rate (Experiment 2). The extracted target envelopes were synchronized with the amplitude of a spherical-shaped ball and presented as visual stimuli. Subjects were instructed to attend to both the auditory and visual stimuli of the target sentences and to type their answers. Sentence recognition accuracy was compared between audio-only and audiovisual conditions.

Results: In Experiment 1, a significant improvement in speech intelligibility was observed when the visual analog (a sphere) was synchronized with the acoustic amplitude envelope modulated at a 10-Hz modulation rate, compared to the audio-only condition. In Experiment 2, the visual analog with 75% modulation depth resulted in better audiovisual speech perception in noise than the other modulation depth conditions.

Conclusion: An abstract visual analog of acoustic amplitude envelopes can be efficiently delivered by the visual system and integrated online with auditory signals to enhance speech perception in noise, independent of particular articulation movements.
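The envelope-to-sphere mapping described in the Method can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the filter design, how modulation depth was applied, or how the envelope was mapped to the sphere. The code below assumes full-wave rectification followed by a one-pole low-pass filter, a modulation depth that scales envelope fluctuations around the mean, and a linear mapping to radius. All function names, the radius range, and the synthetic test signal are hypothetical.

```python
import math


def extract_envelope(signal, sample_rate, cutoff_hz):
    """Full-wave rectify, then smooth with a one-pole low-pass filter.

    Assumption: the study only states that low-pass filters with 4, 10,
    or 30 Hz cutoffs were used; the filter type here is illustrative.
    """
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)
    envelope, state = [], 0.0
    for x in signal:
        state += alpha * (abs(x) - state)  # move toward the rectified sample
        envelope.append(state)
    return envelope


def apply_modulation_depth(envelope, depth):
    """Scale envelope fluctuations around the mean (assumed definition).

    depth = 0.0 gives a constant value; depth = 1.0 keeps the full
    peak-normalized envelope.
    """
    peak = max(envelope) or 1.0  # avoid dividing by zero for silence
    normalized = [e / peak for e in envelope]
    mean = sum(normalized) / len(normalized)
    return [mean + depth * (v - mean) for v in normalized]


def to_radius(values, r_min=0.5, r_max=1.0):
    """Map values in [0, 1] linearly to a sphere radius (hypothetical range)."""
    return [r_min + (r_max - r_min) * v for v in values]


# Synthetic stand-in for a speech signal: a 1 kHz tone whose amplitude
# fluctuates at 4 Hz, one second long at 8 kHz sampling.
sample_rate = 8000
signal = [
    (1.0 + 0.8 * math.sin(2 * math.pi * 4 * n / sample_rate))
    * math.sin(2 * math.pi * 1000 * n / sample_rate)
    for n in range(sample_rate)
]

envelope = extract_envelope(signal, sample_rate, cutoff_hz=10.0)
radii_75 = to_radius(apply_modulation_depth(envelope, 0.75))
```

Under these assumptions, a lower cutoff yields a smoother envelope and therefore slower size changes, while a lower modulation depth shrinks the size fluctuations toward a constant sphere (the 0% condition).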

Similar articles

1
Effects of Visual Speech Envelope on Audiovisual Speech Perception in Multitalker Listening Environments.
J Speech Lang Hear Res. 2021 Jul 16;64(7):2845-2853. doi: 10.1044/2021_JSLHR-20-00688. Epub 2021 Jun 8.
2
The Impact of Temporally Coherent Visual Cues on Speech Perception in Complex Auditory Environments.
Front Neurosci. 2021 Jun 7;15:678029. doi: 10.3389/fnins.2021.678029. eCollection 2021.
3
Visual analog of the acoustic amplitude envelope benefits speech perception in noise.
J Acoust Soc Am. 2020 Mar;147(3):EL246. doi: 10.1121/10.0000737.
4
The use of visible speech cues for improving auditory detection of spoken sentences.
J Acoust Soc Am. 2000 Sep;108(3 Pt 1):1197-208. doi: 10.1121/1.1288668.
5
Sustained envelope periodicity representations are associated with speech-in-noise performance in difficult listening conditions for younger and older adults.
J Neurophysiol. 2019 Oct 1;122(4):1685-1696. doi: 10.1152/jn.00845.2018. Epub 2019 Jul 31.
6
Audio-visual speech perception in noise: Implanted children and young adults versus normal hearing peers.
Int J Pediatr Otorhinolaryngol. 2017 Jan;92:146-150. doi: 10.1016/j.ijporl.2016.11.022. Epub 2016 Nov 25.
7
The benefit obtained from visually displayed text from an automatic speech recognizer during listening to speech presented in noise.
Ear Hear. 2008 Dec;29(6):838-52. doi: 10.1097/AUD.0b013e31818005bd.
8
Non-native listeners' recognition of high-variability speech using PRESTO.
J Am Acad Audiol. 2014 Oct;25(9):869-92. doi: 10.3766/jaaa.25.9.9.
9
The effect of audiovisual and binaural listening on the acceptable noise level (ANL): establishing an ANL conceptual model.
J Am Acad Audiol. 2014 Feb;25(2):141-53. doi: 10.3766/jaaa.25.2.3.
10
Human neuromagnetic steady-state responses to amplitude-modulated tones, speech, and music.
Ear Hear. 2014 Jul-Aug;35(4):461-7. doi: 10.1097/AUD.0000000000000033.

Cited by

1
Impact of High- and Low-Pass Acoustic Filtering on Audiovisual Speech Redundancy and Benefit in Children.
Ear Hear. 2025;46(3):735-746. doi: 10.1097/AUD.0000000000001622. Epub 2025 Jan 31.
2
The effects of temporal cues, point-light displays, and faces on speech identification and listening effort.
PLoS One. 2023 Nov 29;18(11):e0290826. doi: 10.1371/journal.pone.0290826. eCollection 2023.
3
Speech-derived haptic stimulation enhances speech recognition in a multi-talker background.
Sci Rep. 2023 Oct 3;13(1):16621. doi: 10.1038/s41598-023-43644-3.
4
Multisensory benefits for speech recognition in noisy environments.
Front Neurosci. 2022 Oct 20;16:1031424. doi: 10.3389/fnins.2022.1031424. eCollection 2022.
5
The Impact of Temporally Coherent Visual Cues on Speech Perception in Complex Auditory Environments.
Front Neurosci. 2021 Jun 7;15:678029. doi: 10.3389/fnins.2021.678029. eCollection 2021.