• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

声强度包络的视觉模拟有助于噪声下的言语感知。

Visual analog of the acoustic amplitude envelope benefits speech perception in noise.

机构信息

Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville, Florida 32610, USA.

Department of Linguistics, University of Florida, Gainesville, Florida 32611,

出版信息

J Acoust Soc Am. 2020 Mar;147(3):EL246. doi: 10.1121/10.0000737.

DOI:10.1121/10.0000737
PMID:32237828
Abstract

The nature of the visual input that integrates with the audio signal to yield speech processing advantages remains controversial. This study tests the hypothesis that the information extracted for audiovisual integration includes co-occurring suprasegmental dynamic changes in the acoustic and visual signal. English sentences embedded in multi-talker babble noise were presented to native English listeners in audio-only and audiovisual modalities. A significant intelligibility enhancement with the visual analogs congruent to the acoustic amplitude envelopes was observed. These results suggest that dynamic visual modulation provides speech rhythmic information that can be integrated online with the audio signal to enhance speech intelligibility.

摘要

视觉输入的本质与音频信号相结合,产生了言语处理优势,这一问题仍存在争议。本研究检验了这样一个假设,即用于视听整合的信息包括声学和视觉信号中同时发生的超音段动态变化。将嵌入多说话人背景噪声的英语句子仅以音频和视听两种方式呈现给以英语为母语的听众。观察到与声学幅度包络一致的视觉模拟有显著的可懂度增强。这些结果表明,动态视觉调制提供了言语节奏信息,可以与音频信号在线整合,从而提高言语可懂度。

相似文献

1
Visual analog of the acoustic amplitude envelope benefits speech perception in noise.声强度包络的视觉模拟有助于噪声下的言语感知。
J Acoust Soc Am. 2020 Mar;147(3):EL246. doi: 10.1121/10.0000737.
2
Effects of Visual Speech Envelope on Audiovisual Speech Perception in Multitalker Listening Environments.多说话人聆听环境下视觉语音包络对视听语音感知的影响。
J Speech Lang Hear Res. 2021 Jul 16;64(7):2845-2853. doi: 10.1044/2021_JSLHR-20-00688. Epub 2021 Jun 8.
3
The Impact of Temporally Coherent Visual Cues on Speech Perception in Complex Auditory Environments.时间连贯视觉线索对复杂听觉环境中语音感知的影响。
Front Neurosci. 2021 Jun 7;15:678029. doi: 10.3389/fnins.2021.678029. eCollection 2021.
4
Enhancing speech intelligibility: interactions among context, modality, speech style, and masker.提高言语可懂度:语境、模态、言语风格和掩蔽音之间的相互作用。
J Speech Lang Hear Res. 2014 Oct;57(5):1908-18. doi: 10.1044/JSLHR-H-13-0076.
5
Reduced efficiency of audiovisual integration for nonnative speech.非母语语音的视听整合效率降低。
J Acoust Soc Am. 2013 Nov;134(5):EL387-93. doi: 10.1121/1.4822320.
6
Non-native listeners' recognition of high-variability speech using PRESTO.非母语听众使用PRESTO对高变异性语音的识别。
J Am Acad Audiol. 2014 Oct;25(9):869-92. doi: 10.3766/jaaa.25.9.9.
7
Visual Speech Benefit in Clear and Degraded Speech Depends on the Auditory Intelligibility of the Talker and the Number of Background Talkers.视觉语音对言语清晰度和可懂度的影响取决于说话者的听觉可懂度和背景说话者的数量。
Trends Hear. 2019 Jan-Dec;23:2331216519837866. doi: 10.1177/2331216519837866.
8
English sentence recognition in speech-shaped noise and multi-talker babble for English-, Chinese-, and Korean-native listeners.英语、汉语和韩语母语者在语音噪声和多说话人背景噪声中的英语句子识别。
J Acoust Soc Am. 2012 Nov;132(5):EL391-7. doi: 10.1121/1.4757730.
9
Pre- and Postoperative Binaural Unmasking for Bimodal Cochlear Implant Listeners.双耳双模人工耳蜗植入患者术前术后的双侧掩蔽
Ear Hear. 2017 Sep/Oct;38(5):554-567. doi: 10.1097/AUD.0000000000000420.
10
Speech Perception in Noise With Formant Enhancement for Older Listeners.为老年听众增强共振峰的噪声下言语感知。
J Speech Lang Hear Res. 2019 Sep 20;62(9):3290-3301. doi: 10.1044/2019_JSLHR-S-18-0089. Epub 2019 Sep 3.

引用本文的文献

1
Spectral Features Analysis for Print Quality Prediction in Additive Manufacturing: An Acoustics-Based Approach.增材制造中用于打印质量预测的光谱特征分析:一种基于声学的方法。
Sensors (Basel). 2024 Jul 26;24(15):4864. doi: 10.3390/s24154864.
2
The effects of temporal cues, point-light displays, and faces on speech identification and listening effort.时线索、点光显示和面部对言语识别和听力努力的影响。
PLoS One. 2023 Nov 29;18(11):e0290826. doi: 10.1371/journal.pone.0290826. eCollection 2023.
3
Dissociable Neural Correlates of Multisensory Coherence and Selective Attention.
分离多感觉一致性和选择性注意的神经相关物。
J Neurosci. 2023 Jun 21;43(25):4697-4708. doi: 10.1523/JNEUROSCI.1310-22.2023. Epub 2023 May 23.
4
Speech-In-Noise Comprehension is Improved When Viewing a Deep-Neural-Network-Generated Talking Face.观看深度神经网络生成的说话人脸可提高噪声环境下的言语理解能力。
Trends Hear. 2022 Jan-Dec;26:23312165221136934. doi: 10.1177/23312165221136934.
5
Multisensory benefits for speech recognition in noisy environments.在嘈杂环境中语音识别的多感官益处。
Front Neurosci. 2022 Oct 20;16:1031424. doi: 10.3389/fnins.2022.1031424. eCollection 2022.
6
Independent mechanisms of temporal and linguistic cue correspondence benefiting audiovisual speech processing.独立的时间和语言线索对应机制有益于视听语音处理。
Atten Percept Psychophys. 2022 Aug;84(6):2016-2026. doi: 10.3758/s13414-022-02440-3. Epub 2022 Feb 24.
7
The Impact of Temporally Coherent Visual Cues on Speech Perception in Complex Auditory Environments.时间连贯视觉线索对复杂听觉环境中语音感知的影响。
Front Neurosci. 2021 Jun 7;15:678029. doi: 10.3389/fnins.2021.678029. eCollection 2021.
8
Development of the Mechanisms Underlying Audiovisual Speech Perception Benefit.视听言语感知益处背后机制的发展
Brain Sci. 2021 Jan 5;11(1):49. doi: 10.3390/brainsci11010049.