• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

噪声环境下语音和掩蔽书面文本的视听感知。

Audiovisual perception of speech in noise and masked written text.

作者信息

Zekveld Adriana A, Kramer Sophia E, Vlaming Marcel S M G, Houtgast Tammo

机构信息

EMGO Institute, ENT/Audiology, VU University Medical Center, Amsterdam, The Netherlands.

出版信息

Ear Hear. 2008 Jan;29(1):99-111. doi: 10.1097/AUD.0b013e31815d6d8d.

DOI:10.1097/AUD.0b013e31815d6d8d
PMID:18091101
Abstract

OBJECTIVE

The aim of this study was to examine the support obtained from degraded visual information in the comprehension of speech in noise.

DESIGN

We presented sentences auditorily (speech reception threshold test), visually (text reception threshold test), and audiovisually. Presenting speech in noise and masked written text enabled the quantification and systematic variation of the amount of information presented in both modalities. Eighteen persons with normal hearing (aged 19 to 31 yr) participated. For half of them a bar pattern masked the text and for the other half random dots masked the text. The text was presented simultaneously or delayed relative to the speech. Using an adaptive procedure, the amount of information required for a correct reproduction of 50% of the sentences was determined for both the unimodal and the audiovisual stimuli. Bimodal support was defined as the difference between the observed bimodal performance and that predicted by an independent channels model. Nonparametric tests were used to evaluate the bimodal support and the effect of delaying the text.

RESULTS

Masked text substantially supported the comprehension of speech in noise; the bimodal support ranged from 15% to 25% correct. A negative effect of delaying the text was observed in some conditions for the participants who were presented the text masked by the bar pattern.

CONCLUSIONS

The ability of participants to reproduce bimodally presented sentences exceeds the performance as predicted by an independent channels model. This indicates that a relatively small amount of visual information can substantially augment speech comprehension in noise, which supports the use of visual information to improve speech comprehension by participants with hearing impairment, even if the visual information is incomplete.

摘要

目的

本研究旨在考察在噪声环境下言语理解中从退化视觉信息获得的支持。

设计

我们通过听觉(言语接受阈值测试)、视觉(文本接受阈值测试)以及视听结合的方式呈现句子。在噪声中呈现言语以及对书面文本进行掩蔽,能够对两种模态中呈现的信息量进行量化和系统变化。18名听力正常的人(年龄在19至31岁之间)参与了研究。其中一半人的文本被条形图案掩蔽,另一半人的文本被随机点掩蔽。文本与言语同时呈现或延迟呈现。使用自适应程序,确定单模态和视听刺激正确再现50%句子所需的信息量。双模态支持被定义为观察到的双模态表现与独立通道模型预测的表现之间的差异。使用非参数检验来评估双模态支持以及延迟文本的影响。

结果

掩蔽文本显著支持了噪声环境下的言语理解;双模态支持的正确率范围为15%至25%。在某些条件下,对于文本被条形图案掩蔽的参与者,观察到了延迟文本的负面影响。

结论

参与者双模态呈现句子的再现能力超过了独立通道模型预测的表现。这表明相对少量的视觉信息能够显著增强噪声环境下的言语理解,这支持了听力受损的参与者使用视觉信息来提高言语理解,即使视觉信息不完整。

相似文献

1
Audiovisual perception of speech in noise and masked written text.噪声环境下语音和掩蔽书面文本的视听感知。
Ear Hear. 2008 Jan;29(1):99-111. doi: 10.1097/AUD.0b013e31815d6d8d.
2
The influence of age, hearing, and working memory on the speech comprehension benefit derived from an automatic speech recognition system.年龄、听力和工作记忆对从自动语音识别系统获得的语音理解增益的影响。
Ear Hear. 2009 Apr;30(2):262-72. doi: 10.1097/AUD.0b013e3181987063.
3
The benefit obtained from visually displayed text from an automatic speech recognizer during listening to speech presented in noise.在收听有噪声干扰的语音时,从自动语音识别器的可视文本显示中获得的益处。
Ear Hear. 2008 Dec;29(6):838-52. doi: 10.1097/AUD.0b013e31818005bd.
4
The development of the text reception threshold test: a visual analogue of the speech reception threshold test.文本接受阈值测试的发展:言语接受阈值测试的视觉模拟。
J Speech Lang Hear Res. 2007 Jun;50(3):576-84. doi: 10.1044/1092-4388(2007/040).
5
Estimates of basilar-membrane nonlinearity effects on masking of tones and speech.基底膜非线性对纯音和言语掩蔽影响的估计。
Ear Hear. 2007 Feb;28(1):2-17. doi: 10.1097/AUD.0b013e3180310212.
6
Measuring cognitive factors in speech comprehension: the value of using the Text Reception Threshold test as a visual equivalent of the SRT test.测量言语理解中的认知因素:使用文本接受阈测试作为 SRT 测试的视觉等效物的价值。
Scand J Psychol. 2009 Oct;50(5):507-15. doi: 10.1111/j.1467-9450.2009.00747.x.
7
The influence of semantically related and unrelated text cues on the intelligibility of sentences in noise.语义相关和不相关的文本提示对噪声中句子可理解性的影响。
Ear Hear. 2011 Nov-Dec;32(6):e16-25. doi: 10.1097/AUD.0b013e318228036a.
8
Development of the Listening in Spatialized Noise-Sentences Test (LISN-S).空间噪声句子听力测试(LISN-S)的开发。
Ear Hear. 2007 Apr;28(2):196-211. doi: 10.1097/AUD.0b013e318031267f.
9
Auditory speech recognition and visual text recognition in younger and older adults: similarities and differences between modalities and the effects of presentation rate.年轻人和老年人的听觉语音识别与视觉文本识别:模态之间的异同及呈现速率的影响
J Speech Lang Hear Res. 2007 Apr;50(2):283-303. doi: 10.1044/1092-4388(2007/021).
10
New measures of masked text recognition in relation to speech-in-noise perception and their associations with age and cognitive abilities.与语音噪声感知相关的掩蔽文本识别的新度量及其与年龄和认知能力的关联。
J Speech Lang Hear Res. 2012 Feb;55(1):194-209. doi: 10.1044/1092-4388(2011/11-0008). Epub 2011 Dec 22.

引用本文的文献

1
The effect of modality onset asynchrony and processing time on the recognition of text-supplemented speech.不同呈现方式起始时间差异和加工时间对补充文本语音识别的影响。
JASA Express Lett. 2023 Feb;3(2):025202. doi: 10.1121/10.0017215.
2
Recognition of Interrupted Speech, Text, and Text-Supplemented Speech by Older Adults: Effect of Interruption Rate.老年人对中断言语、文本和补充文本的言语的识别:中断率的影响。
J Speech Lang Hear Res. 2022 Nov 17;65(11):4404-4416. doi: 10.1044/2022_JSLHR-22-00247. Epub 2022 Oct 14.
3
Integration of partial information for spoken and written sentence recognition by older listeners.
老年听众整合部分信息以识别口语和书面句子。
J Acoust Soc Am. 2016 Jun;139(6):EL240. doi: 10.1121/1.4954634.
4
Integration of Partial Information Within and Across Modalities: Contributions to Spoken and Written Sentence Recognition.跨模态及模态内部分信息整合:对口语和书面句子识别的贡献
J Speech Lang Hear Res. 2015 Dec;58(6):1805-17. doi: 10.1044/2015_JSLHR-H-14-0272.
5
Cognitive spare capacity and speech communication: a narrative overview.认知储备能力与言语交流:叙述性综述
Biomed Res Int. 2014;2014:869726. doi: 10.1155/2014/869726. Epub 2014 May 27.
6
How linguistic closure and verbal working memory relate to speech recognition in noise--a review.语言封闭性和言语工作记忆与噪声中的语音识别如何相关——综述
Trends Amplif. 2013 Jun;17(2):75-93. doi: 10.1177/1084713813495459. Epub 2013 Aug 13.
7
Lipreading, processing speed, and working memory in younger and older adults.唇读、加工速度和工作记忆在年轻和老年成年人中的表现。
J Speech Lang Hear Res. 2009 Dec;52(6):1555-65. doi: 10.1044/1092-4388(2009/08-0137). Epub 2009 Aug 28.