时线索、点光显示和面部对言语识别和听力努力的影响。

The effects of temporal cues, point-light displays, and faces on speech identification and listening effort.

机构信息

Department of Psychology, Carleton College, Northfield, MN, United States of America.

Department of Psychological & Brain Sciences, Washington University in St. Louis, St. Louis, MO, United States of America.

出版信息

PLoS One. 2023 Nov 29;18(11):e0290826. doi: 10.1371/journal.pone.0290826. eCollection 2023.

DOI:10.1371/journal.pone.0290826

PMID:38019831

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10686424/

Abstract

Among the most robust findings in speech research is that the presence of a talking face improves the intelligibility of spoken language. Talking faces supplement the auditory signal by providing fine phonetic cues based on the placement of the articulators, as well as temporal cues to when speech is occurring. In this study, we varied the amount of information contained in the visual signal, ranging from temporal information alone to a natural talking face. Participants were presented with spoken sentences in energetic or informational masking in four different visual conditions: audio-only, a modulating circle providing temporal cues to salient features of the speech, a digitally rendered point-light display showing lip movement, and a natural talking face. We assessed both sentence identification accuracy and self-reported listening effort. Audiovisual benefit for intelligibility was observed for the natural face in both informational and energetic masking, but the digitally rendered point-light display only provided benefit in energetic masking. Intelligibility for speech accompanied by the modulating circle did not differ from the audio-only conditions in either masker type. Thus, the temporal cues used here were insufficient to improve speech intelligibility in noise, but some types of digital point-light displays may contain enough phonetic detail to produce modest improvements in speech identification in noise.

摘要

在言语研究中，最有力的发现之一是，有说话人脸的存在可以提高口语的可理解性。说话人脸通过提供基于发音器官位置的精细语音线索以及讲话发生的时间线索来补充听觉信号。在这项研究中，我们改变了视觉信号中包含的信息量，从仅包含时间信息到自然说话人脸。参与者在四种不同的视觉条件下接受了有力或信息掩蔽的口语句子：仅音频、提供讲话显著特征的时变线索的调制圆、显示唇动的数字呈现的点光显示以及自然说话人脸。我们评估了句子识别准确性和自我报告的听力努力程度。在信息掩蔽和能量掩蔽下，自然人脸都观察到了可懂度的视听增益，但数字呈现的点光显示仅在能量掩蔽下提供增益。调制圆伴随的语音在两种掩蔽类型下的可懂度都与仅音频条件没有差异。因此，这里使用的时间线索不足以提高噪声中的语音可懂度，但某些类型的数字点光显示可能包含足够的语音细节，从而在噪声中适度提高语音识别。

相似文献

The effects of temporal cues, point-light displays, and faces on speech identification and listening effort.

PLoS One. 2023 Nov 29;18(11):e0290826. doi: 10.1371/journal.pone.0290826. eCollection 2023.

Energetic and Informational Components of Speech-on-Speech Masking in Binaural Speech Intelligibility and Perceived Listening Effort.

Trends Hear. 2019 Jan-Dec;23:2331216519854597. doi: 10.1177/2331216519854597.

Contribution of binaural masking release to improved speech intelligibility for different masker types.

Eur J Neurosci. 2020 Mar;51(5):1339-1352. doi: 10.1111/ejn.13980. Epub 2018 Jul 26.

Intelligibility of whispered speech in stationary and modulated noise maskers.

J Acoust Soc Am. 2012 Oct;132(4):2514-23. doi: 10.1121/1.4747614.

The contribution of visual information to the perception of speech in noise with and without informative temporal fine structure.

Hear Res. 2016 Jun;336:17-28. doi: 10.1016/j.heares.2016.04.002. Epub 2016 Apr 13.

The effects of working memory capacity and semantic cues on the intelligibility of speech in noise.

J Acoust Soc Am. 2013 Sep;134(3):2225-34. doi: 10.1121/1.4817926.

Masking release with changing fundamental frequency: Electric acoustic stimulation resembles normal hearing subjects.

Hear Res. 2017 Jul;350:226-234. doi: 10.1016/j.heares.2017.05.004. Epub 2017 May 11.

Effect of priming on energetic and informational masking in a same-different task.

Ear Hear. 2012 Jan-Feb;33(1):124-33. doi: 10.1097/AUD.0b013e31822b5bee.

Talking points: A modulating circle reduces listening effort without improving speech recognition.

Psychon Bull Rev. 2019 Feb;26(1):291-297. doi: 10.3758/s13423-018-1489-7.

Pupil dilation uncovers extra listening effort in the presence of a single-talker masker.

Ear Hear. 2012 Mar-Apr;33(2):291-300. doi: 10.1097/AUD.0b013e3182310019.

本文引用的文献

Independent mechanisms of temporal and linguistic cue correspondence benefiting audiovisual speech processing.

Atten Percept Psychophys. 2022 Aug;84(6):2016-2026. doi: 10.3758/s13414-022-02440-3. Epub 2022 Feb 24.

Face mask type affects audiovisual speech intelligibility and subjective listening effort in young and older adults.

Cogn Res Princ Implic. 2021 Jul 18;6(1):49. doi: 10.1186/s41235-021-00314-0.

Effects of Visual Speech Envelope on Audiovisual Speech Perception in Multitalker Listening Environments.

J Speech Lang Hear Res. 2021 Jul 16;64(7):2845-2853. doi: 10.1044/2021_JSLHR-20-00688. Epub 2021 Jun 8.

Visual analog of the acoustic amplitude envelope benefits speech perception in noise.

J Acoust Soc Am. 2020 Mar;147(3):EL246. doi: 10.1121/10.0000737.

Talking Points: A Modulating Circle Increases Listening Effort Without Improving Speech Recognition in Young Adults.

Psychon Bull Rev. 2020 Jun;27(3):536-543. doi: 10.3758/s13423-020-01713-y.

About Face: Seeing the Talker Improves Spoken Word Recognition but Increases Listening Effort.

J Cogn. 2019 Nov 22;2(1):44. doi: 10.5334/joc.89.

Measuring Listening Effort: Convergent Validity, Sensitivity, and Links With Cognitive and Personality Measures.

J Speech Lang Hear Res. 2018 Jun 19;61(6):1463-1486. doi: 10.1044/2018_JSLHR-H-17-0257.

Headphone screening to facilitate web-based auditory experiments.

Atten Percept Psychophys. 2017 Oct;79(7):2064-2072. doi: 10.3758/s13414-017-1361-2.

Audiovisual sentence recognition not predicted by susceptibility to the McGurk effect.

Atten Percept Psychophys. 2017 Feb;79(2):396-403. doi: 10.3758/s13414-016-1238-9.

Hearing Impairment and Cognitive Energy: The Framework for Understanding Effortful Listening (FUEL).

Ear Hear. 2016 Jul-Aug;37 Suppl 1:5S-27S. doi: 10.1097/AUD.0000000000000312.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

时线索、点光显示和面部对言语识别和听力努力的影响。

The effects of temporal cues, point-light displays, and faces on speech identification and listening effort.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献