Department of Psychology, Carleton College, Northfield, MN, United States of America.
Department of Psychological & Brain Sciences, Washington University in St. Louis, St. Louis, MO, United States of America.
PLoS One. 2023 Nov 29;18(11):e0290826. doi: 10.1371/journal.pone.0290826. eCollection 2023.
Among the most robust findings in speech research is that the presence of a talking face improves the intelligibility of spoken language. Talking faces supplement the auditory signal by providing fine phonetic cues based on the placement of the articulators, as well as temporal cues to when speech is occurring. In this study, we varied the amount of information contained in the visual signal, ranging from temporal information alone to a natural talking face. Participants were presented with spoken sentences in energetic or informational masking in four different visual conditions: audio-only, a modulating circle providing temporal cues to salient features of the speech, a digitally rendered point-light display showing lip movement, and a natural talking face. We assessed both sentence identification accuracy and self-reported listening effort. Audiovisual benefit for intelligibility was observed for the natural face in both informational and energetic masking, but the digitally rendered point-light display only provided benefit in energetic masking. Intelligibility for speech accompanied by the modulating circle did not differ from the audio-only conditions in either masker type. Thus, the temporal cues used here were insufficient to improve speech intelligibility in noise, but some types of digital point-light displays may contain enough phonetic detail to produce modest improvements in speech identification in noise.