Suppr超能文献

面孔和声音之间的空间对准能提高对视听语音的选择性注意。

Spatial alignment between faces and voices improves selective attention to audio-visual speech.

机构信息

Speech and Hearing Bioscience and Technology Program, Harvard University, 243 Charles Street, Boston, Massachusetts 02114, USA.

Department of Biomedical Engineering, University of Rochester, 430 Elmwood Avenue, Rochester, New York 14620, USA.

出版信息

J Acoust Soc Am. 2021 Oct;150(4):3085. doi: 10.1121/10.0006415.

Abstract

The ability to see a talker's face improves speech intelligibility in noise, provided that the auditory and visual speech signals are approximately aligned in time. However, the importance of spatial alignment between corresponding faces and voices remains unresolved, particularly in multi-talker environments. In a series of online experiments, we investigated this using a task that required participants to selectively attend a target talker in noise while ignoring a distractor talker. In experiment 1, we found improved task performance when the talkers' faces were visible, but only when corresponding faces and voices were presented in the same hemifield (spatially aligned). In experiment 2, we tested for possible influences of eye position on this result. In auditory-only conditions, directing gaze toward the distractor voice reduced performance, but this effect could not fully explain the cost of audio-visual (AV) spatial misalignment. Lowering the signal-to-noise ratio (SNR) of the speech from +4 to -4 dB increased the magnitude of the AV spatial alignment effect (experiment 3), but accurate closed-set lipreading caused a floor effect that influenced results at lower SNRs (experiment 4). Taken together, these results demonstrate that spatial alignment between faces and voices contributes to the ability to selectively attend AV speech.

摘要

观看说话者的面部能够提高噪声环境下的言语可懂度,前提是听觉和视觉言语信号在时间上大致对齐。然而,对应面部和声音之间的空间对准的重要性仍未得到解决,特别是在多说话者环境中。在一系列在线实验中,我们使用需要参与者在噪声中选择性地关注目标说话者而忽略干扰说话者的任务来研究这个问题。在实验 1 中,我们发现当说话者的面部可见时,任务表现会有所提高,但只有当相应的面部和声音在同一半视野(空间对齐)中呈现时才会提高。在实验 2 中,我们测试了眼睛位置对此结果可能产生的影响。在仅听觉条件下,将目光转向干扰声音会降低表现,但这种效果无法完全解释视听(AV)空间失配的代价。将语音的信噪比(SNR)从+4 分贝降低到-4 分贝增加了 AV 空间对准效果的幅度(实验 3),但准确的闭集唇读导致了一个地板效应,这会影响较低 SNR 下的结果(实验 4)。综上所述,这些结果表明,面部和声音之间的空间对准有助于选择性地关注视听言语的能力。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验