Suppr超能文献

听觉场景复杂性会影响视听多说话人虚拟环境中言语感知过程中的运动行为。

Acoustic scene complexity affects motion behavior during speech perception in audio-visual multi-talker virtual environments.

机构信息

Hearing Systems Section, Department of Health Technology, Technical University of Denmark, 2800, Kgs. Lyngby, Denmark.

出版信息

Sci Rep. 2024 Aug 16;14(1):19028. doi: 10.1038/s41598-024-70026-0.

Abstract

In real-world listening situations, individuals typically utilize head and eye movements to receive and enhance sensory information while exploring acoustic scenes. However, the specific patterns of such movements have not yet been fully characterized. Here, we studied how movement behavior is influenced by scene complexity, varied in terms of reverberation and the number of concurrent talkers. Thirteen normal-hearing participants engaged in a speech comprehension and localization task, requiring them to indicate the spatial location of a spoken story in the presence of other stories in virtual audio-visual scenes. We observed delayed initial head movements when more simultaneous talkers were present in the scene. Both reverberation and a higher number of talkers extended the search period, increased the number of fixated source locations, and resulted in more gaze jumps. The period preceding the participants' responses was prolonged when more concurrent talkers were present, and listeners continued to move their eyes in the proximity of the target talker. In scenes with more reverberation, the final head position when making the decision was farther away from the target. These findings demonstrate that the complexity of the acoustic scene influences listener behavior during speech comprehension and localization in audio-visual scenes.

摘要

在现实生活中的听力环境中,个体通常会利用头部和眼部运动来接收和增强对声音场景的感知信息。然而,这些运动的具体模式尚未得到充分的描述。在这里,我们研究了运动行为是如何受到场景复杂性的影响的,这些场景在混响和同时说话者数量方面存在差异。13 名听力正常的参与者参与了一个语音理解和定位任务,要求他们在虚拟视听场景中听到其他故事的同时,指出一个被说出的故事的空间位置。我们观察到,当场景中有更多的同时说话者时,初始头部运动会延迟。混响和更多的说话者都会延长搜索期,增加注视源位置的数量,并导致更多的眼球跳动。当有更多的同时说话者存在时,参与者做出反应之前的时间会延长,并且听众会继续在目标说话者的附近移动眼睛。在混响较大的场景中,做出决策时的最终头部位置会离目标更远。这些发现表明,听觉场景的复杂性会影响视听场景中语音理解和定位时的听众行为。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7831/11329770/da2fc72b4d48/41598_2024_70026_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验