Kvasova Daria, Coll Llucia, Stewart Travis, Soto-Faraco Salvador
Center for Brain and Cognition, Department of Communication and Information Technologies, Universitat Pompeu Fabra, Carrer de Ramón Trias i Fargas 25-27, Barcelona, 08005, Spain.
Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain.
Psychol Res. 2024 Oct;88(7):2138-2148. doi: 10.1007/s00426-024-02018-8. Epub 2024 Aug 6.
In real-world scenes, the different objects and events are often interconnected within a rich web of semantic relationships. These semantic links help parse information efficiently and make sense of the sensory environment. It has been shown that, during goal-directed search, hearing the characteristic sound of an everyday object helps find the affiliated objects in artificial visual search arrays as well as in naturalistic, real-life video clips. However, whether crossmodal semantic congruence also triggers orienting during spontaneous, non-goal-directed observation is unknown. Here, we investigated this question by addressing whether crossmodal semantic congruence can attract spontaneous, overt visual attention when viewing naturalistic, dynamic scenes. We used eye-tracking whilst participants (N = 45) watched video clips presented alongside sounds of varying semantic relatedness to objects present within the scene. We found that characteristic sounds increased the probability of looking at, the number of fixations to, and the total dwell time on semantically corresponding visual objects, in comparison to when the same scenes were presented with semantically neutral sounds or with background noise only. Interestingly, hearing object sounds that did not match any object in the scene led to increased visual exploration. These results suggest that crossmodal semantic information has an impact on spontaneous gaze on realistic scenes, and therefore on how information is sampled. Our findings extend beyond known effects of object-based crossmodal interactions with simple stimulus arrays and shed new light on the role that audio-visual semantic relationships play in the perception of everyday-life scenarios.