Center for Mind and Brain, University of California, Davis, United States of America.
Center for Mind and Brain, University of California, Davis, United States of America; Department of Psychology, University of California, Davis, United States of America.
Cognition. 2022 Dec;229:105231. doi: 10.1016/j.cognition.2022.105231. Epub 2022 Jul 28.
Semantic guidance theories propose that attention in real-world scenes is strongly associated with semantically informative scene regions. That is, we look where there are recognizable and informative objects that help us make sense of our visual environment. In contrast, image guidance theories propose that local differences in semantically uninterpreted image features such as luminance, color, and edge orientation primarily determine where we look in scenes. While it is clear that both semantic guidance and image guidance play a role in where we look in scenes, the degree of their relative contributions and how they interact with each other remains poorly understood. In the current study, we presented real-world scenes in upright and inverted orientations and used general linear mixed effects models to understand how semantic guidance, image guidance, and observer center bias were associated with fixation location and fixation duration. We observed distinct patterns of change under inversion. Semantic guidance was severely disrupted by scene inversion, while image guidance was mildly impaired and observer center bias was enhanced. In addition, we found that fixation durations for semantically rich regions decreased when viewing inverted scenes relative to upright scene viewing, while fixation durations for image salience and center bias were unaffected by inversion. Together these results provide important new constraints on theories and computational models of attention in real-world scenes.
语义引导理论认为,现实场景中的注意力与具有语义信息的场景区域密切相关。也就是说,我们会看向那些有可识别和提供信息的物体的地方,这些物体帮助我们理解我们的视觉环境。相比之下,图像引导理论则提出,在语义上未解释的图像特征(如亮度、颜色和边缘方向)的局部差异主要决定了我们在场景中看哪里。虽然很明显,语义引导和图像引导都在我们观察场景的位置上发挥作用,但它们的相对贡献程度以及它们如何相互作用仍然知之甚少。在当前的研究中,我们呈现了直立和倒置的现实场景,并使用广义线性混合效应模型来了解语义引导、图像引导和观察者中心偏差与注视位置和注视持续时间的关系。我们观察到在反转时出现了明显的变化模式。语义引导在场景反转时受到严重干扰,而图像引导受到轻微影响,观察者中心偏差增强。此外,我们发现相对于观看直立场景,观看倒置场景时语义丰富区域的注视持续时间减少,而图像显著性和中心偏差的注视持续时间不受反转影响。这些结果共同为现实场景中注意力的理论和计算模型提供了重要的新约束。