Department of Psychology, Yale University.
J Exp Psychol Gen. 2024 Oct;153(10):2441-2453. doi: 10.1037/xge0001636.
During visual processing, input that is continuous in space and time is segmented, resulting in the representation of discrete tokens (objects or events). And there has been a great deal of research about how object representations are generalized into types, as when we see an object as an instance of a broader category (e.g., an animal or plant). There has been much less attention, however, to the possibility that vision represents dynamic information in terms of a small number of primitive event types (such as twisting or bouncing). (In models that posit a "language of vision," these would be the foundational visual verbs.) Here we ask whether such event types are extracted spontaneously during visual perception, even when entirely task irrelevant during passive viewing. We exploited the phenomenon of categorical perception, wherein differences are more readily noticed when they are represented in terms of different underlying categories. Observers were better at detecting changes to images or short videos when the changes involved switches in the underlying event type, even when the changes that maintained the same event type were objectively larger (in terms of both brute image metrics and higher-level feature change). We observed this categorical "cross-event-type" advantage for visual working memory for twisting versus rotating, scooping versus pouring, and rolling versus bouncing. Moreover, additional control experiments confirmed that such effects could not be explained by appeal to lower-level non-categorical stimulus differences. This spontaneous perception of "visual verbs" might promote both generalization and prediction about how events are likely to unfold. (PsycInfo Database Record (c) 2024 APA, all rights reserved).