Rosenholtz Ruth, Yu Dian, Keshvari Shaiyan
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
J Vis. 2019 Jul 1;19(7):15. doi: 10.1167/19.7.15.
A set of phenomena known as crowding reveal peripheral vision's vulnerability in the face of clutter. Crowding is important both because of its ubiquity, making it relevant for many real-world tasks and stimuli, and because of the window it provides onto mechanisms of visual processing. Here we focus on models of the underlying mechanisms. This review centers on a popular class of models known as pooling models, as well as the phenomenology that appears to challenge a pooling account. Using a candidate high-dimensional pooling model, we gain intuitions about whether a pooling model suffices and reexamine the logic behind the pooling challenges. We show that pooling mechanisms can yield substitution phenomena and therefore predict better performance judging the properties of a set versus a particular item. Pooling models can also exhibit some similarity effects without requiring mechanisms that pool at multiple levels of processing, and without constraining pooling to a particular perceptual group. Moreover, we argue that other similarity effects may in part be due to noncrowding influences like cuing. Unlike low-dimensional straw-man pooling models, high-dimensional pooling preserves rich information about the stimulus, which may be sufficient to support high-level processing. To gain insights into the implications for pooling mechanisms, one needs a candidate high-dimensional pooling model and cannot rely on intuitions from low-dimensional models. Furthermore, to uncover the mechanisms of crowding, experiments need to separate encoding from decision effects. While future work must quantitatively examine all of the challenges to a high-dimensional pooling account, insights from a candidate model allow us to conclude that a high-dimensional pooling mechanism remains viable as a model of the loss of information leading to crowding.
一组被称为拥挤效应的现象揭示了周边视觉在面对杂乱信息时的脆弱性。拥挤效应之所以重要,一方面是因为它无处不在,与许多现实世界的任务和刺激相关;另一方面是因为它为视觉处理机制提供了一个窗口。在这里,我们关注其潜在机制的模型。这篇综述聚焦于一类流行的模型,即合并模型,以及那些似乎对合并模型提出挑战的现象学。通过使用一个候选的高维合并模型,我们对合并模型是否足够有了直观认识,并重新审视了合并挑战背后的逻辑。我们表明,合并机制可以产生替代现象,因此在判断一组物体的属性与单个物体的属性时,预测表现会更好。合并模型还可以展现出一些相似性效应,而无需在多个处理层面进行合并的机制,也无需将合并限制在特定的感知组。此外,我们认为其他相似性效应可能部分归因于诸如线索提示等非拥挤效应的影响。与低维的简易合并模型不同,高维合并保留了关于刺激的丰富信息,这可能足以支持高级处理。为了深入了解合并机制的影响,需要一个候选的高维合并模型,而不能依赖低维模型的直觉。此外,为了揭示拥挤效应的机制,实验需要将编码效应与决策效应区分开来。虽然未来的工作必须定量研究对高维合并模型的所有挑战,但候选模型的见解使我们能够得出结论,高维合并机制作为导致拥挤效应的信息丢失模型仍然可行。