Castellotti Serena, Montagnini Anna, Del Viva Maria Michela
Department of Neurofarba, University of Florence, Florence, Italy.
Institut de Neurosciences de la Timone (UMR 7289), CNRS and Aix-Marseille Université, Marseille, France.
Front Neurosci. 2021 Apr 30;15:645743. doi: 10.3389/fnins.2021.645743. eCollection 2021.
Under fast viewing conditions, the visual system extracts salient and simplified representations of complex visual scenes. Saccadic eye movements optimize such visual analysis through the dynamic sampling of the most informative and salient regions in the scene. However, a general definition of saliency, as well as its role for natural active vision, is still a matter for discussion. Following the general idea that visual saliency may be based on the amount of local information, a recent constrained maximum-entropy model of early vision, applied to natural images, extracts a set of local optimal information-carriers, as candidate salient features. These features proved to be more informative than others in fast vision, when embedded in simplified sketches of natural images. In the present study, for the first time, these features were presented in isolation, to investigate whether they can be visually more salient than other features, even in the absence of any meaningful global arrangement (contour, line, etc.). In four psychophysics experiments, fast discriminability of a compound of features (target) in comparison with a similar compound of features (distractor) was measured as a function of their number and contrast. Results showed that the saliency predictions from the constrained maximum-entropy model are well verified in the data, even when the features are presented in smaller numbers or at lower contrast. In the eye movements experiment, the target and the distractor compounds were presented in the periphery at different angles. Participants were asked to perform a simple choice-saccade task. Results showed that saccades can select informative features spatially interleaved with features even at the shortest latencies. Saccades' choice accuracy and landing position precision improved with SNR. In conclusion, the features predicted by the reference model, turn out to be more salient than others, despite the lack of any clues coming from a global meaningful structure, suggesting that they get preferential treatment during fast image analysis. Also, peripheral fast visual processing of these informative local features is able to guide gaze orientation. We speculate that active vision is efficiently adapted to maximize information in natural visual scenes.
在快速观察条件下,视觉系统会提取复杂视觉场景的显著且简化的表征。眼跳运动通过对场景中最具信息性和显著性的区域进行动态采样来优化这种视觉分析。然而,显著性的一般定义及其在自然主动视觉中的作用仍是一个有待探讨的问题。基于视觉显著性可能基于局部信息量这一总体观点,最近一种应用于自然图像的早期视觉约束最大熵模型提取了一组局部最优信息载体,作为候选显著特征。当嵌入自然图像的简化草图中时,这些特征在快速视觉中被证明比其他特征更具信息性。在本研究中,这些特征首次被单独呈现,以探究即使在没有任何有意义的全局布局(轮廓、线条等)的情况下,它们在视觉上是否比其他特征更显著。在四项心理物理学实验中,测量了一组特征的复合物(目标)与类似的一组特征的复合物(干扰物)相比的快速可辨别性,该可辨别性是其数量和对比度的函数。结果表明,即使这些特征以较少数量或较低对比度呈现,来自约束最大熵模型的显著性预测在数据中也得到了很好的验证。在眼动实验中,目标和干扰物复合物以不同角度呈现在外周。要求参与者执行简单的选择眼跳任务。结果表明,眼跳甚至能在最短潜伏期时在空间上选择与其他特征交错的信息性特征。眼跳的选择准确性和着陆位置精度随信噪比提高。总之,尽管缺乏来自全局有意义结构的任何线索,但参考模型预测的这些特征比其他特征更显著,这表明它们在快速图像分析过程中得到了优先处理。此外,这些信息性局部特征的外周快速视觉处理能够引导注视方向。我们推测主动视觉能有效地进行适应性调整,以在自然视觉场景中最大化信息。