VanRullen Rufin
California Institute of Technology, CNS Program, Division of Biology, MC 139-74, Pasadena, CA 91125, USA.
J Physiol Paris. 2003 Mar-May;97(2-3):365-77. doi: 10.1016/j.jphysparis.2003.09.010.
Visual saliency is a fundamental yet hard to define property of objects or locations in the visual world. In a context where objects and their representations compete to dominate our perception, saliency can be thought of as the "juice" that makes objects win the race. It is often assumed that saliency is extracted and represented in an explicit saliency map, which serves to determine the location of spatial attention at any given time. It is then by drawing attention to a salient object that it can be recognized or categorized. I argue against this classical view that visual "bottom-up" saliency automatically recruits the attentional system prior to object recognition. A number of visual processing tasks are clearly performed too fast for such a costly strategy to be employed. Rather, visual attention could simply act by biasing a saliency-based object recognition system. Under natural conditions of stimulation, saliency can be represented implicitly throughout the ventral visual pathway, independent of any explicit saliency map. At any given level, the most activated cells of the neural population simply represent the most salient locations. The notion of saliency itself grows increasingly complex throughout the system, mostly based on luminance contrast until information reaches visual cortex, gradually incorporating information about features such as orientation or color in primary visual cortex and early extrastriate areas, and finally the identity and behavioral relevance of objects in temporal cortex and beyond. Under these conditions the object that dominates perception, i.e. the object yielding the strongest (or the first) selective neural response, is by definition the one whose features are most "salient"--without the need for any external saliency map. In addition, I suggest that such an implicit representation of saliency can be best encoded in the relative times of the first spikes fired in a given neuronal population. In accordance with our subjective experience that saliency and attention do not modify the appearance of objects, the feed-forward propagation of this first spike wave could serve to trigger saliency-based object recognition outside the realm of awareness, while conscious perceptions could be mediated by the remaining discharges of longer neuronal spike trains.
视觉显著性是视觉世界中物体或位置的一种基本但难以定义的属性。在一个物体及其表征相互竞争以主导我们感知的情境中,显著性可以被视为使物体赢得这场竞争的“动力”。人们通常认为,显著性是在一个明确的显著性图中被提取和表征的,该图用于确定在任何给定时间空间注意力的位置。然后,通过将注意力吸引到一个显著物体上,它才能被识别或分类。我反对这种传统观点,即视觉“自下而上”的显著性在物体识别之前会自动激活注意力系统。许多视觉处理任务执行得太快,以至于无法采用这种代价高昂的策略。相反,视觉注意力可能只是通过对基于显著性的物体识别系统施加偏差来起作用。在自然刺激条件下,显著性可以在整个腹侧视觉通路中被隐式表征,而无需任何明确的显著性图。在任何给定水平上,神经群体中最活跃的细胞简单地代表最显著的位置。显著性本身的概念在整个系统中变得越来越复杂,在信息到达视觉皮层之前主要基于亮度对比度,逐渐在初级视觉皮层和早期纹外区域纳入有关诸如方向或颜色等特征的信息,最后在颞叶皮层及其他区域纳入物体的身份和行为相关性信息。在这些条件下,主导感知的物体,即产生最强(或第一个)选择性神经反应的物体,根据定义就是其特征最“显著”的物体——无需任何外部显著性图。此外,我认为这种显著性的隐式表征可以最好地编码在给定神经元群体中首次发放脉冲的相对时间中。与我们的主观体验一致,即显著性和注意力不会改变物体的外观,这种首次脉冲波的前馈传播可以在意识范围之外触发基于显著性的物体识别,而有意识的感知可能由神经元较长脉冲序列的其余放电介导。