Corney David, Lotto R Beau
UCL Institute of Ophthalmology, University College London, London, United Kingdom.
PLoS Comput Biol. 2007 Sep;3(9):1790-800. doi: 10.1371/journal.pcbi.0030180.
Lightness illusions are fundamental to human perception, and yet why we see them is still the focus of much research. Here we address the question by modelling not human physiology or perception directly as is typically the case but our natural visual world and the need for robust behaviour. Artificial neural networks were trained to predict the reflectance of surfaces in a synthetic ecology consisting of 3-D "dead-leaves" scenes under non-uniform illumination. The networks learned to solve this task accurately and robustly given only ambiguous sense data. In addition--and as a direct consequence of their experience--the networks also made systematic "errors" in their behaviour commensurate with human illusions, which includes brightness contrast and assimilation--although assimilation (specifically White's illusion) only emerged when the virtual ecology included 3-D, as opposed to 2-D scenes. Subtle variations in these illusions, also found in human perception, were observed, such as the asymmetry of brightness contrast. These data suggest that "illusions" arise in humans because (i) natural stimuli are ambiguous, and (ii) this ambiguity is resolved empirically by encoding the statistical relationship between images and scenes in past visual experience. Since resolving stimulus ambiguity is a challenge faced by all visual systems, a corollary of these findings is that human illusions must be experienced by all visual animals regardless of their particular neural machinery. The data also provide a more formal definition of illusion: the condition in which the true source of a stimulus differs from what is its most likely (and thus perceived) source. As such, illusions are not fundamentally different from non-illusory percepts, all being direct manifestations of the statistical relationship between images and scenes.
明度错觉是人类感知的基础,但我们为何会看到这些错觉仍是众多研究的焦点。在此,我们通过对自然视觉世界以及稳健行为的需求进行建模来解决这个问题,而不是像通常那样直接对人类生理或感知进行建模。人工神经网络经过训练,用于预测在由非均匀光照下的三维“枯叶”场景组成的合成生态环境中表面的反射率。这些网络仅根据模糊的感官数据就能准确且稳健地学会解决此任务。此外,作为其经验的直接结果,这些网络在行为上也出现了与人类错觉相符的系统性“错误”,包括亮度对比和同化——尽管同化(特别是怀特错觉)仅在虚拟生态环境包含三维场景而非二维场景时才会出现。我们还观察到了这些错觉中的细微变化,这些变化在人类感知中也能找到,比如亮度对比的不对称性。这些数据表明,人类产生“错觉”的原因在于:(i)自然刺激是模糊的;(ii)这种模糊性通过在过去的视觉经验中对图像与场景之间的统计关系进行编码而凭经验得到解决。由于解决刺激的模糊性是所有视觉系统都面临的挑战,这些发现的一个推论是,所有视觉动物无论其特定的神经机制如何,都必然会经历人类错觉。这些数据还为错觉提供了一个更正式的定义:刺激的真正来源与其最有可能(因而被感知到)的来源不同的情况。因此,错觉与非错觉感知在根本上并无不同,它们都是图像与场景之间统计关系的直接表现。