Department of Psychology and Centre for Vision Research, York University, Toronto, ON, Canada.
Proc Natl Acad Sci U S A. 2011 Jul 26;108(30):12551-3. doi: 10.1073/pnas.1100794108. Epub 2011 Jul 11.
Every biological or artificial visual system faces the problem that images are highly ambiguous, in the sense that every image depicts an infinite number of possible 3D arrangements of shapes, surface colors, and light sources. When estimating 3D shape from shading, the human visual system partly resolves this ambiguity by relying on the light-from-above prior, an assumption that light comes from overhead. However, light comes from overhead only on average, and most images contain visual information that contradicts the light-from-above prior, such as shadows indicating oblique lighting. How does the human visual system perceive 3D shape when there are contradictions between what it assumes and what it sees? Here we show that the visual system combines the light-from-above prior with visual lighting cues using an efficient statistical strategy that assigns a weight to the prior and to the cues and finds a maximum-likelihood lighting direction estimate that is a compromise between the two. The prior receives surprisingly little weight and can be overridden by lighting cues that are barely perceptible. Thus, the light-from-above prior plays a much more limited role in shape perception than previously thought, and instead human vision relies heavily on lighting cues to recover 3D shape. These findings also support the notion that the visual system efficiently integrates priors with cues to solve the difficult problem of recovering 3D shape from 2D images.
每个生物或人工视觉系统都面临着图像高度模糊的问题,也就是说,每个图像都描绘了无数种可能的三维形状、表面颜色和光源排列。当从阴影中估计三维形状时,人类视觉系统部分通过依赖于上方光源的先验假设来解决这种模糊性,即假设光源来自上方。然而,光源平均来自上方,并且大多数图像包含与上方光源先验假设相矛盾的视觉信息,例如表示斜向照明的阴影。当假设与所见之间存在矛盾时,人类视觉系统如何感知三维形状?在这里,我们表明,视觉系统使用一种有效的统计策略,将上方光源的先验假设与视觉照明线索相结合,为先验假设和线索分配权重,并找到一个最大似然光照方向估计值,该值在两者之间进行折衷。先验假设假设接收的权重出乎意料地小,并且可以被几乎无法察觉的照明线索所覆盖。因此,上方光源的先验假设在形状感知中所起的作用比以前认为的要小得多,相反,人类视觉系统严重依赖照明线索来恢复三维形状。这些发现还支持了这样一种观点,即视觉系统有效地将先验假设与线索相结合,以解决从二维图像中恢复三维形状的难题。