Marlow Phillip J, Anderson Barton L
School of Psychology, University of Sydney, Sydney, NSW, Australia.
J Vis. 2013 Dec 2;13(14):2. doi: 10.1167/13.14.2.
Image structure is generated by distinct physical sources that include 3-D surface geometry, surface reflectance, and the light field. A fundamental problem in midlevel vision involves understanding how, and how well, the visual system separates images into their underlying sources. A large body of recent work has shown that the perception of gloss exhibits significant dependencies on both 3-D surface geometry and the light field in which the surface is embedded. We recently proposed that these dependencies arise from the visual system relying on a restricted set of dimensions of specular image structure that are heuristically used to estimate surface gloss (Marlow, Kim, & Anderson, 2012). Here, we report a series of studies that manipulate surface geometry and the structure of the light field to prospectively test whether these image cues predict perceived gloss for a broad range of surface geometries and light fields. We identify generative constraints on each of our proposed cues that allow us to predict which cue will vary most for a given surface geometry, reflectance function, and light field. Our psychophysical data reveal that our generative manipulations succeed in promoting the significance of the intended cue, which in turn increases their predictive value in observers' judgments of gloss. Our results suggest that the perception of gloss in static, monocular images relies on a heuristic weighting of cues to specular image structure, and offer a unified explanation of why surface geometry and the light field modulate perceived gloss.
图像结构由不同的物理源生成,这些物理源包括三维表面几何形状、表面反射率和光场。中级视觉中的一个基本问题涉及理解视觉系统如何以及在多大程度上能够将图像分离为其潜在的源。最近的大量研究表明,光泽感对三维表面几何形状以及表面所处的光场都表现出显著的依赖性。我们最近提出,这些依赖性源于视觉系统依赖于镜面反射图像结构的一组受限维度,这些维度被启发式地用于估计表面光泽(马洛、金和安德森,2012年)。在此,我们报告一系列研究,这些研究操纵表面几何形状和光场结构,以前瞻性地测试这些图像线索是否能预测广泛的表面几何形状和光场下的感知光泽。我们确定了对我们提出的每个线索的生成性约束,这使我们能够预测对于给定的表面几何形状、反射率函数和光场,哪个线索变化最大。我们的心理物理学数据表明,我们的生成性操纵成功地提升了预期线索的显著性,这反过来又增加了它们在观察者对光泽的判断中的预测价值。我们的结果表明,静态单眼图像中的光泽感依赖于对镜面反射图像结构线索的启发式加权,并为表面几何形状和光场如何调节感知光泽提供了一个统一的解释。