Suppr超能文献

视觉敏感度、模糊度与自然场景振幅谱中的变异性来源

Visual sensitivity, blur and the sources of variability in the amplitude spectra of natural scenes.

作者信息

Field D J, Brady N

机构信息

Department of Psychology, Cornell University, Ithaca, NY 14853, USA.

出版信息

Vision Res. 1997 Dec;37(23):3367-83. doi: 10.1016/s0042-6989(97)00181-8.

Abstract

A number of researchers have suggested that in order to understand the response properties of cells in the visual pathway, we must consider the statistical structure of the natural environment. In this paper, we focus on one aspect of that structure, namely, the correlational structure which is described by the amplitude or power spectra of natural scenes. We propose that the principle insight one gains from considering the image spectra is in understanding the relative sensitivity of cells tuned to different spatial frequencies. This study employs a model in which the peak sensitivity is constant as a function of frequency with linear bandwith increasing (i.e., approximately constant in octaves). In such a model, the "response magnitude" (i.e., vector length) of cells increases as a function of their optimal (or central) spatial frequency out to about 20 cyc/deg. The result is a code in which the response to natural scenes, whose amplitude spectra typically fall as 1/f, is roughly constant out to 20 cyc/deg. An important consideration in evaluating this model of sensitivity is the fact that natural scenes show considerable variability in their amplitude spectra, with individual scenes showing falloffs which are often steeper or shallower than 1/f. Using a new measure of image structure (the "rectified contrast spectrum" or "RCS") on a set of calibrated natural images, it is shown that a large part of the variability in the spectra is due to differences in the sparseness of local structure at different scales. That is, an image which is "in focus" will have structure (e.g., edges) which has roughly the same magnitude across scale. That is, the loss of high frequency energy in some images is due to the reduction of the number of regions that contain structure rather than the amplitude of that structure. An "in focus" image will have structure (e.g., edges) across scale that have roughly equal magnitude but may vary in the area covered by structure. The slope of the RCS was found to provide a reasonable prediction of physical blur across a variety of scenes in spite of the variability in their amplitude spectra. It was also found to produce a good prediction of perceived blur as judged by human subjects.

摘要

许多研究人员认为,为了理解视觉通路中细胞的反应特性,我们必须考虑自然环境的统计结构。在本文中,我们关注该结构的一个方面,即由自然场景的幅度谱或功率谱所描述的相关结构。我们提出,从考虑图像谱中获得的主要见解在于理解调谐到不同空间频率的细胞的相对敏感性。本研究采用了一种模型,其中峰值敏感性作为频率的函数是恒定的,线性带宽增加(即,在倍频程中大致恒定)。在这样的模型中,细胞的“反应幅度”(即向量长度)随着其最佳(或中心)空间频率的函数增加,直至约20周/度。结果是一种编码,其中对自然场景的反应,其幅度谱通常以1/f下降,在20周/度范围内大致恒定。评估这种敏感性模型时的一个重要考虑因素是,自然场景在其幅度谱中表现出相当大的变异性,个别场景的下降往往比1/f更陡或更平缓。在一组校准的自然图像上使用一种新的图像结构测量方法(“整流对比度谱”或“RCS”),结果表明,谱中的大部分变异性是由于不同尺度上局部结构的稀疏性差异所致。也就是说,一幅“聚焦”的图像将具有在整个尺度上幅度大致相同的结构(例如边缘)。也就是说,某些图像中高频能量的损失是由于包含结构的区域数量减少,而不是该结构的幅度减小。一幅“聚焦”的图像将在整个尺度上具有幅度大致相等但结构覆盖面积可能不同的结构(例如边缘)。尽管自然场景的幅度谱存在变异性,但发现RCS的斜率能够对各种场景中的物理模糊提供合理的预测。研究还发现,它能够很好地预测人类受试者判断的感知模糊。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验