Chandler Damon M, Hemami Sheila S
School of Electrical and Computer engineering, Oklahoma State University, Stillwater, OK 74078, USA.
IEEE Trans Image Process. 2007 Sep;16(9):2284-98. doi: 10.1109/tip.2007.901820.
This paper presents an efficient metric for quantifying the visual fidelity of natural images based on near-threshold and suprathreshold properties of human vision. The proposed metric, the visual signal-to-noise ratio (VSNR), operates via a two-stage approach. In the first stage, contrast thresholds for detection of distortions in the presence of natural images are computed via wavelet-based models of visual masking and visual summation in order to determine whether the distortions in the distorted image are visible. If the distortions are below the threshold of detection, the distorted image is deemed to be of perfect visual fidelity (VSNR = infinity) and no further analysis is required. If the distortions are suprathreshold, a second stage is applied which operates based on the low-level visual property of perceived contrast, and the mid-level visual property of global precedence. These two properties are modeled as Euclidean distances in distortion-contrast space of a multiscale wavelet decomposition, and VSNR is computed based on a simple linear sum of these distances. The proposed VSNR metric is generally competitive with current metrics of visual fidelity; it is efficient both in terms of its low computational complexity and in terms of its low memory requirements; and it operates based on physical luminances and visual angle (rather than on digital pixel values and pixel-based dimensions) to accommodate different viewing conditions.
本文提出了一种基于人类视觉的近阈值和超阈值特性来量化自然图像视觉保真度的有效度量标准。所提出的度量标准,即视觉信噪比(VSNR),通过两阶段方法运行。在第一阶段,通过基于小波的视觉掩蔽和视觉总和模型计算在自然图像存在下检测失真的对比度阈值,以确定失真图像中的失真是否可见。如果失真低于检测阈值,则失真图像被视为具有完美的视觉保真度(VSNR = 无穷大),无需进一步分析。如果失真是超阈值的,则应用第二阶段,该阶段基于感知对比度的低级视觉特性和全局优先级的中级视觉特性运行。这两个特性在多尺度小波分解的失真-对比度空间中被建模为欧几里得距离,并且VSNR基于这些距离的简单线性和来计算。所提出的VSNR度量标准通常与当前的视觉保真度度量标准具有竞争力;它在计算复杂度低和内存需求低方面都很高效;并且它基于物理亮度和视角(而不是基于数字像素值和基于像素的尺寸)运行,以适应不同的观看条件。