IEEE Trans Image Process. 2021;30:487-500. doi: 10.1109/TIP.2020.3037525. Epub 2020 Nov 24.
The human visual system (HVS) is a hierarchical system, in which visual signals are processed hierarchically. In this paper, the HVS is modeled as a three-level communication system and visual perception is divided into three stages according to the hierarchical predictive coding theory. Then, a novel just noticeable distortion (JND) estimation scheme is proposed. In visual perception, the input signals are predicted constantly and spontaneously in each hierarchy, and neural response is evoked by the central residue and inhibited by surrounding residues. These two types' residues are regarded as the positive and negative visual incentives which cause positive and negative perception effects, respectively. In neuroscience, the effect of incentive on observer is measured by the surprise of this incentive. Thus, we propose a surprise-based measurement method to measure both perception effects. Specifically, considering the biased competition of visual attention, we define the product of the residue self-information (i.e., surprise) and the competition biases as the perceptual surprise to measure the positive perception effect. As for the negative perception effect, it is measured by the average surprise (i.e., the local Shannon entropy). The JND threshold of each stage is estimated individually by considering both perception effects. The total JND threshold is finally obtained by non-linear superposition of three stage thresholds. Furthermore, the proposed JND estimation scheme is incorporated into the codec of Versatile Video Coding for image compression. Experimental results show that the proposed JND model outperforms the relevant existing ones, and over 16% of bit rate can be reduced without jeopardizing the perceptual quality.
人类视觉系统 (HVS) 是一个分层系统,其中视觉信号是分层处理的。在本文中,我们将 HVS 建模为一个三级通信系统,并根据分层预测编码理论将视觉感知分为三个阶段。然后,提出了一种新的可察觉失真 (JND) 估计方案。在视觉感知中,输入信号在每个层次上不断地、自发地进行预测,而神经响应则由中心残差引起,并被周围残差抑制。这两种残差被视为正、负视觉激励,分别引起正、负感知效应。在神经科学中,激励对观察者的影响通过激励的惊讶来衡量。因此,我们提出了一种基于惊讶的测量方法来测量两种感知效应。具体来说,考虑到视觉注意力的偏置竞争,我们定义残差自信息(即惊讶)和竞争偏差的乘积为感知惊讶,用于测量正感知效应。对于负感知效应,它由平均惊讶(即局部香农熵)来衡量。通过考虑两种感知效应,分别估计每个阶段的 JND 阈值。最后,通过非线性叠加三个阶段的阈值得到总 JND 阈值。此外,所提出的 JND 估计方案被纳入多功能视频编码的编解码器中用于图像压缩。实验结果表明,所提出的 JND 模型优于相关的现有模型,并且在不影响感知质量的情况下,可以将比特率降低 16%以上。