Department of Electrical and Computer Engineering, University of California Santa Barbara, Santa Barbara, CA 93106, USA.
IEEE Trans Image Process. 2013 Mar;22(3):1175-85. doi: 10.1109/TIP.2012.2227773. Epub 2012 Nov 16.
Current video coders employ predictive coding with motion compensation to exploit temporal redundancies in the signal. In particular, blocks along a motion trajectory are modeled as an auto-regressive (AR) process, and it is generally assumed that the prediction errors are temporally independent and approximate the innovations of this process. Thus, zero-delay encoding and decoding is considered efficient. This paper is premised on the largely ignored fact that these prediction errors are, in fact, temporally dependent due to quantization effects in the prediction loop. It presents an estimation-theoretic delayed decoding scheme, which exploits information from future frames to improve the reconstruction quality of the current frame. In contrast to the standard decoder that reproduces every block instantaneously once the corresponding quantization indices of residues are available, the proposed delayed decoder efficiently combines all accessible (including any future) information in an appropriately derived probability density function, to obtain the optimal delayed reconstruction per transform coefficient. Experiments demonstrate significant gains over the standard decoder. Requisite information about the source AR model is estimated in a spatio-temporally adaptive manner from a bit-stream conforming to the H.264/AVC standard, i.e., no side information needs to be sent to the decoder in order to employ the proposed approach, thereby compatibility with the standard syntax and existing encoders is retained.
当前的视频编码器采用具有运动补偿的预测编码来利用信号中的时间冗余。具体来说,沿着运动轨迹的块被建模为自回归(AR)过程,通常假设预测误差在时间上是独立的,并且近似于该过程的新息。因此,零延迟编码和解码被认为是有效的。本文基于一个很大程度上被忽视的事实,即由于预测环路中的量化效应,这些预测误差实际上在时间上是相关的。它提出了一种基于估计理论的延迟解码方案,该方案利用来自未来帧的信息来提高当前帧的重建质量。与标准解码器不同,标准解码器一旦可用残差的相应量化索引,就会立即再现每个块,而所提出的延迟解码器通过适当推导的概率密度函数有效地组合所有可访问的(包括任何未来的)信息,以获得每个变换系数的最佳延迟重建。实验表明,与标准解码器相比,该方法有显著的增益。源 AR 模型的必要信息是根据符合 H.264/AVC 标准的比特流以时空自适应的方式进行估计的,也就是说,不需要向解码器发送任何附加信息来采用所提出的方法,从而保留了与标准语法和现有编码器的兼容性。