School of Computer Engineering, Nanyang Technological University, 639798 Singapore.
IEEE Trans Image Process. 2012 Dec;21(12):4770-81. doi: 10.1109/TIP.2012.2206045. Epub 2012 Jun 26.
The discrete cosine transform (DCT) is the orthogonal transform most commonly used in image and video compression. In most video codecs, the motion-compensation residual (MC-residual) is also compressed with the DCT. However, the MC-residual has different characteristics from a natural image. In this paper, we develop a new orthogonal transform, the rotated orthogonal transform (ROT), that performs better than the DCT on the MC-residual for coding purposes. We derive the proposed ROT from an orthogonality-constrained L1-norm minimization problem, chosen because L1-norm minimization promotes sparse coefficients. Using the DCT matrix as the starting point, a better orthogonal transform matrix is derived. In addition, by exploiting inter-frame dependency and local motion activity, transmission of substantial side information is avoided. Experimental results confirm that, with small computational overhead, the ROT adapts to changes in the local spatial characteristics of the MC-residual frame and provides higher compression efficiency for the MC-residual than the DCT, especially for high- and complex-motion videos.
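The idea of starting from the DCT matrix and lowering an L1-norm cost under an orthogonality constraint can be illustrated with a minimal sketch. This is not the derivation used in the paper, which the abstract does not spell out: the greedy search over Givens (plane) rotations, the function names `dct_matrix`, `l1_cost`, `givens`, and `rotate_toward_sparsity`, the angle grid, and the number of sweeps are all assumptions made for illustration. Because each candidate is the product of orthogonal matrices, the result stays orthogonal, and because a candidate is accepted only when it lowers the cost, the L1 norm never exceeds that of the DCT on the given blocks.

```python
import numpy as np

def dct_matrix(n):
    """Return the orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n).reshape(-1, 1)   # frequency index (rows)
    i = np.arange(n).reshape(1, -1)   # sample index (columns)
    t = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    t[0, :] /= np.sqrt(2.0)           # scale the DC row so rows are orthonormal
    return t

def l1_cost(t, blocks):
    """Sum of absolute separable 2-D transform coefficients over all blocks.

    blocks has shape (m, n, n); matmul broadcasts t over the first axis.
    """
    coeffs = t @ blocks @ t.T
    return float(np.abs(coeffs).sum())

def givens(n, i, j, theta):
    """Return an n x n plane rotation by theta in the (i, j) coordinate plane."""
    g = np.eye(n)
    c, s = np.cos(theta), np.sin(theta)
    g[i, i] = c
    g[j, j] = c
    g[i, j] = -s
    g[j, i] = s
    return g

def rotate_toward_sparsity(blocks, angles=(-0.2, -0.1, -0.05, 0.05, 0.1, 0.2), sweeps=2):
    """Greedy search (an illustrative assumption, not the paper's method).

    Starts at the DCT and keeps any plane rotation that lowers the L1 cost.
    """
    n = blocks.shape[-1]
    t = dct_matrix(n)
    best = l1_cost(t, blocks)
    for _ in range(sweeps):
        for i in range(n - 1):
            for j in range(i + 1, n):
                for theta in angles:
                    cand = givens(n, i, j, theta) @ t   # product of orthogonal matrices
                    cost = l1_cost(cand, blocks)
                    if cost < best:
                        t, best = cand, cost
    return t, best
```

Calling `rotate_toward_sparsity` on an array of residual blocks of shape `(m, n, n)` returns the adapted matrix and its L1 cost, which can be compared against `l1_cost(dct_matrix(n), blocks)` to see how much sparsity, if any, the rotations gain on that data.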