Joint Separable and Non-Separable Transforms for Next-Generation Video Coding

Authors

Zhao Xin, Chen Jianle, Karczewicz Marta, Said Amir, Seregin Vadim

Publication information

IEEE Trans Image Process. 2018 Feb 5. doi: 10.1109/TIP.2018.2802202.

Abstract

Throughout the past few decades, the separable Discrete Cosine Transform (DCT), particularly the DCT type II, has been widely used in image and video compression. It is well known that, under first-order stationary Markov conditions, DCT is an efficient approximation of the optimal Karhunen-Loève transform. However, for natural image and video sources, the adaptivity of a single separable transform with fixed core is rather limited for the highly dynamic image statistics, e.g., textures and arbitrarily directed edges. It is also known that non-separable transforms can achieve better compression efficiency for images with directional texture patterns, yet they are computationally complex, especially when the transform size is large. In order to achieve higher transform coding gains with relatively low-complexity implementations, we propose a joint separable and non-separable transform. The proposed separable primary transform, named Enhanced Multiple Transform (EMT), applies multiple transform cores from a pre-defined subset of sinusoidal transforms, and the transform selection is signaled in a joint block level manner. Moreover, a Non-Separable Secondary Transform (NSST) method is proposed to operate in conjunction with EMT. Unlike the existing non-separable transform schemes which require excessive amounts of memory and computation, the proposed NSST efficiently improves coding gain with much lower complexity. Extensive experimental results show that the proposed methods, in a state-of-the-art video codec, such as HEVC, can provide significant coding gains (average 6.9% and 4.5% bitrate reductions for intra and random-access coding, respectively).
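The abstract's two ingredients can be illustrated with a small NumPy sketch. It is not the authors' implementation: the candidate set (DCT-II and DST-VII only), the L1-norm selection cost, and all function names are assumptions made for illustration, whereas a real encoder would choose cores by rate-distortion cost and signal the choice in the bitstream. The sketch builds two sinusoidal cores, picks a horizontal/vertical pair per block, and shows that a separable transform is a special case of a non-separable one acting on the vectorized block.

```python
import numpy as np

def dct2_matrix(n):
    """Orthonormal DCT type II core of size n x n (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * j + 1) * k / (2 * n))
    m[0, :] /= np.sqrt(2.0)  # scale the DC row so the matrix is orthonormal
    return m

def dst7_matrix(n):
    """Orthonormal DST type VII core of size n x n."""
    k = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    return np.sqrt(4.0 / (2 * n + 1)) * np.sin(
        np.pi * (2 * k + 1) * (j + 1) / (2 * n + 1))

def separable_transform(block, t_vert, t_horiz):
    """Apply t_vert along columns and t_horiz along rows of the block."""
    return t_vert @ block @ t_horiz.T

def select_transform_pair(block, candidates):
    """Try every (vertical, horizontal) core pair and keep the cheapest.

    The cost is the L1 norm of the coefficients, a hypothetical stand-in
    for the rate-distortion cost an actual encoder would evaluate.
    """
    best = None
    for v_name, t_v in candidates.items():
        for h_name, t_h in candidates.items():
            coeffs = separable_transform(block, t_v, t_h)
            cost = float(np.abs(coeffs).sum())
            if best is None or cost < best[0]:
                best = (cost, v_name, h_name, coeffs)
    return best

def nonseparable_transform(block, kernel):
    """Apply an (n*n) x (n*n) kernel to the row-major vectorized block."""
    return (kernel @ block.reshape(-1)).reshape(block.shape)

n = 4
candidates = {"DCT-II": dct2_matrix(n), "DST-VII": dst7_matrix(n)}
rng = np.random.default_rng(0)
residual = rng.standard_normal((n, n))  # toy residual block, not real data

cost, v_name, h_name, coeffs = select_transform_pair(residual, candidates)
print(f"selected vertical={v_name}, horizontal={h_name}")

# A separable transform equals a non-separable one whose kernel is the
# Kronecker product of the two 1-D cores (row-major vectorization).
kernel = np.kron(candidates[v_name], candidates[h_name])
same = np.allclose(nonseparable_transform(residual, kernel), coeffs)
print("separable == Kronecker non-separable:", same)
```

The Kronecker form also makes the complexity gap concrete: a general non-separable kernel for an n x n block has n^4 entries, whereas the separable pair has only 2n^2, which is why the abstract stresses the cost of non-separable transforms at large sizes.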

Similar articles

Jointly optimized spatial prediction and block transform for video and image coding.

IEEE Trans Image Process. 2012 Apr;21(4):1874-84. doi: 10.1109/TIP.2011.2169976. Epub 2011 Sep 29.

Steerable Discrete Cosine Transform.

IEEE Trans Image Process. 2017 Jan;26(1):303-314. doi: 10.1109/TIP.2016.2623489. Epub 2016 Oct 31.

Graph-based Transforms for Video Coding.

IEEE Trans Image Process. 2020 Sep 30;PP. doi: 10.1109/TIP.2020.3026627.

HEVC-Based Perceptually Adaptive Video Coding Using a DCT-Based Local Distortion Detection Probability Model.

IEEE Trans Image Process. 2016 Jul;25(7):3343-3357. doi: 10.1109/TIP.2016.2568459. Epub 2016 May 13.

A CU-Level Rate and Distortion Estimation Scheme for RDO of Hardware-Friendly HEVC Encoders Using Low-Complexity Integer DCTs.

IEEE Trans Image Process. 2016 Aug;25(8):3787-800. doi: 10.1109/TIP.2016.2579559. Epub 2016 Jun 9.

DCT/DST-based transform coding for intra prediction in image/video coding.

IEEE Trans Image Process. 2013 Oct;22(10):3974-81. doi: 10.1109/TIP.2013.2265882. Epub 2013 Jun 3.

A sinusoidal family of unitary transforms.

IEEE Trans Pattern Anal Mach Intell. 1979 Apr;1(4):356-65. doi: 10.1109/tpami.1979.4766944.

Steerable-Discrete-Cosine-Transform (SDCT): Hardware Implementation and Performance Analysis.

Sensors (Basel). 2020 Mar 4;20(5):1405. doi: 10.3390/s20051405.

Block-based spatial prediction and transforms based on 2D Markov processes for image and video compression.

IEEE Trans Image Process. 2015 Apr;24(4):1247-60. doi: 10.1109/TIP.2015.2400818. Epub 2015 Feb 5.

Cited by

Low-Complexity Multiple Transform Selection Combining Multi-Type Tree Partition Algorithm for Versatile Video Coding.

Sensors (Basel). 2022 Jul 25;22(15):5523. doi: 10.3390/s22155523.
