Samsung Telecommunications America, Richardson, TX 75082, USA.
IEEE Trans Image Process. 2013 Oct;22(10):4061-71. doi: 10.1109/TIP.2013.2270087. Epub 2013 Jun 19.
In this paper, we present a transform scheme where a secondary transform is applied after the conventional DCT for intra as well as inter prediction residues. Our approach is applicable to any block-based video codec that employs transforms along the horizontal and vertical direction separably. The secondary transform is applied to the lower K ( K=4 or 8) frequency coefficients of the output of conventional DCT at block with dimensions 8 and larger. The proposed transform scheme has low complexity as it is applied only to the top-left portion of the DCT output, especially in the context of large blocks such as 32 × 32 where an alternate non-DCT 32 × 32 transform would have a prohibitive implementation hardware cost. The proposed technique is single-pass, and the choice of whether to use the secondary transform is solely based on the prediction direction for intra residue, and on transform unit location in the prediction unit for the inter residue. The scheme requires no additional signaling information or R-D search. Our simulation results show that the proposed transform scheme provides significant BD-rate improvement over the conventional DCT-based coding scheme. Finally, we also show how to implement the proposed secondary transforms with low latency in hardware.
在本文中,我们提出了一种变换方案,在常规 DCT 之后对帧内和帧间预测残差应用二次变换。我们的方法适用于任何采用水平和垂直方向分离变换的基于块的视频编解码器。二次变换应用于尺寸为 8 及以上的块的常规 DCT 输出的较低 K(K=4 或 8)频率系数。由于仅应用于 DCT 输出的左上角部分,因此该变换方案具有较低的复杂度,特别是在大的块(例如 32×32)的情况下,替代的非 DCT 32×32 变换将具有很高的硬件成本。所提出的技术是单遍的,是否使用二次变换的选择仅基于帧内残差的预测方向,以及帧间残差的预测单元中的变换单元位置。该方案不需要额外的信令信息或 R-D 搜索。我们的仿真结果表明,与基于常规 DCT 的编码方案相比,所提出的变换方案显著提高了 BD 率。最后,我们还展示了如何在硬件中实现具有低延迟的二次变换。