Systems Laboratory, Stanford University, Stanford, CA 94305, USA.
IEEE Trans Image Process. 2010 Jul;19(7):1740-55. doi: 10.1109/TIP.2010.2044964. Epub 2010 Mar 8.
The direction-adaptive partitioned block transform (DA-PBT) is proposed to exploit the directional features in color images to improve coding performance. Depending on the directionality in an image block, the transform either selects one of the eight directional modes or falls back to the nondirectional mode equivalent to the conventional 2-D DCT. The selection of a directional mode determines the transform direction that provides directional basis functions, the block partitioning that spatially confines the high-frequency energy, the scanning order that arranges the transform coefficients into a 1-D sequence for efficient entropy coding, and the quantization matrix optimized for visual quality. The DA-PBT can be incorporated into image coding using a rate-distortion optimized framework for direction selection, and can therefore be viewed as a generalization of variable blocksize transforms with the inclusion of directional transforms and nonrectangular partitions. As a block transform, it can naturally be combined with block-based intra or inter prediction to exploit the directionality remaining in the residual. Experimental results show that the proposed DA-PBT outperforms the 2-D DCT by more than 2 dB for test images with directional features. It also greatly reduces the ringing and checkerboard artifacts typically observed around directional features in images. The DA-PBT also consistently outperforms a previously proposed directional DCT. When combined with directional prediction, gains are less than additive, as similar signal properties are exploited by the prediction and the transform. For hybrid video coding, significant gains are shown for intra coding, but not for encoding the residual after accurate motion-compensated prediction.
方向自适应分区块变换(DA-PBT)被提出以利用彩色图像中的方向特征来提高编码性能。根据图像块的方向,变换要么选择八个方向模式之一,要么退回到等效于传统二维 DCT 的非方向模式。方向模式的选择决定了提供方向基函数的变换方向、空间限制高频能量的块分区、将变换系数排列成一维序列以进行高效熵编码的扫描顺序以及针对视觉质量优化的量化矩阵。DA-PBT 可以通过用于方向选择的率失真优化框架被合并到图像编码中,因此可以被视为具有方向变换和非矩形分区的可变块大小变换的推广。作为一种块变换,它可以与基于块的帧内或帧间预测自然地结合,以利用残差中剩余的方向特性。实验结果表明,对于具有方向特征的测试图像,所提出的 DA-PBT 比二维 DCT 好 2dB 以上。它还大大减少了图像中方向特征周围通常观察到的振铃和棋盘伪影。DA-PBT 也始终优于先前提出的方向 DCT。当与方向预测结合使用时,增益不是加性的,因为预测和变换利用了相似的信号特性。对于混合视频编码,对于帧内编码,增益是显著的,但对于经过精确运动补偿预测后的残差编码则不是。