Ouyang Mingyu, Chen Zhenzhong
IEEE Trans Image Process. 2024;33:3385-3398. doi: 10.1109/TIP.2024.3403054. Epub 2024 May 31.
JPEG compression adopts the quantization of Discrete Cosine Transform (DCT) coefficients for effective bit-rate reduction, whilst the quantization could lead to a significant loss of important image details. Recovering compressed JPEG images in the frequency domain has recently garnered increasing interest, complementing the multitude of restoration techniques established in the pixel domain. However, existing DCT domain methods typically suffer from limited effectiveness in handling a wide range of compression quality factors or fall short in recovering sparse quantized coefficients and the components across different colorspaces. To address these challenges, we propose a DCT domain spatial-frequential Transformer, namely DCTransformer, for JPEG quantized coefficient recovery. Specifically, a dual-branch architecture is designed to capture both spatial and frequential correlations within the collocated DCT coefficients. Moreover, we incorporate the operation of quantization matrix embedding, which effectively allows our single model to handle a wide range of quality factors, and a luminance-chrominance alignment head that produces a unified feature map to align different-sized luminance and chrominance components. Our proposed DCTransformer outperforms the current state-of-the-art JPEG artifact removal techniques, as demonstrated by our extensive experiments.
JPEG压缩采用离散余弦变换(DCT)系数量化以有效降低比特率,而这种量化可能会导致重要图像细节的大量损失。最近,在频域中恢复压缩的JPEG图像越来越受到关注,这对像素域中已有的众多恢复技术起到了补充作用。然而,现有的DCT域方法在处理广泛的压缩质量因子时通常效果有限,或者在恢复稀疏量化系数以及不同颜色空间的分量方面存在不足。为应对这些挑战,我们提出了一种用于JPEG量化系数恢复的DCT域空间频率Transformer,即DCTransformer。具体而言,设计了一种双分支架构来捕捉并置DCT系数内的空间和频率相关性。此外,我们纳入了量化矩阵嵌入操作,这有效地使我们的单个模型能够处理广泛的质量因子,以及一个亮度 - 色度对齐头,该头生成统一的特征图以对齐不同大小的亮度和色度分量。我们提出的DCTransformer优于当前最先进的JPEG伪像去除技术,大量实验证明了这一点。