Punnappurath Abhijith, Brown Michael S
IEEE Trans Pattern Anal Mach Intell. 2020 Apr;42(4):1013-1019. doi: 10.1109/TPAMI.2019.2903062. Epub 2019 Mar 4.
Deep learning-based image compressors are actively being explored in an effort to supersede conventional image compression algorithms, such as JPEG. Conventional and deep learning-based compression algorithms focus on minimizing image fidelity errors in the nonlinear standard RGB (sRGB) color space. However, for many computer vision tasks, the sensor's linear raw-RGB image is desirable. Recent work has shown that the original raw-RGB image can be reconstructed using only small amounts of metadata embedded inside the JPEG image [1]. However, [1] relied on the conventional JPEG encoding that is unaware of the raw-RGB reconstruction task. In this paper, we examine the ability of deep image compressors to be "aware" of the additional objective of raw reconstruction. Towards this goal, we describe a general framework that enables deep networks targeting image compression to jointly consider both image fidelity errors and raw reconstruction errors. We describe this approach in two scenarios: (1) the network is trained from scratch using our proposed joint loss, and (2) a network originally trained only for sRGB fidelity loss is later fine-tuned to incorporate our raw reconstruction loss. When compared to sRGB fidelity-only compression, our combined loss leads to appreciable improvements in PSNR of the raw reconstruction with only minor impact on sRGB fidelity as measured by MS-SSIM.
基于深度学习的图像压缩器正在被积极探索,以取代传统的图像压缩算法,如JPEG。传统的和基于深度学习的压缩算法专注于在非线性标准RGB(sRGB)颜色空间中最小化图像保真度误差。然而,对于许多计算机视觉任务来说,传感器的线性原始RGB图像是理想的。最近的工作表明,仅使用嵌入在JPEG图像中的少量元数据就可以重建原始的原始RGB图像[1]。然而,[1]依赖于传统的JPEG编码,而这种编码并不知道原始RGB重建任务。在本文中,我们研究了深度图像压缩器“感知”原始重建这一额外目标的能力。为了实现这一目标,我们描述了一个通用框架,该框架使针对图像压缩的深度网络能够同时考虑图像保真度误差和原始重建误差。我们在两种情况下描述这种方法:(1)使用我们提出的联合损失从头开始训练网络,(2)最初仅针对sRGB保真度损失进行训练的网络后来进行微调,以纳入我们的原始重建损失。与仅针对sRGB保真度的压缩相比,我们的组合损失在原始重建的PSNR方面带来了显著的改进,而对MS-SSIM测量的sRGB保真度只有轻微影响。