Khan Asif Hussain, Micheloni Christian, Martinel Niki
IEEE Trans Image Process. 2024;33:4556-4567. doi: 10.1109/TIP.2024.3442613. Epub 2024 Aug 23.
Blind image super-resolution (SR) aims to recover a high-resolution (HR) image from its low-resolution (LR) counterpart under the assumption of unknown degradations. Many existing blind SR methods rely on supervising ground-truth kernels referred to as explicit degradation estimators. However, it is very challenging to obtain the ground-truths for different degradations kernels. Moreover, most of these methods rely on heavy backbone networks, which demand extensive computational resources. Implicit degradation estimators do not require the availability of ground truth kernels, but they see a significant performance gap with the explicit degradation estimators due to such missing information. We present a novel approach that significantly narrows such a gap by means of a lightweight architecture that implicitly learns the degradation kernel with the help of a novel loss component. The kernel is exploited by a learnable Wiener filter that performs efficient deconvolution in the Fourier domain by deriving a closed-form solution. Inspired by prompt-based learning, we also propose a novel degradation-conditioned prompt layer that exploits the estimated kernel to drive the focus on the discriminative contextual information that guides the reconstruction process in recovering the latent HR image. Extensive experiments under different degradation settings demonstrate that our model, named PL-IDENet, yields PSNR and SSIM improvements of more than 0.4dB and 1.3%, and 1.4dB and 4.8% to the best implicit and explicit blind-SR method, respectively. These results are achieved while maintaining a substantially lower number of parameters/FLOPs (i.e., 25% and 68% fewer parameters than best implicit and explicit methods, respectively).
盲图像超分辨率(SR)旨在在未知退化的假设下,从低分辨率(LR)图像恢复高分辨率(HR)图像。许多现有的盲SR方法依赖于监督被称为显式退化估计器的真实内核。然而,获得不同退化内核的真实情况非常具有挑战性。此外,这些方法中的大多数依赖于重型骨干网络,这需要大量的计算资源。隐式退化估计器不需要真实内核的可用性,但由于缺少此类信息,它们与显式退化估计器相比存在显著的性能差距。我们提出了一种新颖的方法,通过一种轻量级架构显著缩小这种差距,该架构借助一种新颖的损失分量隐式地学习退化内核。内核由一个可学习的维纳滤波器利用,该滤波器通过推导闭式解在傅里叶域中执行高效的反卷积。受基于提示的学习启发,我们还提出了一种新颖的退化条件提示层,该层利用估计的内核来驱动对判别性上下文信息的关注,该信息在恢复潜在HR图像时指导重建过程。在不同退化设置下的大量实验表明,我们名为PL-IDENet的模型分别比最佳隐式和显式盲SR方法的PSNR和SSIM提高了超过0.4dB和1.3%,以及1.4dB和4.8%。这些结果是在保持参数/浮点运算次数大幅减少的情况下实现的(即分别比最佳隐式和显式方法少25%和68%的参数)。