Ma Ruijun, Zhang Yaoxuan, Zhang Bob, Fang Leyuan, Huang Dong, Qi Long
IEEE Trans Image Process. 2024;33:3707-3721. doi: 10.1109/TIP.2024.3404253. Epub 2024 Jun 13.
Recent advancements in deep learning techniques have pushed forward the frontiers of real photograph denoising. However, due to the inherent pooling operations in the spatial domain, current CNN-based denoisers are biased towards focusing on low-frequency representations, while discarding the high-frequency components. This will induce a problem for suboptimal visual quality as the image denoising tasks target completely eliminating the complex noises and recovering all fine-scale and salient information. In this work, we tackle this challenge from the frequency perspective and present a new solution pipeline, coined as frequency attention denoising network (FADNet). Our key idea is to build a learning-based frequency attention framework, where the feature correlations on a broader frequency spectrum can be fully characterized, thus enhancing the representational power of the network across multiple frequency channels. Based on this, we design a cascade of adaptive instance residual modules (AIRMs). In each AIRM, we first transform the spatial-domain features into the frequency space. Then, a learning-based frequency attention framework is devised to explore the feature inter-dependencies converted in the frequency domain. Besides this, we introduce an adaptive layer by leveraging the guidance of the estimated noise map and intermediate features to meet the challenges of model generalization in the noise discrepancy. The effectiveness of our method is demonstrated on several real camera benchmark datasets, with superior denoising performance, generalization capability, and efficiency versus the state-of-the-art.
深度学习技术的最新进展推动了真实照片去噪的前沿发展。然而,由于空间域中固有的池化操作,当前基于卷积神经网络(CNN)的去噪器倾向于专注于低频表示,而丢弃高频分量。这将导致视觉质量次优的问题,因为图像去噪任务的目标是完全消除复杂噪声并恢复所有精细尺度和显著信息。在这项工作中,我们从频率角度应对这一挑战,并提出了一种新的解决方案管道,称为频率注意力去噪网络(FADNet)。我们的关键思想是构建一个基于学习的频率注意力框架,在该框架中,可以充分表征更广泛频谱上的特征相关性,从而增强网络在多个频率通道上的表征能力。基于此,我们设计了一系列自适应实例残差模块(AIRM)。在每个AIRM中,我们首先将空间域特征转换到频率空间。然后,设计一个基于学习的频率注意力框架来探索在频域中转换的特征相互依赖性。除此之外,我们通过利用估计的噪声图和中间特征的指导引入一个自适应层,以应对噪声差异中模型泛化的挑战。我们的方法在几个真实相机基准数据集上的有效性得到了证明,与现有技术相比,具有卓越的去噪性能、泛化能力和效率。