Cui Yuning, Liu Mingyu, Ren Wenqi, Knoll Alois
IEEE Trans Neural Netw Learn Syst. 2025 May 1;PP. doi: 10.1109/TNNLS.2025.3561924.
Image restoration aims to recover clean images from degraded counterparts. While Transformer-based approaches have achieved significant advancements in this field, they are limited by high complexity and their inability to capture omni-range dependencies, hindering their overall performance. In this work, we develop Modumer for effective and efficient image restoration by revisiting the Transformer block and modulation design, which processes input through a convolutional block and projection layers and fuses features via elementwise multiplication. Specifically, within each unit of Modumer, we integrate the cascaded modulation design with the downsampled Transformer block to build the attention layers, enabling omni-kernel modulation and mapping inputs into high-dimensional feature spaces. Moreover, we introduce a bioinspired parameter-sharing mechanism to attention layers, which not only enhances efficiency but also improves performance. In addition, a dual-domain feed-forward network (DFFN) strengthens the representational power of the model. Extensive experimental evaluations demonstrate that the proposed Modumer achieves state-of-the-art performance across ten datasets in five single-degradation image restoration tasks, including image motion deblurring, deraining, dehazing, desnowing, and low-light enhancement. Moreover, the model exhibits strong generalization capabilities in all-in-one image restoration tasks. Additionally, it demonstrates competitive performance in composite-degradation image restoration.
图像恢复旨在从退化的图像中恢复出清晰的图像。虽然基于Transformer的方法在该领域取得了显著进展,但它们受到高复杂度以及无法捕捉全范围依赖关系的限制,从而阻碍了其整体性能。在这项工作中,我们通过重新审视Transformer模块和调制设计来开发Modumer,以实现高效的图像恢复,它通过卷积模块和投影层处理输入,并通过逐元素乘法融合特征。具体而言,在Modumer的每个单元中,我们将级联调制设计与下采样的Transformer模块集成以构建注意力层,实现全内核调制并将输入映射到高维特征空间。此外,我们为注意力层引入了一种受生物启发的参数共享机制,这不仅提高了效率,还提升了性能。此外,双域前馈网络(DFFN)增强了模型的表征能力。广泛的实验评估表明,所提出的Modumer在五个单退化图像恢复任务的十个数据集上实现了当前最优的性能,包括图像运动去模糊、去雨、去雾、除雪和低光增强。此外,该模型在一体化图像恢复任务中展现出强大的泛化能力。此外,它在复合退化图像恢复中表现出具有竞争力的性能。