Chen Yanfei, Yue Tong, An Pei, Hong Hanyu, Liu Tao, Liu Yangkai, Zhou Yihui
Hubei Key Laboratory of Optical Information and Pattern Recognition, School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China.
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430072, China.
Sensors (Basel). 2025 Jun 15;25(12):3750. doi: 10.3390/s25123750.
Single image dehazing is a fundamental task in computer vision, aiming to recover a clear scene from a hazy input image. To address the limitations of traditional dehazing algorithms, particularly in global feature association and local detail preservation, this study proposes a novel Transformer-based dehazing model enhanced by an interactive channel attention mechanism. The proposed architecture adopts a U-shaped encoder-decoder framework, incorporating key components such as a feature extraction module and a feature fusion module based on interactive attention. Specifically, the interactive channel attention mechanism facilitates cross-layer feature interaction, enabling the dynamic fusion of global contextual information and local texture details. The network architecture leverages a multi-scale feature pyramid to extract image information across different dimensions, while an improved cross-channel attention weighting mechanism enhances feature representation in regions with varying haze densities. Extensive experiments conducted on both synthetic and real-world datasets, including the RESIDE benchmark, demonstrate the superior performance of the proposed method. Quantitatively, it achieves PSNR gains of 0.53 dB for indoor scenes and 1.64 dB for outdoor scenes, alongside SSIM improvements of 1.4% and 1.7%, respectively, compared with the second-best performing method. Qualitative assessments further confirm that the proposed model excels in restoring fine structural details in dense haze regions while maintaining high color fidelity. These results validate the effectiveness of the proposed approach in enhancing both perceptual quality and quantitative accuracy in image dehazing tasks.
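The abstract does not include code, but the channel-attention weighting it describes can be illustrated with a minimal NumPy sketch. The sketch below follows the common squeeze-and-excitation pattern (global average pooling, a bottleneck MLP, and a sigmoid gate over channels); the `interactive_fusion` function is a hypothetical illustration of cross-layer interaction, in which decoder statistics gate the encoder skip features before fusion. Function names and weight shapes are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def sigmoid(x):
    # Numerically standard logistic function, maps scores to (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation-style channel attention (illustrative).

    feat: (C, H, W) feature map
    w1:   (C // r, C) reduction weights, r = channel reduction ratio
    w2:   (C, C // r) expansion weights
    """
    squeezed = feat.mean(axis=(1, 2))        # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)  # bottleneck + ReLU
    weights = sigmoid(w2 @ hidden)           # per-channel gates in (0, 1)
    return feat * weights[:, None, None]     # reweight each channel map

def interactive_fusion(enc_feat, dec_feat, w1, w2):
    """Hypothetical cross-layer interaction: the decoder feature's channel
    statistics gate the encoder skip connection before additive fusion."""
    squeezed = dec_feat.mean(axis=(1, 2))
    hidden = np.maximum(w1 @ squeezed, 0.0)
    weights = sigmoid(w2 @ hidden)
    return enc_feat * weights[:, None, None] + dec_feat
```

Because the gates lie strictly in (0, 1), each channel is attenuated rather than amplified; regions whose pooled statistics indicate heavy haze would, in the paper's design, receive different channel weights than clear regions.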