Wang Wenqing, Li Lingzhou, Yang Yifei, Liu Han, Guo Runyuan
School of Automation and Information Engineering, Xi'an University of Technology, Xi'an 710048, China.
Shaanxi Key Laboratory of Complex System Control and Intelligent Information Processing, Xi'an University of Technology, Xi'an 710048, China.
Sensors (Basel). 2024 Dec 3;24(23):7729. doi: 10.3390/s24237729.
The purpose of infrared and visible image fusion is to combine the advantages of both and generate a fused image that contains target information and has rich details and contrast. However, existing fusion algorithms often overlook the importance of incorporating both local and global feature extraction, leading to missing key information in the fused image. To address these challenges, this paper proposes a dual-branch fusion network combining convolutional neural network (CNN) and Transformer, which enhances the feature extraction capability and motivates the fused image to contain more information. Firstly, a local feature extraction module with CNN as the core is constructed. Specifically, the residual gradient module is used to enhance the ability of the network to extract texture information. Also, jump links and coordinate attention are used in order to relate shallow features to deeper ones. In addition, a global feature extraction module based on Transformer is constructed. Through the powerful ability of Transformer, the global context information of the image can be captured and the global features are fully extracted. The effectiveness of the proposed method in this paper is verified on different experimental datasets, and it is better than most of the current advanced fusion algorithms.
红外与可见光图像融合的目的是结合两者的优势,生成一幅包含目标信息且具有丰富细节和对比度的融合图像。然而,现有的融合算法常常忽视结合局部和全局特征提取的重要性,导致融合图像中关键信息缺失。为应对这些挑战,本文提出了一种结合卷积神经网络(CNN)和Transformer的双分支融合网络,它增强了特征提取能力,促使融合图像包含更多信息。首先,构建了一个以CNN为核心的局部特征提取模块。具体而言,使用残差梯度模块来增强网络提取纹理信息的能力。此外,还使用跳跃连接和坐标注意力,以便将浅层特征与深层特征联系起来。另外,构建了一个基于Transformer的全局特征提取模块。通过Transformer的强大能力,可以捕捉图像的全局上下文信息并充分提取全局特征。本文所提方法的有效性在不同实验数据集上得到了验证,且优于当前大多数先进的融合算法。