Zhang Ling, Yang Xuefei, Wan Zhenlong, Cao Dingxin, Lin Yingcheng
The School of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China.
National Information Center of GACC, Beijing 100005, China.
Sensors (Basel). 2022 Nov 4;22(21):8487. doi: 10.3390/s22218487.
Taking advantage of the functional complementarity between infrared and visible light sensors imaging, pixel-level real-time image fusion based on infrared and visible light images of different resolutions is a promising strategy for visual enhancement, which has demonstrated tremendous potential for autonomous driving, military reconnaissance, video surveillance, etc. Great progress has been made in this field in recent years, but the fusion speed and quality of visual enhancement are still not satisfactory. Herein, we propose a multi-scale FPGA-based image fusion technology with substantially enhanced visual enhancement capability and fusion speed. Specifically, the source images are first decomposed into three distinct layers using guided filter and saliency detection, which are the detail layer, saliency layer and background layer. Fusion weight map of the saliency layer is subsequently constructed using attention mechanism. Afterwards weight fusion strategy is used for saliency layer fusion and detail layer fusion, while weight average fusion strategy is used for the background layer fusion, followed by the incorporation of image enhancement technology to improve the fused image contrast. Finally, high-level synthesis tool is used to design the hardware circuit. The method in the present study is thoroughly tested on XCZU15EG board, which could not only effectively improve the image enhancement capability in glare and smoke environments, but also achieve fast real-time image fusion with 55FPS for infrared and visible images with a resolution of 640 × 470.
利用红外与可见光传感器成像之间的功能互补性,基于不同分辨率的红外与可见光图像进行像素级实时图像融合是一种很有前景的视觉增强策略,已在自动驾驶、军事侦察、视频监控等领域展现出巨大潜力。近年来该领域取得了很大进展,但视觉增强的融合速度和质量仍不尽人意。在此,我们提出一种基于多尺度现场可编程门阵列(FPGA)的图像融合技术,其视觉增强能力和融合速度得到显著提升。具体而言,首先使用引导滤波器和显著性检测将源图像分解为三个不同的层,即细节层、显著性层和背景层。随后利用注意力机制构建显著性层的融合权重图。之后采用加权融合策略进行显著性层融合和细节层融合,而背景层融合则采用加权平均融合策略,接着引入图像增强技术以提高融合图像的对比度。最后,使用高级综合工具设计硬件电路。本研究中的方法在XCZU15EG板上进行了全面测试,该方法不仅能有效提高在眩光和烟雾环境下的图像增强能力,还能实现对分辨率为640×470的红外与可见光图像以55帧每秒的速度进行快速实时图像融合。