Faculty of Information Engineering, Xinjiang Institute of Technology, Aksu, China.
PLoS One. 2024 Nov 6;19(11):e0307236. doi: 10.1371/journal.pone.0307236. eCollection 2024.
The objective of image fusion is to integrate complementary features from source images to better cater to the needs of human and machine vision. However, existing image fusion algorithms predominantly focus on enhancing the visual appeal of the fused image for human perception, often neglecting their impact on subsequent high-level visual tasks, particularly the processing of semantic information. Moreover, these fusion methods that incorporate downstream tasks tend to be overly complex and computationally intensive, which is not conducive to practical applications. To address these issues, a lightweight infrared and visible light image fusion method known as SIFusion, which is based on semantic injection, is proposed in this paper. This method employs a semantic-aware branch to extract semantic feature information, and then integrates these features into the fused features through a Semantic Injection Module (SIM) to meet the semantic requirements of high-level visual tasks. Furthermore, to simplify the complexity of the fusion network, this method introduces an Edge Convolution Module (ECB) based on structural reparameterization technology to enhance the representational capacity of the encoder and decoder. Extensive experimental comparisons demonstrate that the proposed method performs excellently in terms of visual appeal and advanced semantics, providing satisfactory fusion results for subsequent high-level visual tasks even in challenging scenarios.
图像融合的目标是整合源图像的互补特征,以更好地满足人类和机器视觉的需求。然而,现有的图像融合算法主要侧重于增强融合图像的视觉吸引力,以满足人类感知的需求,往往忽略了它们对后续高级视觉任务的影响,特别是语义信息的处理。此外,这些融合方法结合下游任务往往过于复杂和计算密集,不利于实际应用。为了解决这些问题,本文提出了一种基于语义注入的轻量级红外和可见光图像融合方法,称为 SIFusion。该方法采用语义感知分支提取语义特征信息,然后通过语义注入模块(SIM)将这些特征集成到融合特征中,以满足高级视觉任务的语义要求。此外,为了简化融合网络的复杂性,该方法引入了一种基于结构重参数化技术的边缘卷积模块(ECB),以增强编码器和解码器的表示能力。大量的实验比较表明,所提出的方法在视觉吸引力和高级语义方面表现出色,即使在具有挑战性的场景下,也能为后续的高级视觉任务提供令人满意的融合结果。