Liu Renhe, Liu Yu, Wang Han, Li Junxian, Hu Kai
School of Microelectronics, Tianjin University, Tianjin 300072, China.
Tianjin Medical University Cancer Institute & Hospital, Tianjin 300060, China.
Biomed Opt Express. 2025 May 27;16(6):2459-2481. doi: 10.1364/BOE.555458. eCollection 2025 Jun 1.
Medical image fusion integrates complementary information from multimodal medical images to provide comprehensive references for clinical decision-making, such as the diagnosis of Alzheimer's disease and the detection and segmentation of brain tumors. Although traditional and deep learning-based fusion methods have been extensively studied, they often fail to devise targeted strategies that fully utilize distinct regional or feature-specific information. This paper proposes SAFFusion, a saliency-aware frequency fusion network that integrates intensity and texture cues from multimodal medical images. We first introduce Mamba-UNet, a multiscale encoder-decoder architecture enhanced by the Mamba design, to improve global modeling in feature extraction and image reconstruction. By employing the contourlet transform in Mamba-UNet, we replace conventional pooling with multiscale representations and decompose spatial features into high- and low-frequency subbands. A dual-branch frequency feature fusion module then fuses cross-modality information according to the distinct characteristics of these frequency subbands. Furthermore, we apply latent low-rank representation (LatLRR) to assess image saliency and implement adaptive loss constraints to preserve information in salient and non-salient regions. Quantitative results on CT/MRI, SPECT/MRI, and PET/MRI fusion tasks show that SAFFusion outperforms state-of-the-art methods. Qualitative evaluations confirm that SAFFusion effectively merges prominent intensity features and rich textures from multiple sources.
医学图像融合整合来自多模态医学图像的互补信息,为临床决策提供全面参考,如阿尔茨海默病的诊断以及脑肿瘤的检测与分割。尽管传统的和基于深度学习的融合方法已得到广泛研究,但它们往往未能设计出充分利用不同区域或特定特征信息的针对性策略。本文提出了SAFFusion,一种显著感知频率融合网络,它整合了多模态医学图像的强度和纹理线索。我们首先引入Mamba-UNet,一种由Mamba设计增强的多尺度编码器-解码器架构,以改进特征提取和图像重建中的全局建模。通过在Mamba-UNet中采用轮廓波变换,我们用多尺度表示取代传统池化,并将空间特征分解为高频和低频子带。然后,一个双分支频率特征融合模块根据这些频率子带的不同特征融合跨模态信息。此外,我们应用潜在低秩表示(LatLRR)来评估图像显著性,并实施自适应损失约束以保留显著和非显著区域的信息。在CT/MRI、SPECT/MRI和PET/MRI融合任务上的定量结果表明,SAFFusion优于现有方法。定性评估证实,SAFFusion有效地融合了来自多个源的突出强度特征和丰富纹理。