Zheng Junze, Xiao Junyan, Wang Yaowei, Zhang Xuming
Department of Biomedical Engineering, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
Sensors (Basel). 2024 May 30;24(11):3545. doi: 10.3390/s24113545.
Multi-modal medical image fusion (MMIF) is crucial for disease diagnosis and treatment because the images reconstructed from signals collected by different sensors can provide complementary information. In recent years, deep learning (DL) based methods have been widely used in MMIF. However, these methods often adopt a serial fusion strategy without feature decomposition, causing error accumulation and confusion of characteristics across different scales. To address these issues, we propose the Coupled Image Reconstruction and Fusion (CIRF) strategy. Our method runs the image fusion and reconstruction branches in parallel, linked by a common encoder. First, CIRF uses the lightweight encoder to extract base and detail features through the Vision Transformer (ViT) and Convolutional Neural Network (CNN) branches, respectively, where the two branches interact to supplement each other's information. Then, the two types of features are fused separately via different blocks and finally decoded into the fusion result. The loss function includes both the supervised loss from the reconstruction branch and the unsupervised loss from the fusion branch. As a whole, CIRF increases its expressivity by adding multi-task learning and feature decomposition. Additionally, we explore the impact of image masking on the network's feature extraction ability and validate the generalization capability of the model. Experiments on three datasets demonstrate, both subjectively and objectively, that the images fused by CIRF exhibit appropriate brightness and smooth edge transitions, with more competitive evaluation metrics than those fused by several other traditional and DL-based methods.
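The coupled design described in the abstract can be sketched in code: a shared encoder decomposes each input into "base" features (global context, via a ViT-like attention branch) and "detail" features (local texture, via a CNN branch); the fusion branch merges features from two modalities while the reconstruction branch decodes single-modality features back to the input, so both a supervised reconstruction loss and an unsupervised fusion loss can be applied. All module sizes, the max-fusion rule, and the loss terms below are illustrative assumptions, not the authors' implementation.

```python
# Minimal PyTorch sketch of a CIRF-style coupled encoder/decoder (assumed design).
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedEncoder(nn.Module):
    def __init__(self, ch=16):
        super().__init__()
        # Detail branch: small CNN capturing local edges/texture.
        self.detail = nn.Sequential(
            nn.Conv2d(1, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, padding=1),
        )
        # Base branch: lightweight ViT stand-in (self-attention over 8x8 patches).
        self.patch = nn.Conv2d(1, ch, kernel_size=8, stride=8)  # patch embedding
        self.attn = nn.MultiheadAttention(ch, num_heads=2, batch_first=True)
        self.up = nn.Upsample(scale_factor=8, mode="nearest")

    def forward(self, x):
        d = self.detail(x)                           # B, ch, H, W
        p = self.patch(x)                            # B, ch, H/8, W/8
        b, c, h, w = p.shape
        tokens = p.flatten(2).transpose(1, 2)        # B, N, ch
        tokens, _ = self.attn(tokens, tokens, tokens)
        base = self.up(tokens.transpose(1, 2).reshape(b, c, h, w))
        return base, d

class Decoder(nn.Module):
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2 * ch, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, 1, 3, padding=1),
        )

    def forward(self, base, detail):
        return self.net(torch.cat([base, detail], dim=1))

enc, dec = SharedEncoder(), Decoder()
a = torch.rand(1, 1, 64, 64)   # modality 1 (e.g. an MRI slice)
b = torch.rand(1, 1, 64, 64)   # modality 2 (e.g. a PET slice)
base_a, det_a = enc(a)
base_b, det_b = enc(b)
# Fusion branch: base and detail features fused separately, then decoded.
fused = dec(torch.max(base_a, base_b), torch.max(det_a, det_b))
# Reconstruction branch: the same encoder/decoder reconstructs one input.
recon = dec(base_a, det_a)
# Supervised reconstruction loss + a placeholder unsupervised fusion loss.
loss = F.l1_loss(recon, a) + F.l1_loss(fused, 0.5 * (a + b))
```

Sharing the encoder between the two branches is what couples the tasks: gradients from the supervised reconstruction loss regularize the same features the fusion branch consumes.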