• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于图像融合的任务引导、隐式搜索和元初始化深度模型。

A Task-Guided, Implicitly-Searched and Meta-Initialized Deep Model for Image Fusion.

作者信息

Liu Risheng, Liu Zhu, Liu Jinyuan, Fan Xin, Luo Zhongxuan

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Oct;46(10):6594-6609. doi: 10.1109/TPAMI.2024.3382308. Epub 2024 Sep 5.

DOI:10.1109/TPAMI.2024.3382308
PMID:38536690
Abstract

Image fusion plays a key role in a variety of multi-sensor-based vision systems, especially for enhancing visual quality and/or extracting aggregated features for perception. However, most existing methods just consider image fusion as an individual task, thus ignoring its underlying relationship with these downstream vision problems. Furthermore, designing proper fusion architectures often requires huge engineering labor. It also lacks mechanisms to improve the flexibility and generalization ability of current fusion approaches. To mitigate these issues, we establish a Task-guided, Implicit-searched and Meta-initialized (TIM) deep model to address the image fusion problem in a challenging real-world scenario. Specifically, we first propose a constrained strategy to incorporate information from downstream tasks to guide the unsupervised learning process of image fusion. Within this framework, we then design an implicit search scheme to automatically discover compact architectures for our fusion model with high efficiency. In addition, a pretext meta initialization technique is introduced to leverage divergence fusion data to support fast adaptation for different kinds of image fusion tasks. Qualitative and quantitative experimental results on different categories of image fusion problems and related downstream tasks (e.g., visual enhancement and semantic understanding) substantiate the flexibility and effectiveness of our TIM.

摘要

图像融合在各种基于多传感器的视觉系统中起着关键作用,特别是在提高视觉质量和/或提取用于感知的聚合特征方面。然而,大多数现有方法仅将图像融合视为一项单独的任务,从而忽略了它与这些下游视觉问题的潜在关系。此外,设计合适的融合架构通常需要大量的工程劳动。它还缺乏提高当前融合方法的灵活性和泛化能力的机制。为了缓解这些问题,我们建立了一个任务引导、隐式搜索和元初始化(TIM)深度模型,以解决具有挑战性的现实场景中的图像融合问题。具体来说,我们首先提出一种约束策略,将来自下游任务的信息纳入其中,以指导图像融合的无监督学习过程。在此框架内,我们接着设计一种隐式搜索方案,以高效地自动发现我们融合模型的紧凑架构。此外,引入了一种 pretext 元初始化技术,以利用差异融合数据来支持对不同类型图像融合任务的快速适应。在不同类别的图像融合问题和相关下游任务(例如视觉增强和语义理解)上的定性和定量实验结果证实了我们的 TIM 的灵活性和有效性。

相似文献

1
A Task-Guided, Implicitly-Searched and Meta-Initialized Deep Model for Image Fusion.一种用于图像融合的任务引导、隐式搜索和元初始化深度模型。
IEEE Trans Pattern Anal Mach Intell. 2024 Oct;46(10):6594-6609. doi: 10.1109/TPAMI.2024.3382308. Epub 2024 Sep 5.
2
Learning With Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision.基于嵌套场景建模和协同架构搜索的低光照视觉学习
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5953-5969. doi: 10.1109/TPAMI.2022.3212995. Epub 2023 Apr 3.
3
A Multi-Task Fusion Strategy-Based Decision-Making and Planning Method for Autonomous Driving Vehicles.一种基于多任务融合策略的自动驾驶车辆决策与规划方法
Sensors (Basel). 2023 Aug 8;23(16):7021. doi: 10.3390/s23167021.
4
A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion.基于语义感知的多分支多尺度神经网络用于多模态医学图像融合。
Sci Rep. 2024 Jul 30;14(1):17609. doi: 10.1038/s41598-024-68183-3.
5
Semantic-Aware Fusion Network Based on Super-Resolution.基于超分辨率的语义感知融合网络
Sensors (Basel). 2024 Jun 5;24(11):3665. doi: 10.3390/s24113665.
6
Reducing annotation burden in MR: A novel MR-contrast guided contrastive learning approach for image segmentation.减少磁共振成像中的标注负担:一种新的基于磁共振对比引导的对比学习方法用于图像分割。
Med Phys. 2024 Apr;51(4):2707-2720. doi: 10.1002/mp.16820. Epub 2023 Nov 13.
7
An Interactive Image Segmentation Method Based on Multi-Level Semantic Fusion.一种基于多级语义融合的交互式图像分割方法。
Sensors (Basel). 2023 Jul 14;23(14):6394. doi: 10.3390/s23146394.
8
Multi-View Saliency-Guided Clustering for Image Cosegmentation.用于图像协同分割的多视图显著度引导聚类
IEEE Trans Image Process. 2019 May 8. doi: 10.1109/TIP.2019.2913555.
9
Unsupervised Deep Image Fusion with Structure Tensor Representations.基于结构张量表示的无监督深度图像融合
IEEE Trans Image Process. 2020 Jan 17. doi: 10.1109/TIP.2020.2966075.
10
Unsupervised Test-Time Adaptation Learning for Effective Hyperspectral Image Super-Resolution With Unknown Degeneration.用于未知退化情况下有效高光谱图像超分辨率的无监督测试时自适应学习
IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):5008-5025. doi: 10.1109/TPAMI.2024.3361894. Epub 2024 Jun 5.

引用本文的文献

1
A novel multimodel medical image fusion framework with edge enhancement and cross-scale transformer.一种具有边缘增强和跨尺度变压器的新型多模态医学图像融合框架。
Sci Rep. 2025 Apr 4;15(1):11657. doi: 10.1038/s41598-025-93616-y.
2
Multi-Harmonic Nonlinear Ultrasonic Fusion with Deep Learning for Subtle Parameter Identification of Micro-Crack Groups.基于深度学习的多谐波非线性超声融合用于微裂纹群细微参数识别
Sensors (Basel). 2025 Feb 13;25(4):1152. doi: 10.3390/s25041152.
3
MGFusion: a multimodal large language model-guided information perception for infrared and visible image fusion.
MGFusion:一种用于红外与可见光图像融合的多模态大语言模型引导的信息感知方法
Front Neurorobot. 2024 Dec 23;18:1521603. doi: 10.3389/fnbot.2024.1521603. eCollection 2024.
4
SharDif: Sharing and Differential Learning for Image Fusion.SharDif:用于图像融合的共享与差异学习
Entropy (Basel). 2024 Jan 9;26(1):57. doi: 10.3390/e26010057.