• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于可见光和红外无人机目标图像配准的通用跨模态配准框架。

General cross-modality registration framework for visible and infrared UAV target image registration.

作者信息

Luo Yu, Cha Hao, Zuo Lei, Cheng Peng, Zhao Qing

机构信息

College of Electronic Engineering, Naval University of Engineering, Wuhan, 4300000, China.

出版信息

Sci Rep. 2023 Aug 9;13(1):12941. doi: 10.1038/s41598-023-39863-3.

DOI:10.1038/s41598-023-39863-3
PMID:37558713
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10412594/
Abstract

In all-day-all-weather tasks, well-aligned multi-modality images pairs can provide extensive complementary information for image-guided UAV target detection. However, multi-modality images in real scenarios are often misaligned, and images registration is extremely difficult due to spatial deformation and the difficulty narrowing cross-modality discrepancy. To better overcome the obstacle, in this paper, we construct a General Cross-Modality Registration (GCMR) Framework, which explores generation registration pattern to simplify the cross-modality image registration into a easier mono-modality image registration with an Image Cross-Modality Translation Network (ICMTN) module and a Multi-level Residual Dense Registration Network (MRDRN). Specifically, ICMTN module is used to generate a pseudo infrared image taking a visible image as input and correct the distortion of structural information during the translation of image modalities. Benefiting from the favorable geometry correct ability of the ICMTN, we further employs MRDRN module which can fully extract and exploit the mutual information of misaligned images to better registered Visible and Infrared image in a mono-modality setting. We evaluate five variants of our approach on the public Anti-UAV datasets. The extensive experimental results demonstrate that the proposed architecture achieves state-of-the-art performance.

摘要

在全天候任务中,对齐良好的多模态图像对可为图像引导的无人机目标检测提供广泛的补充信息。然而,实际场景中的多模态图像往往未对齐,并且由于空间变形和缩小跨模态差异的困难,图像配准极具挑战性。为了更好地克服这一障碍,在本文中,我们构建了一个通用跨模态配准(GCMR)框架,该框架探索生成配准模式,通过图像跨模态转换网络(ICMTN)模块和多级残差密集配准网络(MRDRN)将跨模态图像配准简化为更简单的单模态图像配准。具体而言,ICMTN模块用于以可见光图像为输入生成伪红外图像,并在图像模态转换过程中校正结构信息的失真。受益于ICMTN良好的几何校正能力,我们进一步采用MRDRN模块,该模块可以充分提取和利用未对齐图像的互信息,以便在单模态设置中更好地配准可见光和红外图像。我们在公开的反无人机数据集上评估了我们方法的五个变体。大量实验结果表明,所提出的架构实现了最优性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/e48b8d1a4804/41598_2023_39863_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/bc6c0fdb2686/41598_2023_39863_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/08ff59e1592c/41598_2023_39863_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/4b4c181a2cce/41598_2023_39863_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/2ae5bc8f2909/41598_2023_39863_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/3e9e87dcf0ed/41598_2023_39863_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/1dc9b2fd027b/41598_2023_39863_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/9eff2ff36a03/41598_2023_39863_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/e48b8d1a4804/41598_2023_39863_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/bc6c0fdb2686/41598_2023_39863_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/08ff59e1592c/41598_2023_39863_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/4b4c181a2cce/41598_2023_39863_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/2ae5bc8f2909/41598_2023_39863_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/3e9e87dcf0ed/41598_2023_39863_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/1dc9b2fd027b/41598_2023_39863_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/9eff2ff36a03/41598_2023_39863_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/445f/10412594/e48b8d1a4804/41598_2023_39863_Fig8_HTML.jpg

相似文献

1
General cross-modality registration framework for visible and infrared UAV target image registration.用于可见光和红外无人机目标图像配准的通用跨模态配准框架。
Sci Rep. 2023 Aug 9;13(1):12941. doi: 10.1038/s41598-023-39863-3.
2
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
3
Translation, Association and Augmentation: Learning Cross-Modality Re-Identification From Single-Modality Annotation.翻译、关联与增强:从单模态标注中学习跨模态重新识别
IEEE Trans Image Process. 2023;32:5099-5113. doi: 10.1109/TIP.2023.3310338. Epub 2023 Sep 12.
4
Reliable multi-modal medical image-to-image translation independent of pixel-wise aligned data.可靠的多模态医学图像到图像的转换,不依赖于像素对齐的数据。
Med Phys. 2024 Nov;51(11):8283-8301. doi: 10.1002/mp.17362. Epub 2024 Aug 17.
5
Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN.基于空间变换 CNN 的单模态和跨模态近似重复图像对检测。
Sensors (Basel). 2021 Jan 2;21(1):255. doi: 10.3390/s21010255.
6
Image synthesis-based multi-modal image registration framework by using deep fully convolutional networks.基于图像合成的多模态图像深度全卷积网络配准框架。
Med Biol Eng Comput. 2019 May;57(5):1037-1048. doi: 10.1007/s11517-018-1924-y. Epub 2018 Dec 7.
7
Efficient Cross-Modality Insulator Augmentation for Multi-Domain Insulator Defect Detection in UAV Images.用于无人机图像中多域绝缘子缺陷检测的高效跨模态绝缘子增强技术
Sensors (Basel). 2024 Jan 10;24(2):428. doi: 10.3390/s24020428.
8
MRI Cross-Modality Image-to-Image Translation.MRI 跨模态图像到图像翻译。
Sci Rep. 2020 Feb 28;10(1):3753. doi: 10.1038/s41598-020-60520-6.
9
Graph Sampling-Based Multi-Stream Enhancement Network for Visible-Infrared Person Re-Identification.基于图采样的多流增强网络用于可见光-红外行人重识别
Sensors (Basel). 2023 Sep 18;23(18):7948. doi: 10.3390/s23187948.
10
Geometry-Consistent Adversarial Registration Model for Unsupervised Multi-Modal Medical Image Registration.基于几何一致性对抗的无监督多模态医学图像配准方法。
IEEE J Biomed Health Inform. 2023 Jul;27(7):3455-3466. doi: 10.1109/JBHI.2023.3270199. Epub 2023 Jun 30.

本文引用的文献

1
Is image-to-image translation the panacea for multimodal image registration? A comparative study.图像到图像的翻译是否是多模态图像配准的万能药?一项对比研究。
PLoS One. 2022 Nov 28;17(11):e0276196. doi: 10.1371/journal.pone.0276196. eCollection 2022.
2
Synth-by-Reg (SbR): Contrastive learning for synthesis-based registration of paired images.基于合成的配准对比学习(SbR):用于成对图像基于合成的配准的对比学习。
Simul Synth Med Imaging. 2021 Sep;12965:44-54. doi: 10.1007/978-3-030-87592-3_5. Epub 2021 Sep 21.
3
A Bilevel Integrated Model With Data-Driven Layer Ensemble for Multi-Modality Image Fusion.
一种基于数据驱动层集成的多模态图像融合双水平集成模型。
IEEE Trans Image Process. 2021;30:1261-1274. doi: 10.1109/TIP.2020.3043125. Epub 2020 Dec 21.
4
Deep Learning based Inter-Modality Image Registration Supervised by Intra-Modality Similarity.基于深度学习的模态内相似性监督的跨模态图像配准
Mach Learn Med Imaging. 2018 Sep;11046:55-63. doi: 10.1007/978-3-030-00919-9_7. Epub 2018 Sep 15.
5
Normalized mutual information feature selection.归一化互信息特征选择
IEEE Trans Neural Netw. 2009 Feb;20(2):189-201. doi: 10.1109/TNN.2008.2005601. Epub 2009 Jan 13.
6
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.