• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

空中红外生成对抗网络(AerialIRGAN):具有双编码器结构的无配对空中可见光到红外图像转换

AerialIRGAN: unpaired aerial visible-to-infrared image translation with dual-encoder structure.

作者信息

Ma Decao, Su Juan, Li Shaopeng, Xian Yong

机构信息

Xi'an Research Institute of High Technology, 710025, Xi'an, China.

Department of Automation, Tsinghua University, 100084, Beijing, China.

出版信息

Sci Rep. 2024 Sep 27;14(1):22105. doi: 10.1038/s41598-024-73381-0.

DOI:10.1038/s41598-024-73381-0
PMID:39333306
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11436762/
Abstract

Due to the high cost of equipment and the constraints of shooting conditions, obtaining aerial infrared images of specific targets is very challenging. Most methods using Generative Adversarial Networks for translating visible images to infrared greatly depend on registered data and struggle to handle the diversity and complexity of scenes in aerial infrared targets. This paper proposes a one side end-to-end unpaired aerial visible-to-infrared image translation algorithm, termed AerialIRGAN. AerialIRGAN introduces a dual-encoder structure, where one encoder is designed based on the Segment Anything Model to extract deep semantic features from visible images, and the other encoder is designed based on UniRepLKNet to capture small-scale patterns and sparse patterns from visible images. Subsequently, AerialIRGAN constructs a bridging module to deeply integrate the features of both encoders and their corresponding decoders. Finally, AerialIRGAN proposes a structural appearance consistency loss to guide the synthetic infrared images to maintain the structure of the source image while possessing distinct infrared characteristics. The experimental results show that compared to the existing typical infrared image generation algorithms, the proposed method can generate higher-quality infrared images and achieve better performance in both subjective visual description and objective metric evaluation.

摘要

由于设备成本高昂以及拍摄条件的限制,获取特定目标的航空红外图像极具挑战性。大多数使用生成对抗网络将可见光图像转换为红外图像的方法严重依赖于配准数据,并且难以处理航空红外目标场景的多样性和复杂性。本文提出了一种单侧端到端无配对的航空可见光到红外图像转换算法,称为AerialIRGAN。AerialIRGAN引入了一种双编码器结构,其中一个编码器基于“分割一切模型”(Segment Anything Model)设计,用于从可见光图像中提取深度语义特征,另一个编码器基于UniRepLKNet设计,用于从可见光图像中捕捉小规模模式和稀疏模式。随后,AerialIRGAN构建了一个桥接模块,以深度整合两个编码器及其相应解码器的特征。最后,AerialIRGAN提出了一种结构外观一致性损失,以引导合成红外图像在保持源图像结构的同时具备独特的红外特征。实验结果表明,与现有的典型红外图像生成算法相比,该方法能够生成更高质量的红外图像,并且在主观视觉描述和客观指标评估方面均取得了更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/ae57eaed7158/41598_2024_73381_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/e85886313041/41598_2024_73381_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/6511b7d5ad2b/41598_2024_73381_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/856b17db7d34/41598_2024_73381_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/548f21dcf545/41598_2024_73381_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/cea3a2554718/41598_2024_73381_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/fce89db1cdad/41598_2024_73381_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/10685aa45564/41598_2024_73381_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/ae57eaed7158/41598_2024_73381_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/e85886313041/41598_2024_73381_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/6511b7d5ad2b/41598_2024_73381_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/856b17db7d34/41598_2024_73381_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/548f21dcf545/41598_2024_73381_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/cea3a2554718/41598_2024_73381_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/fce89db1cdad/41598_2024_73381_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/10685aa45564/41598_2024_73381_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3228/11436762/ae57eaed7158/41598_2024_73381_Fig8_HTML.jpg

相似文献

1
AerialIRGAN: unpaired aerial visible-to-infrared image translation with dual-encoder structure.空中红外生成对抗网络(AerialIRGAN):具有双编码器结构的无配对空中可见光到红外图像转换
Sci Rep. 2024 Sep 27;14(1):22105. doi: 10.1038/s41598-024-73381-0.
2
A Generative Adversarial Network for Infrared and Visible Image Fusion Based on Semantic Segmentation.基于语义分割的红外与可见光图像融合生成对抗网络
Entropy (Basel). 2021 Mar 21;23(3):376. doi: 10.3390/e23030376.
3
Low-light image enhancement using generative adversarial networks.使用生成对抗网络的低光照图像增强
Sci Rep. 2024 Aug 9;14(1):18489. doi: 10.1038/s41598-024-69505-1.
4
Infrared-Visible Image Fusion Based on Semantic Guidance and Visual Perception.基于语义引导和视觉感知的红外-可见光图像融合
Entropy (Basel). 2022 Sep 21;24(10):1327. doi: 10.3390/e24101327.
5
DRSNFuse: Deep Residual Shrinkage Network for Infrared and Visible Image Fusion.DRSNFuse:用于红外与可见光图像融合的深度残差收缩网络。
Sensors (Basel). 2022 Jul 8;22(14):5149. doi: 10.3390/s22145149.
6
Nighttime road scene image enhancement based on cycle-consistent generative adversarial network.基于循环一致生成对抗网络的夜间道路场景图像增强
Sci Rep. 2024 Jun 22;14(1):14375. doi: 10.1038/s41598-024-65270-3.
7
SCFusion: Infrared and Visible Fusion Based on Salient Compensation.SCFusion:基于显著补偿的红外与可见光融合
Entropy (Basel). 2023 Jun 27;25(7):985. doi: 10.3390/e25070985.
8
MJ-GAN: Generative Adversarial Network with Multi-Grained Feature Extraction and Joint Attention Fusion for Infrared and Visible Image Fusion.MJ-GAN:用于红外与可见光图像融合的具有多粒度特征提取和联合注意力融合的生成对抗网络
Sensors (Basel). 2023 Jul 12;23(14):6322. doi: 10.3390/s23146322.
9
Visible-Image-Assisted Nonuniformity Correction of Infrared Images Using the GAN with SEBlock.使用带 SEBlock 的 GAN 进行可见图像辅助的红外图像非均匀性校正。
Sensors (Basel). 2023 Mar 20;23(6):3282. doi: 10.3390/s23063282.
10
Advancing infrared and visible image fusion with an enhanced multiscale encoder and attention-based networks.利用增强型多尺度编码器和基于注意力的网络推进红外与可见光图像融合
iScience. 2024 Sep 10;27(10):110915. doi: 10.1016/j.isci.2024.110915. eCollection 2024 Oct 18.

引用本文的文献

1
Detection of low-altitude infrared small targets for UAVs using a density-based artificial bee colony algorithm.基于密度的人工蜂群算法用于无人机低空红外小目标检测
Sci Rep. 2025 Jul 2;15(1):23344. doi: 10.1038/s41598-025-06070-1.

本文引用的文献

1
Diffusion Models in Vision: A Survey.视觉中的扩散模型:综述
IEEE Trans Pattern Anal Mach Intell. 2023 Sep;45(9):10850-10869. doi: 10.1109/TPAMI.2023.3261988. Epub 2023 Aug 7.
2
GiT: Graph Interactive Transformer for Vehicle Re-Identification.GiT:用于车辆重新识别的图交互Transformer
IEEE Trans Image Process. 2023;32:1039-1051. doi: 10.1109/TIP.2023.3238642. Epub 2023 Feb 7.
3
A Review of Modern Thermal Imaging Sensor Technology and Applications for Autonomous Aerial Navigation.现代热成像传感器技术及其在自主空中导航中的应用综述
J Imaging. 2021 Oct 19;7(10):217. doi: 10.3390/jimaging7100217.
4
Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping.用于单边无监督域映射的几何一致生成对抗网络
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2019 Jun;2019:2422-2431. doi: 10.1109/cvpr.2019.00253. Epub 2020 Jan 9.