• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

UIU-Net:用于红外小目标检测的U-Net嵌套U-Net结构

UIU-Net: U-Net in U-Net for Infrared Small Object Detection.

作者信息

Wu Xin, Hong Danfeng, Chanussot Jocelyn

出版信息

IEEE Trans Image Process. 2023;32:364-376. doi: 10.1109/TIP.2022.3228497. Epub 2022 Dec 21.

DOI:10.1109/TIP.2022.3228497
PMID:37015404
Abstract

Learning-based infrared small object detection methods currently rely heavily on the classification backbone network. This tends to result in tiny object loss and feature distinguishability limitations as the network depth increases. Furthermore, small objects in infrared images are frequently emerged bright and dark, posing severe demands for obtaining precise object contrast information. For this reason, we in this paper propose a simple and effective "U-Net in U-Net" framework, UIU-Net for short, and detect small objects in infrared images. As the name suggests, UIU-Net embeds a tiny U-Net into a larger U-Net backbone, enabling the multi-level and multi-scale representation learning of objects. Moreover, UIU-Net can be trained from scratch, and the learned features can enhance global and local contrast information effectively. More specifically, the UIU-Net model is divided into two modules: the resolution-maintenance deep supervision (RM-DS) module and the interactive-cross attention (IC-A) module. RM-DS integrates Residual U-blocks into a deep supervision network to generate deep multi-scale resolution-maintenance features while learning global context information. Further, IC-A encodes the local context information between the low-level details and high-level semantic features. Extensive experiments conducted on two infrared single-frame image datasets, i.e., SIRST and Synthetic datasets, show the effectiveness and superiority of the proposed UIU-Net in comparison with several state-of-the-art infrared small object detection methods. The proposed UIU-Net also produces powerful generalization performance for video sequence infrared small object datasets, e.g., ATR ground/air video sequence dataset. The codes of this work are available openly at https://github.com/danfenghong/IEEE.

摘要

基于学习的红外小目标检测方法目前严重依赖分类主干网络。随着网络深度的增加,这往往会导致小目标丢失和特征可区分性受限。此外,红外图像中的小目标经常出现明暗变化,对获取精确的目标对比度信息提出了严峻要求。因此,我们在本文中提出了一种简单有效的“U-Net in U-Net”框架,简称为UIU-Net,用于检测红外图像中的小目标。顾名思义,UIU-Net将一个小的U-Net嵌入到一个更大的U-Net主干中,实现目标的多层次和多尺度表示学习。此外,UIU-Net可以从头开始训练,学习到的特征可以有效地增强全局和局部对比度信息。更具体地说,UIU-Net模型分为两个模块:分辨率保持深度监督(RM-DS)模块和交互式交叉注意力(IC-A)模块。RM-DS将残差U块集成到深度监督网络中,在学习全局上下文信息的同时生成深度多尺度分辨率保持特征。此外,IC-A对低级细节和高级语义特征之间的局部上下文信息进行编码。在两个红外单帧图像数据集,即SIRST和合成数据集上进行的大量实验表明,与几种先进的红外小目标检测方法相比,所提出的UIU-Net具有有效性和优越性。所提出的UIU-Net在视频序列红外小目标数据集,例如ATR地面/空中视频序列数据集上也具有强大的泛化性能。这项工作的代码可在https://github.com/danfenghong/IEEE上公开获取。

相似文献

1
UIU-Net: U-Net in U-Net for Infrared Small Object Detection.UIU-Net:用于红外小目标检测的U-Net嵌套U-Net结构
IEEE Trans Image Process. 2023;32:364-376. doi: 10.1109/TIP.2022.3228497. Epub 2022 Dec 21.
2
IMD-Net: Interpretable multi-scale detection network for infrared dim and small objects.IMD-Net:用于红外弱小目标的可解释多尺度检测网络。
Math Biosci Eng. 2024 Jan 2;21(1):1712-1737. doi: 10.3934/mbe.2024074.
3
EMCAH-Net: an effective multi-scale context aggregation hybrid network for medical image segmentation.EMCAH-Net:一种用于医学图像分割的高效多尺度上下文聚合混合网络。
Quant Imaging Med Surg. 2025 Apr 1;15(4):3064-3083. doi: 10.21037/qims-24-1983. Epub 2025 Mar 28.
4
CSCA U-Net: A channel and space compound attention CNN for medical image segmentation.CSCA U-Net:一种用于医学图像分割的通道和空间联合注意力卷积神经网络。
Artif Intell Med. 2024 Apr;150:102800. doi: 10.1016/j.artmed.2024.102800. Epub 2024 Feb 14.
5
Dense Nested Attention Network for Infrared Small Target Detection.用于红外小目标检测的密集嵌套注意力网络
IEEE Trans Image Process. 2023;32:1745-1758. doi: 10.1109/TIP.2022.3199107. Epub 2023 Mar 14.
6
Water body extraction from high spatial resolution remote sensing images based on enhanced U-Net and multi-scale information fusion.基于增强型U-Net和多尺度信息融合的高空间分辨率遥感影像水体提取
Sci Rep. 2024 Jul 12;14(1):16132. doi: 10.1038/s41598-024-67113-7.
7
UCR-Net: U-shaped context residual network for medical image segmentation.UCR-Net:用于医学图像分割的U型上下文残差网络。
Comput Biol Med. 2022 Dec;151(Pt A):106203. doi: 10.1016/j.compbiomed.2022.106203. Epub 2022 Oct 18.
8
S-Net: A novel shallow network for enhanced detail retention in medical image segmentation.S-Net:一种用于在医学图像分割中增强细节保留的新型浅层网络。
Comput Methods Programs Biomed. 2025 Jun;265:108730. doi: 10.1016/j.cmpb.2025.108730. Epub 2025 Mar 20.
9
Short Circuit Recognition for Metal Electrorefining Using an Improved Faster R-CNN With Synthetic Infrared Images.使用带有合成红外图像的改进型更快区域卷积神经网络(Faster R-CNN)进行金属电解精炼的短路识别
Front Neurorobot. 2021 Nov 26;15:751037. doi: 10.3389/fnbot.2021.751037. eCollection 2021.
10
MADR-Net: multi-level attention dilated residual neural network for segmentation of medical images.MADR-Net:用于医学图像分割的多层次注意扩张残差神经网络。
Sci Rep. 2024 Jun 3;14(1):12699. doi: 10.1038/s41598-024-63538-2.

引用本文的文献

1
A lightweight small object detection model for UAV images based on deep semantic integration.一种基于深度语义融合的无人机图像轻量级小目标检测模型。
Sci Rep. 2025 Aug 29;15(1):31888. doi: 10.1038/s41598-025-16878-6.
2
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
3
VBM-YOLO: an enhanced YOLO model with reduced information loss for vehicle body markers detection.
VBM-YOLO:一种用于车身标记检测的信息损失减少的增强型YOLO模型。
PeerJ Comput Sci. 2025 Jun 2;11:e2932. doi: 10.7717/peerj-cs.2932. eCollection 2025.
4
Segmentation-based lightweight multi-class classification model for crop disease detection, classification, and severity assessment using DCNN.基于分割的轻量级多类分类模型,用于利用深度卷积神经网络进行作物病害检测、分类和严重程度评估。
PLoS One. 2025 May 14;20(5):e0322705. doi: 10.1371/journal.pone.0322705. eCollection 2025.
5
Optimizing Satellite Imagery Datasets for Enhanced Land/Water Segmentation.优化卫星图像数据集以增强陆地/水体分割
Sensors (Basel). 2025 Mar 13;25(6):1793. doi: 10.3390/s25061793.
6
High-Concentration Time-Frequency Representation and Instantaneous Frequency Estimation of Frequency-Crossing Signals.频率交叉信号的高浓度时频表示与瞬时频率估计
Sensors (Basel). 2025 Mar 24;25(7):2030. doi: 10.3390/s25072030.
7
Augmenting atmospheric turbulence effects on thermal-adapted deep object detection models.增强大气湍流对热适应深度目标检测模型的影响。
Sci Rep. 2025 Mar 22;15(1):9900. doi: 10.1038/s41598-025-86830-1.
8
Infrared Small Target Detection Algorithm Based on Improved Dense Nested U-Net Network.基于改进型密集嵌套U-Net网络的红外小目标检测算法
Sensors (Basel). 2025 Jan 29;25(3):814. doi: 10.3390/s25030814.
9
SSATNet: Spectral-spatial attention transformer for hyperspectral corn image classification.SSATNet:用于高光谱玉米图像分类的光谱-空间注意力变换器
Front Plant Sci. 2025 Jan 16;15:1458978. doi: 10.3389/fpls.2024.1458978. eCollection 2024.
10
MSD-Net: Multi-scale dense convolutional neural network for photoacoustic image reconstruction with sparse data.MSD-Net:用于稀疏数据光声图像重建的多尺度密集卷积神经网络
Photoacoustics. 2024 Dec 12;41:100679. doi: 10.1016/j.pacs.2024.100679. eCollection 2025 Feb.