• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于高分辨率特征增强语义分割网络的红外飞机检测算法

Infrared Aircraft Detection Algorithm Based on High-Resolution Feature-Enhanced Semantic Segmentation Network.

作者信息

Liu Gang, Xi Jiangtao, Ma Chao, Chen Huixiang

机构信息

College of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China.

School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Wollongong, NSW 2522, Australia.

出版信息

Sensors (Basel). 2024 Dec 11;24(24):7933. doi: 10.3390/s24247933.

DOI:10.3390/s24247933
PMID:39771670
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11678999/
Abstract

In order to achieve infrared aircraft detection under interference conditions, this paper proposes an infrared aircraft detection algorithm based on high-resolution feature-enhanced semantic segmentation network. Firstly, the designed location attention mechanism is utilized to enhance the current-level feature map by obtaining correlation weights between pixels at different positions. Then, it is fused with the high-level feature map rich in semantic features to construct a location attention feature fusion network, thereby enhancing the representation capability of target features. Secondly, based on the idea of using dilated convolutions to expand the receptive field of feature maps, a hybrid atrous spatial pyramid pooling module is designed. By utilizing a serial structure of dilated convolutions with small dilation rates, this module addresses the issue of feature information loss when expanding the receptive field through dilated spatial pyramid pooling. It captures the contextual information of the target, further enhancing the target features. Finally, a dice loss function is introduced to calculate the overlap between the predicted results and the ground truth labels, facilitating deep excavation of foreground information for comprehensive learning of samples. This paper constructs an infrared aircraft detection algorithm based on a high-resolution feature-enhanced semantic segmentation network which combines the location attention feature fusion network, the hybrid atrous spatial pyramid pooling module, the dice loss function, and a network that maintains the resolution of feature maps. Experiments conducted on a self-built infrared dataset show that the proposed algorithm achieves a mean intersection over union (mIoU) of 92.74%, a mean pixel accuracy (mPA) of 96.34%, and a mean recall (MR) of 96.19%, all of which outperform classic segmentation algorithms such as DeepLabv3+, Segformer, HRNetv2, and DDRNet. This demonstrates that the proposed algorithm can achieve effective detection of infrared aircraft in the presence of interference.

摘要

为了在干扰条件下实现红外飞机检测,本文提出了一种基于高分辨率特征增强语义分割网络的红外飞机检测算法。首先,利用设计的位置注意力机制,通过获取不同位置像素之间的相关权重来增强当前层特征图。然后,将其与富含语义特征的高层特征图融合,构建位置注意力特征融合网络,从而增强目标特征的表示能力。其次,基于使用空洞卷积扩大特征图感受野的思想,设计了一种混合空洞空间金字塔池化模块。该模块通过利用小扩张率的空洞卷积串行结构,解决了通过空洞空间金字塔池化扩大感受野时特征信息丢失的问题。它捕获目标的上下文信息,进一步增强目标特征。最后,引入骰子损失函数来计算预测结果与真实标签之间的重叠,便于深入挖掘前景信息以全面学习样本。本文构建了一种基于高分辨率特征增强语义分割网络的红外飞机检测算法,该网络结合了位置注意力特征融合网络、混合空洞空间金字塔池化模块、骰子损失函数以及保持特征图分辨率的网络。在自建红外数据集上进行的实验表明,所提算法的平均交并比(mIoU)达到92.74%,平均像素精度(mPA)达到96.34%,平均召回率(MR)达到96.19%,均优于DeepLabv3+、Segformer、HRNetv2和DDRNet等经典分割算法。这表明所提算法能够在存在干扰的情况下实现对红外飞机的有效检测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/19e0e55fc633/sensors-24-07933-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/687f34d128a2/sensors-24-07933-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/10426f660b55/sensors-24-07933-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/29ecd6b4589a/sensors-24-07933-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/b39c376fe7d9/sensors-24-07933-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/209de74f6ba2/sensors-24-07933-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/43689c59820e/sensors-24-07933-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/7b8a995851e8/sensors-24-07933-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/b0b9fecc61ce/sensors-24-07933-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/527a9f08045b/sensors-24-07933-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/a7c81f09e550/sensors-24-07933-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/feb8d9971453/sensors-24-07933-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/605a5209d1f1/sensors-24-07933-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/fc7f0880cc5b/sensors-24-07933-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/19e0e55fc633/sensors-24-07933-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/687f34d128a2/sensors-24-07933-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/10426f660b55/sensors-24-07933-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/29ecd6b4589a/sensors-24-07933-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/b39c376fe7d9/sensors-24-07933-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/209de74f6ba2/sensors-24-07933-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/43689c59820e/sensors-24-07933-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/7b8a995851e8/sensors-24-07933-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/b0b9fecc61ce/sensors-24-07933-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/527a9f08045b/sensors-24-07933-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/a7c81f09e550/sensors-24-07933-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/feb8d9971453/sensors-24-07933-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/605a5209d1f1/sensors-24-07933-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/fc7f0880cc5b/sensors-24-07933-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49e8/11678999/19e0e55fc633/sensors-24-07933-g014.jpg

相似文献

1
Infrared Aircraft Detection Algorithm Based on High-Resolution Feature-Enhanced Semantic Segmentation Network.基于高分辨率特征增强语义分割网络的红外飞机检测算法
Sensors (Basel). 2024 Dec 11;24(24):7933. doi: 10.3390/s24247933.
2
Semantic Segmentation Network Based on Adaptive Attention and Deep Fusion Utilizing a Multi-Scale Dilated Convolutional Pyramid.基于自适应注意力和深度融合的语义分割网络:利用多尺度扩张卷积金字塔
Sensors (Basel). 2024 Aug 16;24(16):5305. doi: 10.3390/s24165305.
3
An improved DeepLabv3 + railway track extraction algorithm based on densely connected and attention mechanisms.一种基于密集连接和注意力机制的改进型DeepLabv3 +铁路轨道提取算法。
Sci Rep. 2025 Jan 20;15(1):2556. doi: 10.1038/s41598-024-84937-5.
4
A lightweight semantic segmentation method for concrete bridge surface diseases based on improved DeeplabV3.一种基于改进的深度卷积神经网络语义分割算法(DeeplabV3)的混凝土桥面病害轻量级语义分割方法。
Sci Rep. 2025 Mar 26;15(1):10348. doi: 10.1038/s41598-025-95518-5.
5
Novel Method of Semantic Segmentation Applicable to Augmented Reality.适用于增强现实的语义分割新方法。
Sensors (Basel). 2020 Mar 20;20(6):1737. doi: 10.3390/s20061737.
6
Semantic segmentation for tooth cracks using improved DeepLabv3+ model.使用改进的DeepLabv3+模型进行牙齿裂纹的语义分割。
Heliyon. 2024 Feb 10;10(4):e25892. doi: 10.1016/j.heliyon.2024.e25892. eCollection 2024 Feb 29.
7
Cattle Target Segmentation Method in Multi-Scenes Using Improved DeepLabV3+ Method.基于改进的DeepLabV3+方法的多场景牛目标分割方法
Animals (Basel). 2023 Aug 4;13(15):2521. doi: 10.3390/ani13152521.
8
An ENet Semantic Segmentation Method Combined with Attention Mechanism.一种结合注意力机制的 ENet 语义分割方法。
Comput Intell Neurosci. 2023 Feb 22;2023:6965259. doi: 10.1155/2023/6965259. eCollection 2023.
9
BMSeNet: Multiscale Context Pyramid Pooling and Spatial Detail Enhancement Network for Real-Time Semantic Segmentation.BMSeNet:用于实时语义分割的多尺度上下文金字塔池化与空间细节增强网络
Sensors (Basel). 2024 Aug 9;24(16):5145. doi: 10.3390/s24165145.
10
[Lung parenchyma segmentation based on double scale parallel attention network].基于双尺度并行注意力网络的肺实质分割
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2022 Aug 25;39(4):721-729. doi: 10.7507/1001-5515.202108005.

本文引用的文献

1
DMU-Net: A Dual-Stream Multi-Scale U-Net Network Using Multi-Dimensional Spatial Information for Urban Building Extraction.DMU-Net:一种利用多维空间信息的双流多尺度 U-Net 网络,用于城市建筑物提取。
Sensors (Basel). 2023 Feb 10;23(4):1991. doi: 10.3390/s23041991.
2
Semantic Segmentation of the Malignant Breast Imaging Reporting and Data System Lexicon on Breast Ultrasound Images by Using DeepLab v3.基于 DeepLab v3 的乳腺超声图像中恶性乳腺影像报告和数据系统词典的语义分割
Sensors (Basel). 2022 Jul 18;22(14):5352. doi: 10.3390/s22145352.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.
DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.