Suppr超能文献

提升光学遥感图像目标检测中的检测性能:一种采用空间自适应角度感知网络和边缘感知倾斜边界框损失函数的双重策略

Elevating Detection Performance in Optical Remote Sensing Image Object Detection: A Dual Strategy with Spatially Adaptive Angle-Aware Networks and Edge-Aware Skewed Bounding Box Loss Function.

作者信息

Yan Zexin, Fan Jie, Li Zhongbo, Xie Yongqiang

机构信息

Institute of System Engineering, Academy of Military Sciences, Beijing 100141, China.

出版信息

Sensors (Basel). 2024 Aug 18;24(16):5342. doi: 10.3390/s24165342.

Abstract

In optical remote sensing image object detection, discontinuous boundaries often limit detection accuracy, particularly at high Intersection over Union (IoU) thresholds. This paper addresses this issue by proposing the Spatial Adaptive Angle-Aware (SA3) Network. The SA3 Network employs a hierarchical refinement approach, consisting of coarse regression, fine regression, and precise tuning, to optimize the angle parameters of rotated bounding boxes. It adapts to specific task scenarios using either class-aware or class-agnostic strategies. Experimental results demonstrate its effectiveness in significantly improving detection accuracy at high IoU thresholds. Additionally, we introduce a Gaussian transform-based IoU factor during angle regression loss calculation, leading to the development of Edge-aware Skewed Bounding Box Loss (EAS Loss). The EAS loss enhances the loss gradient at the final stage of angle regression for bounding boxes, addressing the challenge of further learning when the predicted box angle closely aligns with the real target box angle. This results in increased training efficiency and better alignment between training and evaluation metrics. Experimental results show that the proposed method substantially enhances the detection accuracy of ReDet and ReBiDet models. The SA3 Network and EAS loss not only elevate the mAP of the ReBiDet model on DOTA-v1.5 to 78.85% but also effectively improve the model's mAP under high IoU threshold conditions.

摘要

在光学遥感图像目标检测中,不连续的边界常常限制检测精度,尤其是在高交并比(IoU)阈值的情况下。本文通过提出空间自适应角度感知(SA3)网络来解决这一问题。SA3网络采用一种分层细化方法,包括粗回归、细回归和精确调整,以优化旋转边界框的角度参数。它使用类别感知或类别不可知策略来适应特定的任务场景。实验结果表明,它在高IoU阈值下能显著提高检测精度。此外,我们在角度回归损失计算过程中引入基于高斯变换的IoU因子,从而开发出边缘感知倾斜边界框损失(EAS损失)。EAS损失增强了边界框角度回归最后阶段的损失梯度,解决了预测框角度与真实目标框角度紧密对齐时进一步学习的挑战。这导致训练效率提高,训练和评估指标之间的对齐性更好。实验结果表明,所提出的方法显著提高了ReDet和ReBiDet模型的检测精度。SA3网络和EAS损失不仅将ReBiDet模型在DOTA-v1.5上的平均精度均值(mAP)提高到78.85%,而且在高IoU阈值条件下有效地提高了模型的mAP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d1b/11359887/bd18bf334387/sensors-24-05342-g001.jpg

相似文献

4
Oriented Vehicle Detection in Aerial Images Based on YOLOv4.
Sensors (Basel). 2022 Nov 1;22(21):8394. doi: 10.3390/s22218394.
5
Powerful-IoU: More straightforward and faster bounding box regression loss with a nonmonotonic focusing mechanism.
Neural Netw. 2024 Feb;170:276-284. doi: 10.1016/j.neunet.2023.11.041. Epub 2023 Nov 22.
6
7
Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUs.
J Imaging. 2022 Jul 11;8(7):193. doi: 10.3390/jimaging8070193.
9
EMG-YOLO: road crack detection algorithm for edge computing devices.
Front Neurorobot. 2024 Jul 2;18:1423738. doi: 10.3389/fnbot.2024.1423738. eCollection 2024.
10
Research on the Method of Foreign Object Detection for Railway Tracks Based on Deep Learning.
Sensors (Basel). 2024 Jul 11;24(14):4483. doi: 10.3390/s24144483.

引用本文的文献

本文引用的文献

1
Detecting Rotated Objects as Gaussian Distributions and its 3-D Generalization.
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4335-4354. doi: 10.1109/TPAMI.2022.3197152. Epub 2023 Mar 7.
2
SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing.
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2384-2399. doi: 10.1109/TPAMI.2022.3166956. Epub 2023 Jan 6.
3
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges.
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7778-7796. doi: 10.1109/TPAMI.2021.3117983. Epub 2022 Oct 4.
4
Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection.
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1452-1459. doi: 10.1109/TPAMI.2020.2974745. Epub 2021 Mar 5.
5
Mask R-CNN.
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.
6
TextBoxes++: A Single-Shot Oriented Scene Text Detector.
IEEE Trans Image Process. 2018 Aug;27(8):3676-3690. doi: 10.1109/TIP.2018.2825107. Epub 2018 Apr 9.
7
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
8
High-performance rotation invariant multiview face detection.
IEEE Trans Pattern Anal Mach Intell. 2007 Apr;29(4):671-86. doi: 10.1109/TPAMI.2007.1011.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验