OD-YOLO：基于新型多尺度特征融合的遥感图像稳健小目标检测模型

OD-YOLO: Robust Small Object Detection Model in Remote Sensing Image with a Novel Multi-Scale Feature Fusion.

作者信息

Bu Yangcheng, Ye Hairong, Tie Zhixin, Chen Yanbing, Zhang Dingming

机构信息

School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China.

KeYi College, Zhejiang Sci-Tech University, Shaoxing 312369, China.

出版信息

Sensors (Basel). 2024 Jun 3;24(11):3596. doi: 10.3390/s24113596.

DOI:10.3390/s24113596

PMID:38894387

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11175302/

Abstract

As remote sensing technology has advanced, the use of satellites and similar technologies has become increasingly prevalent in daily life. Now, it plays a crucial role in hydrology, agriculture, and geography. Nevertheless, because of the distinct qualities of remote sensing, including expansive scenes and small, densely packed targets, there are many challenges in detecting remote sensing objects. Those challenges lead to insufficient accuracy in remote sensing object detection. Consequently, developing a new model is essential to enhance the identification capabilities for objects in remote sensing imagery. To solve these constraints, we have designed the OD-YOLO approach that uses multi-scale feature fusion to improve the performance of the YOLOv8n model in small target detection. Firstly, traditional convolutions have poor recognition capabilities for certain geometric shapes. Therefore, in this paper, we introduce the Detection Refinement Module (DRmodule) into the backbone architecture. This module utilizes Deformable Convolutional Networks and the Hybrid Attention Transformer to strengthen the model's capability for feature extraction from geometric shapes and blurred objects effectively. Meanwhile, based on the Feature Pyramid Network of YOLO, at the head of the model framework, this paper enhances the detection capability by introducing a Dynamic Head to strengthen the fusion of different scales features in the feature pyramid. Additionally, to address the issue of detecting small objects in remote sensing images, this paper specifically designs the OIoU loss function to finely describe the difference between the detection box and the true box, further enhancing model performance. Experiments on the VisDrone dataset show that OD-YOLO surpasses the compared models by at least 5.2% in mAP50 and 4.4% in mAP75, and experiments on the Foggy Cityscapes dataset demonstrated that OD-YOLO improved mAP by 6.5%, demonstrating outstanding results in tasks related to remote sensing images and adverse weather object detection. This work not only advances the research in remote sensing image analysis, but also provides effective technical support for the practical deployment of future remote sensing applications.

摘要

随着遥感技术的发展，卫星及类似技术在日常生活中的应用越来越普遍。如今，它在水文、农业和地理领域发挥着至关重要的作用。然而，由于遥感具有独特的特性，包括场景广阔以及目标小且密集，在检测遥感目标时存在诸多挑战。这些挑战导致遥感目标检测的准确性不足。因此，开发一种新模型对于提高遥感图像中目标的识别能力至关重要。为了解决这些限制，我们设计了OD - YOLO方法，该方法使用多尺度特征融合来提高YOLOv8n模型在小目标检测中的性能。首先，传统卷积对某些几何形状的识别能力较差。因此，在本文中，我们将检测细化模块（DRmodule）引入主干架构。该模块利用可变形卷积网络和混合注意力变换器，有效地增强了模型从几何形状和模糊目标中提取特征的能力。同时，基于YOLO的特征金字塔网络，在模型框架的头部，本文通过引入动态头部来增强检测能力，以加强特征金字塔中不同尺度特征的融合。此外，为了解决遥感图像中检测小目标的问题，本文专门设计了OIoU损失函数，以精确描述检测框与真实框之间的差异，进一步提高模型性能。在VisDrone数据集上的实验表明，OD - YOLO在mAP50上比对比模型至少高出5.2%，在mAP75上高出4.4%，并且在Foggy Cityscapes数据集上的实验表明，OD - YOLO将mAP提高了6.5%，在与遥感图像和恶劣天气目标检测相关的任务中取得了优异的成果。这项工作不仅推动了遥感图像分析的研究，还为未来遥感应用的实际部署提供了有效的技术支持。