Suppr超能文献

基于跨尺度多特征融合的航空图像目标检测方法

Aerial images object detection method based on cross-scale multi-feature fusion.

作者信息

Pan Yang, Yang Jinhua, Zhu Lei, Yao Lina, Zhang Bo

机构信息

School of Electronics and Information, Xi'an Polytechnic University, Xi'an 710048, China.

出版信息

Math Biosci Eng. 2023 Aug 9;20(9):16148-16168. doi: 10.3934/mbe.2023721.

Abstract

Aerial image target detection technology has essential application value in navigation security, traffic control and environmental monitoring. Compared with natural scene images, the background of aerial images is more complex, and there are more small targets, which puts higher requirements on the detection accuracy and real-time performance of the algorithm. To further improve the detection accuracy of lightweight networks for small targets in aerial images, we propose a cross-scale multi-feature fusion target detection method (CMF-YOLOv5s) for aerial images. Based on the original YOLOv5s, a bidirectional cross-scale feature fusion sub-network (BsNet) is constructed, using a newly designed multi-scale fusion module (MFF) and cross-scale feature fusion strategy to enhance the algorithm's ability, that fuses multi-scale feature information and reduces the loss of small target feature information. To improve the problem of the high leakage detection rate of small targets in aerial images, we constructed a multi-scale detection head containing four outputs to improve the network's ability to perceive small targets. To enhance the network's recognition rate of small target samples, we improve the K-means algorithm by introducing a genetic algorithm to optimize the prediction frame size to generate anchor boxes more suitable for aerial images. The experimental results show that on the aerial image small target dataset VisDrone-2019, the proposed method can detect more small targets in aerial images with complex backgrounds. With a detection speed of 116 FPS, compared with the original algorithm, the detection accuracy metrics mAP0.5 and mAP0.5:0.95 for small targets are improved by 5.5% and 3.6%, respectively. Meanwhile, compared with eight advanced lightweight networks such as YOLOv7-Tiny and PP-PicoDet-s, mAP0.5 improves by more than 3.3%, and mAP0.5:0.95 improves by more than 1.9%.

摘要

航空图像目标检测技术在导航安全、交通管制和环境监测等方面具有重要的应用价值。与自然场景图像相比,航空图像的背景更为复杂,小目标更多,这对算法的检测精度和实时性能提出了更高的要求。为了进一步提高航空图像中小目标的轻量级网络检测精度,我们提出了一种用于航空图像的跨尺度多特征融合目标检测方法(CMF-YOLOv5s)。基于原始的YOLOv5s,构建了一个双向跨尺度特征融合子网络(BsNet),采用新设计的多尺度融合模块(MFF)和跨尺度特征融合策略来增强算法能力,融合多尺度特征信息并减少小目标特征信息的损失。为了改善航空图像中小目标漏检率高的问题,我们构建了一个包含四个输出的多尺度检测头,以提高网络感知小目标的能力。为了提高网络对小目标样本的识别率,我们通过引入遗传算法改进K-means算法,优化预测框尺寸,以生成更适合航空图像的锚框。实验结果表明,在航空图像小目标数据集VisDrone-2019上,该方法能够在复杂背景的航空图像中检测出更多小目标。检测速度为116 FPS,与原算法相比,小目标的检测精度指标mAP0.5和mAP0.5:0.95分别提高了5.5%和3.6%。同时,与YOLOv7-Tiny和PP-PicoDet-s等八个先进的轻量级网络相比,mAP0.5提高了3.3%以上,mAP0.5:0.95提高了1.9%以上。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验