Pan Yang, Yang Jinhua, Zhu Lei, Yao Lina, Zhang Bo
School of Electronics and Information, Xi'an Polytechnic University, Xi'an 710048, China.
Math Biosci Eng. 2023 Aug 9;20(9):16148-16168. doi: 10.3934/mbe.2023721.
Aerial image target detection technology has essential application value in navigation security, traffic control and environmental monitoring. Compared with natural scene images, the background of aerial images is more complex, and there are more small targets, which puts higher requirements on the detection accuracy and real-time performance of the algorithm. To further improve the detection accuracy of lightweight networks for small targets in aerial images, we propose a cross-scale multi-feature fusion target detection method (CMF-YOLOv5s) for aerial images. Based on the original YOLOv5s, a bidirectional cross-scale feature fusion sub-network (BsNet) is constructed, using a newly designed multi-scale fusion module (MFF) and cross-scale feature fusion strategy to enhance the algorithm's ability, that fuses multi-scale feature information and reduces the loss of small target feature information. To improve the problem of the high leakage detection rate of small targets in aerial images, we constructed a multi-scale detection head containing four outputs to improve the network's ability to perceive small targets. To enhance the network's recognition rate of small target samples, we improve the K-means algorithm by introducing a genetic algorithm to optimize the prediction frame size to generate anchor boxes more suitable for aerial images. The experimental results show that on the aerial image small target dataset VisDrone-2019, the proposed method can detect more small targets in aerial images with complex backgrounds. With a detection speed of 116 FPS, compared with the original algorithm, the detection accuracy metrics mAP0.5 and mAP0.5:0.95 for small targets are improved by 5.5% and 3.6%, respectively. Meanwhile, compared with eight advanced lightweight networks such as YOLOv7-Tiny and PP-PicoDet-s, mAP0.5 improves by more than 3.3%, and mAP0.5:0.95 improves by more than 1.9%.
航空图像目标检测技术在导航安全、交通管制和环境监测等方面具有重要的应用价值。与自然场景图像相比,航空图像的背景更为复杂,小目标更多,这对算法的检测精度和实时性能提出了更高的要求。为了进一步提高航空图像中小目标的轻量级网络检测精度,我们提出了一种用于航空图像的跨尺度多特征融合目标检测方法(CMF-YOLOv5s)。基于原始的YOLOv5s,构建了一个双向跨尺度特征融合子网络(BsNet),采用新设计的多尺度融合模块(MFF)和跨尺度特征融合策略来增强算法能力,融合多尺度特征信息并减少小目标特征信息的损失。为了改善航空图像中小目标漏检率高的问题,我们构建了一个包含四个输出的多尺度检测头,以提高网络感知小目标的能力。为了提高网络对小目标样本的识别率,我们通过引入遗传算法改进K-means算法,优化预测框尺寸,以生成更适合航空图像的锚框。实验结果表明,在航空图像小目标数据集VisDrone-2019上,该方法能够在复杂背景的航空图像中检测出更多小目标。检测速度为116 FPS,与原算法相比,小目标的检测精度指标mAP0.5和mAP0.5:0.95分别提高了5.5%和3.6%。同时,与YOLOv7-Tiny和PP-PicoDet-s等八个先进的轻量级网络相比,mAP0.5提高了3.3%以上,mAP0.5:0.95提高了1.9%以上。