School of Traffic and Transportation, Chongqing Jiaotong University, Chongqing, China.
PLoS One. 2024 Mar 27;19(3):e0300120. doi: 10.1371/journal.pone.0300120. eCollection 2024.
With the widespread use of UAVs, UAV aerial image target detection technology can be used for practical applications in the military, traffic planning, personnel search and rescue and other fields. In this paper, we propose a multi-scale UAV aerial image detection method based on adaptive feature fusion for solving the problem of detecting small target objects in UAV aerial images. This method automatically adjusts the convolution kernel receptive field and reduces the redundant background of the image by adding an adaptive feature extraction module (AFEM) to the backbone network. This enables it to obtain more accurately and effectively small target feature information. In addition, we design an adaptive feature weighted fusion network (SBiFPN) to effectively enhance the representation of shallow feature information of small targets. Finally, we add an additional small target detection scale to the original network to expand the receptive field of the network and strengthen the detection of small target objects. The training and testing are carried out on the VisDrone public dataset. The experimental results show that the proposed method can achieve 38.5% mAP, which is 2.0% higher than the baseline network YOLOv5s, and can still detect the UAV aerial image well in complex scenes.
随着无人机的广泛应用,无人机航拍图像目标检测技术可以在军事、交通规划、人员搜索和救援等领域得到实际应用。本文提出了一种基于自适应特征融合的多尺度无人机航拍图像检测方法,用于解决无人机航拍图像中小目标物体的检测问题。该方法通过在骨干网络中添加自适应特征提取模块(AFEM),自动调整卷积核感受野,减少图像的冗余背景,从而更准确有效地获取小目标特征信息。此外,我们设计了自适应特征加权融合网络(SBiFPN),有效地增强了小目标浅特征信息的表示。最后,我们在原始网络中添加了一个额外的小目标检测尺度,以扩展网络的感受野,加强对小目标物体的检测。在 VisDrone 公共数据集上进行了训练和测试。实验结果表明,所提出的方法可以达到 38.5%的 mAP,比基线网络 YOLOv5s 高 2.0%,并且在复杂场景下仍能很好地检测无人机航拍图像。