Zhang Xiaoli, Zuo Guocai
Changsha Institute of Technology, Changsha, Hunan, China.
School of Electronic Information Engineering, Lang Fang Normal University, Langfang, 065000, Hebei, China.
Sci Rep. 2025 Jan 2;15(1):421. doi: 10.1038/s41598-024-84747-9.
The main challenges faced when detecting targets captured by UAVs include small target image size, dense target distribution, and uneven category distribution.In addition, the hardware limitations of UAVs impose constraints on the size and complexity of the model, which may lead to poor detection accuracy of the model. In order to solve these problems, a small target detection method based on the improved YOLOv8 algorithm for UAV viewpoint is proposed. The following improvements are made to the YOLOv8n model. Firstly, a bi-directional feature pyramid network (BiFPN) is introduced to enhance the fusion capability of the features. This improvement leads to better detection of small targets. Secondly, in the head part of the model, the original C2f module is replaced with the C3Ghost module. This change maintains the model's performance while significantly reducing the computational load. Lastly, the detection head adds a channel attention mechanism. This mechanism helps in filtering out unimportant information and enhancing the recognition of key features. The MPDIoU (Minimum Point Distance based IoU) loss function is improved, and the idea of inner-IoU loss function is adopted as a way to improve the model's learning ability for difficult samples. Experimental results on the VisDrone dataset show that the YOLOv8n model with these improvements improves 17.2%, 10.5%, and 16.2% in mean accuracy (mAP), precision (P), and recall (R), respectively, and these improvements significantly improve the performance of small target detection from the UAV viewpoint.
无人机捕获目标检测时面临的主要挑战包括目标图像尺寸小、目标分布密集以及类别分布不均衡。此外,无人机的硬件限制对模型的大小和复杂度施加了约束,这可能导致模型的检测精度较差。为了解决这些问题,提出了一种基于改进的YOLOv8算法的无人机视角小目标检测方法。对YOLOv8n模型进行了以下改进。首先,引入双向特征金字塔网络(BiFPN)以增强特征融合能力。这一改进使得对小目标的检测效果更好。其次,在模型的头部,将原来的C2f模块替换为C3Ghost模块。这一改变在显著降低计算量的同时保持了模型的性能。最后,检测头添加了通道注意力机制。该机制有助于过滤掉不重要的信息并增强对关键特征的识别。改进了MPDIoU(基于最小点距离的IoU)损失函数,并采用内IoU损失函数的思想来提高模型对困难样本的学习能力。在VisDrone数据集上的实验结果表明,经过这些改进的YOLOv8n模型在平均精度(mAP)、精确率(P)和召回率(R)方面分别提高了17.2%、10.5%和16.2%,这些改进从无人机视角显著提升了小目标检测的性能。