Feng Sang, Huang Yi, Zhang Ning
School of Electromechanical Engineering, Guangdong University of Technology, Guangzhou 510006, China.
Sensors (Basel). 2024 Sep 9;24(17):5850. doi: 10.3390/s24175850.
Unmanned aerial vehicles (UAVs) with cameras offer extensive monitoring capabilities and exceptional maneuverability, making them ideal for real-time ship detection and effective ship management. However, ship detection by camera-equipped UAVs faces challenges when it comes to multi-viewpoints, multi-scales, environmental variability, and dataset scarcity. To overcome these challenges, we proposed a data augmentation method based on stable diffusion to generate new images for expanding the dataset. Additionally, we improve the YOLOv8n OBB model by incorporating the BiFPN structure and EMA module, enhancing its ability to detect multi-viewpoint and multi-scale ship instances. Through multiple comparative experiments, we evaluated the effectiveness of our proposed data augmentation method and the improved model. The results indicated that our proposed data augmentation method is effective for low-volume datasets with complex object features. The YOLOv8n-BiFPN-EMA OBB model we proposed performed well in detecting multi-viewpoint and multi-scale ship instances, achieving the mAP (@0.5) of 92.3%, the mAP (@0.5:0.95) of 77.5%, a reduction of 0.8 million in model parameters, and a detection speed that satisfies real-time ship detection requirements.
配备摄像头的无人机具有广泛的监测能力和出色的机动性,使其成为实时船舶检测和有效船舶管理的理想选择。然而,配备摄像头的无人机在进行船舶检测时,在多视角、多尺度、环境变化和数据集稀缺等方面面临挑战。为了克服这些挑战,我们提出了一种基于稳定扩散的数据增强方法,以生成新图像来扩展数据集。此外,我们通过整合BiFPN结构和EMA模块对YOLOv8n OBB模型进行改进,增强其检测多视角和多尺度船舶实例的能力。通过多次对比实验,我们评估了所提出的数据增强方法和改进模型的有效性。结果表明,我们提出的数据增强方法对于具有复杂对象特征的小容量数据集是有效的。我们提出的YOLOv8n-BiFPN-EMA OBB模型在检测多视角和多尺度船舶实例方面表现出色,mAP(@0.5)达到92.3%,mAP(@0.5:0.95)达到77.5%,模型参数减少了80万,检测速度满足实时船舶检测要求。