Yu Jie, Zheng Hao, Xie Li, Zhang Lei, Yu Mei, Han Jin
Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, School of Computer and Information, China Three Gorges University, Yichang, China.
School of Computer and Information, China Three Gorges University, Yichang, China.
Front Neurorobot. 2023 Dec 14;17:1315251. doi: 10.3389/fnbot.2023.1315251. eCollection 2023.
Unmanned surface vessel (USV) target detection algorithms often face challenges such as misdetection and omission of small targets due to significant variations in target scales and susceptibility to interference from complex environments. To address these issues, we propose a small target enhanced YOLOv7 (STE-YOLO) approach. Firstly, we introduce a specialized detection branch designed to identify tiny targets. This enhancement aims to improve the multi-scale target detection capabilities and address difficulties in recognizing targets of different sizes. Secondly, we present the lite visual center (LVC) module, which effectively fuses data from different levels to give more attention to small targets. Additionally, we integrate the lite efficient layer aggregation networks (L-ELAN) into the backbone network to reduce redundant computations and enhance computational efficiency. Lastly, we use Wise-IOU to optimize the loss function definition, thereby improving the model robustness by dynamically optimizing gradient contributions from samples of varying quality. We conducted experiments on the WSODD dataset and the FIOW-Img dataset. The results on the comprehensive WSODD dataset demonstrate that STE-YOLO, when compared to YOLOv7, reduces network parameters by 14% while improving AP50 and APs scores by 2.1% and 1.6%, respectively. Furthermore, when compared to five other leading target detection algorithms, STE-YOLO demonstrates superior accuracy and efficiency.
无人水面舰艇(USV)目标检测算法常常面临诸多挑战,比如由于目标尺度的显著变化以及易受复杂环境干扰,导致小目标的误检和漏检。为解决这些问题,我们提出一种小目标增强YOLOv7(STE-YOLO)方法。首先,我们引入一个专门用于识别微小目标的检测分支。这种增强旨在提高多尺度目标检测能力,并解决识别不同大小目标的困难。其次,我们提出了轻量级视觉中心(LVC)模块,它能有效融合来自不同层级的数据,从而更关注小目标。此外,我们将轻量级高效层聚合网络(L-ELAN)集成到骨干网络中,以减少冗余计算并提高计算效率。最后,我们使用Wise-IOU来优化损失函数定义,从而通过动态优化来自不同质量样本的梯度贡献来提高模型的鲁棒性。我们在WSODD数据集和FIOW-Img数据集上进行了实验。在综合的WSODD数据集上的结果表明,与YOLOv7相比,STE-YOLO的网络参数减少了14%,同时AP50和APs分数分别提高了2.1%和1.6%。此外,与其他五种领先的目标检测算法相比,STE-YOLO展现出卓越的准确性和效率。