基于动态任务对齐样本分配和注意力机制的增强型水面目标检测

Enhanced Water Surface Object Detection with Dynamic Task-Aligned Sample Assignment and Attention Mechanisms.

作者信息

Zhao Liangtian, Qiu Shouqiang, Chen Yuanming

机构信息

School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510641, China.

出版信息

Sensors (Basel). 2024 May 14;24(10):3104. doi: 10.3390/s24103104.

DOI:10.3390/s24103104

PMID:38793957

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11125137/

Abstract

The detection of objects on water surfaces is a pivotal technology for the perceptual systems of unmanned surface vehicles (USVs). This paper proposes a novel real-time target detection system designed to address the challenges posed by indistinct bottom boundaries and foggy imagery. Our method enhances the YOLOv8s model by incorporating the convolutional block attention module (CBAM) and a self-attention mechanism, examining their impact at various integration points. A dynamic sample assignment strategy was introduced to enhance the precision of our model and accelerate its convergence. To address the challenge of delineating bottom boundaries with clarity, our model employs a two-strategy approach: a threshold filter and a feedforward neural network (FFN) that provides targeted guidance for refining these boundaries. Our model demonstrated exceptional performance, achieving a mean average precision (mAP) of 47.1% on the water surface object dataset, which represents a 1.7% increase over the baseline YOLOv8 model. The dynamic sample assignment strategy contributes a 1.0% improvement on average precision at the intersection over union (IoU) threshold of 0.5 (AP), while the FFN strategy fine-tunes the bottom boundaries and achieves an additional 0.8% improvement in average precision at IoU threshold of 0.75 (AP). Furthermore, ablation studies have validated the versatility of our approach, confirming its potential for integration into various detection frameworks.

摘要

水面目标检测是无人水面艇（USV）感知系统的一项关键技术。本文提出了一种新颖的实时目标检测系统，旨在应对模糊的底部边界和模糊图像带来的挑战。我们的方法通过整合卷积块注意力模块（CBAM）和自注意力机制来增强YOLOv8s模型，并在不同的集成点考察它们的影响。引入了一种动态样本分配策略来提高模型的精度并加速其收敛。为了应对清晰描绘底部边界的挑战，我们的模型采用了双策略方法：一个阈值滤波器和一个前馈神经网络（FFN），该网络为细化这些边界提供有针对性的指导。我们的模型表现出色，在水面目标数据集上实现了47.1%的平均精度均值（mAP），比基线YOLOv8模型提高了1.7%。动态样本分配策略在交并比（IoU）阈值为0.5时的平均精度（AP）上平均提高了1.0%，而FFN策略对底部边界进行了微调，在IoU阈值为0.75时的平均精度（AP）上又提高了0.8%。此外，消融研究验证了我们方法的通用性，证实了其集成到各种检测框架中的潜力。