用于多波束前视声呐水下目标检测的先进深度学习框架。

Advanced deep learning framework for underwater object detection with multibeam forward-looking sonar.

作者信息

Ge Liangfu, Singh Premjeet, Sadhu Ayan

机构信息

Department of Civil and Environmental Engineering, The Western Academy for Advanced Research, Western University, London, ON, Canada.

Department of Civil and Environmental Engineering, Western University, London, ON, Canada.

出版信息

Struct Health Monit. 2024 Mar 24;24(4):1991-2007. doi: 10.1177/14759217241235637. eCollection 2025 Jul.

DOI:10.1177/14759217241235637

PMID:40621572

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12227173/

Abstract

Underwater object detection (UOD) is an essential activity in maintaining and monitoring underwater infrastructure, playing an important role in their efficient and low-risk asset management. In underwater environments, sonar, recognized for overcoming the limitations of optical imaging in low-light and turbid conditions, has increasingly gained popularity for UOD. However, due to the low resolution and limited foreground-background contrast in sonar images, existing sonar-based object detection algorithms still face challenges regarding precision and transferability. To solve these challenges, this article proposes an advanced deep learning framework for UOD that uses the data from multibeam forward-looking sonar. The framework is adapted from the network architecture of YOLOv7, one of the state-of-the-art vision-based object detection algorithms, by incorporating unique optimizations in three key aspects: data preprocessing, feature fusion, and loss functions. These improvements are extensively tested on a dedicated public dataset, showing superior object classification performance compared to the selected existing sonar-based methods. Through experiments conducted on an underwater remotely operated vehicle, the proposed framework validates significant enhancements in target classification, localization, and transfer learning capabilities. Since the engineering structures have similar geometric shapes to the objects tested in this study, the proposed framework presents potential applicability to underwater structural inspection and monitoring, and autonomous asset management.

摘要

水下目标检测（UOD）是维护和监测水下基础设施的一项重要活动，在其高效且低风险的资产管理中发挥着重要作用。在水下环境中，声纳因克服了光学成像在低光照和浑浊条件下的局限性而受到认可，在水下目标检测中越来越受欢迎。然而，由于声纳图像分辨率低且前景 - 背景对比度有限，现有的基于声纳的目标检测算法在精度和可迁移性方面仍面临挑战。为了解决这些挑战，本文提出了一种用于水下目标检测的先进深度学习框架，该框架使用来自多波束前视声纳的数据。该框架是基于最先进的基于视觉的目标检测算法之一YOLOv7的网络架构改编而来，在数据预处理、特征融合和损失函数三个关键方面进行了独特的优化。这些改进在一个专用公共数据集上进行了广泛测试，与所选的现有基于声纳的方法相比，显示出卓越的目标分类性能。通过在水下遥控潜水器上进行的实验，所提出的框架验证了在目标分类、定位和迁移学习能力方面的显著提升。由于工程结构与本研究中测试的物体具有相似的几何形状，所提出的框架在水下结构检查和监测以及自主资产管理方面具有潜在的适用性。