用于水面目标检测的基于图像的基准数据集及新型目标检测器

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection.

作者信息

Zhou Zhiguo, Sun Jiaen, Yu Jiabao, Liu Kaiyuan, Duan Junwei, Chen Long, Chen C L Philip

机构信息

School of Information and Electronics, Beijing Institute of Technology, Beijing, China.

College of Information Science and Technology, Jinan University, Guangzhou, China.

出版信息

Front Neurorobot. 2021 Sep 24;15:723336. doi: 10.3389/fnbot.2021.723336. eCollection 2021.

DOI:10.3389/fnbot.2021.723336

PMID:34630064

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8497741/

Abstract

Water surface object detection is one of the most significant tasks in autonomous driving and water surface vision applications. To date, existing public large-scale datasets collected from websites do not focus on specific scenarios. As a characteristic of these datasets, the quantity of the images and instances is also still at a low level. To accelerate the development of water surface autonomous driving, this paper proposes a large-scale, high-quality annotated benchmark dataset, named Water Surface Object Detection Dataset (WSODD), to benchmark different water surface object detection algorithms. The proposed dataset consists of 7,467 water surface images in different water environments, climate conditions, and shooting times. In addition, the dataset comprises a total of 14 common object categories and 21,911 instances. Simultaneously, more specific scenarios are focused on in WSODD. In order to find a straightforward architecture to provide good performance on WSODD, a new object detector, named CRB-Net, is proposed to serve as a baseline. In experiments, CRB-Net was compared with 16 state-of-the-art object detection methods and outperformed all of them in terms of detection precision. In this paper, we further discuss the effect of the dataset diversity (e.g., instance size, lighting conditions), training set size, and dataset details (e.g., method of categorization). Cross-dataset validation shows that WSODD significantly outperforms other relevant datasets and that the adaptability of CRB-Net is excellent.

摘要

水面目标检测是自动驾驶和水面视觉应用中最重要的任务之一。迄今为止，从网站收集的现有公共大规模数据集并未专注于特定场景。作为这些数据集的一个特点，图像和实例的数量仍处于较低水平。为了加速水面自动驾驶的发展，本文提出了一个大规模、高质量标注的基准数据集，名为水面目标检测数据集（WSODD），用于对不同的水面目标检测算法进行基准测试。所提出的数据集由7467张处于不同水环境、气候条件和拍摄时间的水面图像组成。此外，该数据集总共包含14个常见目标类别和21911个实例。同时，WSODD专注于更具体的场景。为了找到一个能在WSODD上提供良好性能的简单架构，提出了一种名为CRB-Net的新目标检测器作为基线。在实验中，CRB-Net与16种先进的目标检测方法进行了比较，在检测精度方面优于所有这些方法。在本文中，我们进一步讨论了数据集多样性（如图例大小、光照条件）、训练集大小和数据集细节（如分类方法）的影响。跨数据集验证表明，WSODD明显优于其他相关数据集，并且CRB-Net的适应性非常出色。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/efc616043f45/fnbot-15-723336-g0001.jpg

相似文献

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection.

Front Neurorobot. 2021 Sep 24;15:723336. doi: 10.3389/fnbot.2021.723336. eCollection 2021.

AIE-YOLO: Effective object detection method in extreme driving scenarios via adaptive image enhancement.

Sci Prog. 2024 Jul-Sep;107(3):368504241263165. doi: 10.1177/00368504241263165.

Tasks Integrated Networks: Joint Detection and Retrieval for Image Search.

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):456-473. doi: 10.1109/TPAMI.2020.3009758. Epub 2021 Dec 7.

3D Object Detection with SLS-Fusion Network in Foggy Weather Conditions.

Sensors (Basel). 2021 Oct 9;21(20):6711. doi: 10.3390/s21206711.

TJU-DHD: A Diverse High-Resolution Dataset for Object Detection.

IEEE Trans Image Process. 2021;30:207-219. doi: 10.1109/TIP.2020.3034487. Epub 2020 Nov 18.

NSTU-BDTAKA: An open dataset for Bangladeshi paper currency detection and recognition.

Data Brief. 2024 Jul 3;55:110701. doi: 10.1016/j.dib.2024.110701. eCollection 2024 Aug.

Road and Railway Smart Mobility: A High-Definition Ground Truth Hybrid Dataset.

Sensors (Basel). 2022 May 22;22(10):3922. doi: 10.3390/s22103922.

EuroCity Persons: A Novel Benchmark for Person Detection in Traffic Scenes.

IEEE Trans Pattern Anal Mach Intell. 2019 Feb 5. doi: 10.1109/TPAMI.2019.2897684.

Learning to Holistically Detect Bridges From Large-Size VHR Remote Sensing Imagery.

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11507-11523. doi: 10.1109/TPAMI.2024.3393024. Epub 2024 Nov 6.

IDOD-YOLOV7: Image-Dehazing YOLOV7 for Object Detection in Low-Light Foggy Traffic Environments.

Sensors (Basel). 2023 Jan 25;23(3):1347. doi: 10.3390/s23031347.

引用本文的文献

YOLO-HPSD: A high-precision ship target detection model based on YOLOv10.

PLoS One. 2025 May 12;20(5):e0321863. doi: 10.1371/journal.pone.0321863. eCollection 2025.

An annotated Dataset and Benchmark for Detecting Floating Debris in Inland Waters.

Sci Data. 2025 Mar 5;12(1):385. doi: 10.1038/s41597-025-04594-9.

Improved YOLOv8 Algorithm for Water Surface Object Detection.

Sensors (Basel). 2024 Aug 5;24(15):5059. doi: 10.3390/s24155059.

Enhanced YOLOv7 integrated with small target enhancement for rapid detection of objects on water surfaces.

Front Neurorobot. 2023 Dec 14;17:1315251. doi: 10.3389/fnbot.2023.1315251. eCollection 2023.

本文引用的文献

Lung Cancer Segmentation With Transfer Learning: Usefulness of a Pretrained Model Constructed From an Artificial Dataset Generated Using a Generative Adversarial Network.

Front Artif Intell. 2021 Jul 16;4:694815. doi: 10.3389/frai.2021.694815. eCollection 2021.

Real-Time Water Surface Object Detection Based on Improved Faster R-CNN.

Sensors (Basel). 2019 Aug 12;19(16):3523. doi: 10.3390/s19163523.

Muscle Synergy Analysis of a Hand-Grasp Dataset: A Limited Subset of Motor Modules May Underlie a Large Variety of Grasps.

Front Neurorobot. 2018 Sep 25;12:57. doi: 10.3389/fnbot.2018.00057. eCollection 2018.

Focal Loss for Dense Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.

A Benchmark Dataset and Saliency-Guided Stacked Autoencoders for Video-Based Salient Object Detection.

IEEE Trans Image Process. 2018 Jan;27(1):349-364. doi: 10.1109/TIP.2017.2762594. Epub 2017 Oct 12.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.

Fast Image-Based Obstacle Detection From Unmanned Surface Vehicles.

IEEE Trans Cybern. 2016 Mar;46(3):641-54. doi: 10.1109/TCYB.2015.2412251. Epub 2015 Mar 31.

Object detection with discriminatively trained part-based models.

IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于水面目标检测的基于图像的基准数据集及新型目标检测器

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献