• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于水面目标检测的基于图像的基准数据集及新型目标检测器

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection.

作者信息

Zhou Zhiguo, Sun Jiaen, Yu Jiabao, Liu Kaiyuan, Duan Junwei, Chen Long, Chen C L Philip

机构信息

School of Information and Electronics, Beijing Institute of Technology, Beijing, China.

College of Information Science and Technology, Jinan University, Guangzhou, China.

出版信息

Front Neurorobot. 2021 Sep 24;15:723336. doi: 10.3389/fnbot.2021.723336. eCollection 2021.

DOI:10.3389/fnbot.2021.723336
PMID:34630064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8497741/
Abstract

Water surface object detection is one of the most significant tasks in autonomous driving and water surface vision applications. To date, existing public large-scale datasets collected from websites do not focus on specific scenarios. As a characteristic of these datasets, the quantity of the images and instances is also still at a low level. To accelerate the development of water surface autonomous driving, this paper proposes a large-scale, high-quality annotated benchmark dataset, named Water Surface Object Detection Dataset (WSODD), to benchmark different water surface object detection algorithms. The proposed dataset consists of 7,467 water surface images in different water environments, climate conditions, and shooting times. In addition, the dataset comprises a total of 14 common object categories and 21,911 instances. Simultaneously, more specific scenarios are focused on in WSODD. In order to find a straightforward architecture to provide good performance on WSODD, a new object detector, named CRB-Net, is proposed to serve as a baseline. In experiments, CRB-Net was compared with 16 state-of-the-art object detection methods and outperformed all of them in terms of detection precision. In this paper, we further discuss the effect of the dataset diversity (e.g., instance size, lighting conditions), training set size, and dataset details (e.g., method of categorization). Cross-dataset validation shows that WSODD significantly outperforms other relevant datasets and that the adaptability of CRB-Net is excellent.

摘要

水面目标检测是自动驾驶和水面视觉应用中最重要的任务之一。迄今为止,从网站收集的现有公共大规模数据集并未专注于特定场景。作为这些数据集的一个特点,图像和实例的数量仍处于较低水平。为了加速水面自动驾驶的发展,本文提出了一个大规模、高质量标注的基准数据集,名为水面目标检测数据集(WSODD),用于对不同的水面目标检测算法进行基准测试。所提出的数据集由7467张处于不同水环境、气候条件和拍摄时间的水面图像组成。此外,该数据集总共包含14个常见目标类别和21911个实例。同时,WSODD专注于更具体的场景。为了找到一个能在WSODD上提供良好性能的简单架构,提出了一种名为CRB-Net的新目标检测器作为基线。在实验中,CRB-Net与16种先进的目标检测方法进行了比较,在检测精度方面优于所有这些方法。在本文中,我们进一步讨论了数据集多样性(如图例大小、光照条件)、训练集大小和数据集细节(如分类方法)的影响。跨数据集验证表明,WSODD明显优于其他相关数据集,并且CRB-Net的适应性非常出色。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/8dcb1a8a6074/fnbot-15-723336-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/efc616043f45/fnbot-15-723336-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/2b0b3cd9a3b3/fnbot-15-723336-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/5b4f2c069c81/fnbot-15-723336-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/842bedd55389/fnbot-15-723336-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/7879c99dd620/fnbot-15-723336-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/b9b4bc00c816/fnbot-15-723336-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/7c9a539190a9/fnbot-15-723336-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/ae5c28ce6014/fnbot-15-723336-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/8dcb1a8a6074/fnbot-15-723336-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/efc616043f45/fnbot-15-723336-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/2b0b3cd9a3b3/fnbot-15-723336-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/5b4f2c069c81/fnbot-15-723336-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/842bedd55389/fnbot-15-723336-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/7879c99dd620/fnbot-15-723336-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/b9b4bc00c816/fnbot-15-723336-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/7c9a539190a9/fnbot-15-723336-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/ae5c28ce6014/fnbot-15-723336-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/507d/8497741/8dcb1a8a6074/fnbot-15-723336-g0009.jpg

相似文献

1
An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection.用于水面目标检测的基于图像的基准数据集及新型目标检测器
Front Neurorobot. 2021 Sep 24;15:723336. doi: 10.3389/fnbot.2021.723336. eCollection 2021.
2
AIE-YOLO: Effective object detection method in extreme driving scenarios via adaptive image enhancement.AIE-YOLO:通过自适应图像增强实现的极端驾驶场景下的有效目标检测方法
Sci Prog. 2024 Jul-Sep;107(3):368504241263165. doi: 10.1177/00368504241263165.
3
Tasks Integrated Networks: Joint Detection and Retrieval for Image Search.任务集成网络:图像搜索的联合检测与检索。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):456-473. doi: 10.1109/TPAMI.2020.3009758. Epub 2021 Dec 7.
4
3D Object Detection with SLS-Fusion Network in Foggy Weather Conditions.雾天条件下基于SLS融合网络的3D目标检测
Sensors (Basel). 2021 Oct 9;21(20):6711. doi: 10.3390/s21206711.
5
TJU-DHD: A Diverse High-Resolution Dataset for Object Detection.TJU-DHD:一个用于目标检测的多样化高分辨率数据集。
IEEE Trans Image Process. 2021;30:207-219. doi: 10.1109/TIP.2020.3034487. Epub 2020 Nov 18.
6
NSTU-BDTAKA: An open dataset for Bangladeshi paper currency detection and recognition.NSTU - BDTAKA:一个用于孟加拉国纸币检测与识别的开放数据集。
Data Brief. 2024 Jul 3;55:110701. doi: 10.1016/j.dib.2024.110701. eCollection 2024 Aug.
7
Road and Railway Smart Mobility: A High-Definition Ground Truth Hybrid Dataset.道路与铁路智能交通:高清地面实况混合数据集。
Sensors (Basel). 2022 May 22;22(10):3922. doi: 10.3390/s22103922.
8
EuroCity Persons: A Novel Benchmark for Person Detection in Traffic Scenes.欧洲城市行人:交通场景中行人检测的一种新型基准
IEEE Trans Pattern Anal Mach Intell. 2019 Feb 5. doi: 10.1109/TPAMI.2019.2897684.
9
Learning to Holistically Detect Bridges From Large-Size VHR Remote Sensing Imagery.从大尺寸高分辨率遥感影像中整体检测桥梁的方法研究
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11507-11523. doi: 10.1109/TPAMI.2024.3393024. Epub 2024 Nov 6.
10
IDOD-YOLOV7: Image-Dehazing YOLOV7 for Object Detection in Low-Light Foggy Traffic Environments.IDOD-YOLOV7:用于低光照雾天交通环境中目标检测的图像去雾 YOLOV7。
Sensors (Basel). 2023 Jan 25;23(3):1347. doi: 10.3390/s23031347.

引用本文的文献

1
YOLO-HPSD: A high-precision ship target detection model based on YOLOv10.YOLO-HPSD:一种基于YOLOv10的高精度船舶目标检测模型。
PLoS One. 2025 May 12;20(5):e0321863. doi: 10.1371/journal.pone.0321863. eCollection 2025.
2
An annotated Dataset and Benchmark for Detecting Floating Debris in Inland Waters.一个用于检测内陆水域漂浮垃圾的带注释数据集和基准。
Sci Data. 2025 Mar 5;12(1):385. doi: 10.1038/s41597-025-04594-9.
3
Improved YOLOv8 Algorithm for Water Surface Object Detection.用于水面目标检测的改进YOLOv8算法

本文引用的文献

1
Lung Cancer Segmentation With Transfer Learning: Usefulness of a Pretrained Model Constructed From an Artificial Dataset Generated Using a Generative Adversarial Network.基于迁移学习的肺癌分割:利用生成对抗网络生成的人工数据集构建的预训练模型的效用
Front Artif Intell. 2021 Jul 16;4:694815. doi: 10.3389/frai.2021.694815. eCollection 2021.
2
Real-Time Water Surface Object Detection Based on Improved Faster R-CNN.基于改进型更快区域卷积神经网络的实时水面目标检测
Sensors (Basel). 2019 Aug 12;19(16):3523. doi: 10.3390/s19163523.
3
Muscle Synergy Analysis of a Hand-Grasp Dataset: A Limited Subset of Motor Modules May Underlie a Large Variety of Grasps.
Sensors (Basel). 2024 Aug 5;24(15):5059. doi: 10.3390/s24155059.
4
Enhanced YOLOv7 integrated with small target enhancement for rapid detection of objects on water surfaces.集成小目标增强功能的增强型YOLOv7,用于快速检测水面上的物体。
Front Neurorobot. 2023 Dec 14;17:1315251. doi: 10.3389/fnbot.2023.1315251. eCollection 2023.
手部抓握数据集的肌肉协同分析:少量运动模块子集可能是多种抓握动作的基础。
Front Neurorobot. 2018 Sep 25;12:57. doi: 10.3389/fnbot.2018.00057. eCollection 2018.
4
Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.
5
A Benchmark Dataset and Saliency-Guided Stacked Autoencoders for Video-Based Salient Object Detection.基于视频的显著目标检测的基准数据集和显著引导堆叠自动编码器。
IEEE Trans Image Process. 2018 Jan;27(1):349-364. doi: 10.1109/TIP.2017.2762594. Epub 2017 Oct 12.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
7
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
8
Fast Image-Based Obstacle Detection From Unmanned Surface Vehicles.基于快速图像的无人水面艇障碍物检测。
IEEE Trans Cybern. 2016 Mar;46(3):641-54. doi: 10.1109/TCYB.2015.2412251. Epub 2015 Mar 31.
9
Object detection with discriminatively trained part-based models.基于判别式训练的部件模型的目标检测。
IEEE Trans Pattern Anal Mach Intell. 2010 Sep;32(9):1627-45. doi: 10.1109/TPAMI.2009.167.