• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于无人机的海事合成孔径雷达中数据驱动的锚框优化来增强两阶段目标检测模型。

Enhancing two-stage object detection models via data-driven anchor box optimization in UAV-based maritime SAR.

作者信息

Zhao Beigeng, Song Rui

机构信息

College of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang, China.

出版信息

Sci Rep. 2024 Feb 27;14(1):4765. doi: 10.1038/s41598-024-55570-z.

DOI:10.1038/s41598-024-55570-z
PMID:38413792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10899653/
Abstract

The high-altitude imaging capabilities of Unmanned Aerial Vehicles (UAVs) offer an effective solution for maritime Search and Rescue (SAR) operations. In such missions, the accurate identification of boats, personnel, and objects within images is crucial. While object detection models trained on general image datasets can be directly applied to these tasks, their effectiveness is limited due to the unique challenges posed by the specific characteristics of maritime SAR scenarios. Addressing this challenge, our study leverages the large-scale benchmark dataset SeaDronesSee, specific to UAV-based maritime SAR, to analyze and explore the unique attributes of image data in this scenario. We identify the need for optimization in detecting specific categories of difficult-to-detect objects within this context. Building on this, an anchor box optimization strategy is proposed based on clustering analysis, aimed at enhancing the performance of the renowned two-stage object detection models in this specialized task. Experiments were conducted to validate the proposed anchor box optimization method and to explore the underlying reasons for its effectiveness. The experimental results show our optimization method achieved a 45.8% and a 10% increase in average precision over the default anchor box configurations of torchvision and the SeaDronesSee official sample code configuration respectively. This enhancement was particularly evident in the model's significantly improved ability to detect swimmers, floaters, and life jackets on boats within the SeaDronesSee dataset's SAR scenarios. The methods and findings of this study are anticipated to provide the UAV-based maritime SAR research community with valuable insights into data characteristics and model optimization, offering a meaningful reference for future research.

摘要

无人机(UAV)的高空成像能力为海上搜索与救援(SAR)行动提供了一种有效的解决方案。在这类任务中,准确识别图像中的船只、人员和物体至关重要。虽然在通用图像数据集上训练的目标检测模型可以直接应用于这些任务,但由于海上SAR场景的特定特征带来的独特挑战,其有效性受到限制。为应对这一挑战,我们的研究利用了专门针对基于无人机的海上SAR的大规模基准数据集SeaDronesSee,来分析和探索该场景下图像数据的独特属性。我们确定了在此背景下检测特定类别难以检测物体时进行优化的必要性。在此基础上,提出了一种基于聚类分析的锚框优化策略,旨在提高著名的两阶段目标检测模型在这项专门任务中的性能。进行了实验以验证所提出的锚框优化方法,并探索其有效性的潜在原因。实验结果表明,我们的优化方法分别比torchvision的默认锚框配置和SeaDronesSee官方示例代码配置的平均精度提高了45.8%和10%。这种提升在模型检测SeaDronesSee数据集中SAR场景下船只上的游泳者、漂浮物和救生衣的能力显著提高方面尤为明显。预计本研究的方法和结果将为基于无人机的海上SAR研究社区提供有关数据特征和模型优化的宝贵见解,为未来研究提供有意义的参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/5d4c0e8dce91/41598_2024_55570_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/0e2cc3d07e41/41598_2024_55570_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/66779c6f47bf/41598_2024_55570_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/2dec9dd7a8ae/41598_2024_55570_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/96815745aa8d/41598_2024_55570_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/4aa35465a242/41598_2024_55570_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/5d4c0e8dce91/41598_2024_55570_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/0e2cc3d07e41/41598_2024_55570_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/66779c6f47bf/41598_2024_55570_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/2dec9dd7a8ae/41598_2024_55570_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/96815745aa8d/41598_2024_55570_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/4aa35465a242/41598_2024_55570_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99d3/10899653/5d4c0e8dce91/41598_2024_55570_Fig5_HTML.jpg

相似文献

1
Enhancing two-stage object detection models via data-driven anchor box optimization in UAV-based maritime SAR.通过基于无人机的海事合成孔径雷达中数据驱动的锚框优化来增强两阶段目标检测模型。
Sci Rep. 2024 Feb 27;14(1):4765. doi: 10.1038/s41598-024-55570-z.
2
Heuristic data-driven anchor generation for UAV-based maritime rescue image object detection.基于无人机的海上救援图像目标检测的启发式数据驱动锚点生成
Heliyon. 2024 May 10;10(10):e30485. doi: 10.1016/j.heliyon.2024.e30485. eCollection 2024 May 30.
3
Modular YOLOv8 optimization for real-time UAV maritime rescue object detection.用于实时无人机海上救援目标检测的模块化YOLOv8优化
Sci Rep. 2024 Oct 18;14(1):24492. doi: 10.1038/s41598-024-75807-1.
4
HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detection.HIT-UAV:基于无人机的目标检测用高空红外热数据集。
Sci Data. 2023 Apr 20;10(1):227. doi: 10.1038/s41597-023-02066-6.
5
UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios.无人机 - YOLOv8:一种基于改进YOLOv8的用于无人机航拍场景的小目标检测模型。
Sensors (Basel). 2023 Aug 15;23(16):7190. doi: 10.3390/s23167190.
6
Joint Resource Management and Trajectory Optimization for UAV-Enabled Maritime Network.无人机支持的海上网络的联合资源管理和轨迹优化。
Sensors (Basel). 2022 Dec 13;22(24):9763. doi: 10.3390/s22249763.
7
A detection method of the rescue targets in the marine casualty based on improved YOLOv5s.一种基于改进YOLOv5s的海上事故救援目标检测方法。
Front Neurorobot. 2022 Nov 16;16:1053124. doi: 10.3389/fnbot.2022.1053124. eCollection 2022.
8
Influence of Insufficient Dataset Augmentation on IoU and Detection Threshold in CNN Training for Object Detection on Aerial Images.在用于航空图像目标检测的 CNN 训练中,数据集不足对 IoU 和检测阈值的影响。
Sensors (Basel). 2022 Nov 23;22(23):9080. doi: 10.3390/s22239080.
9
Unsupervised Human Detection with an Embedded Vision System on a Fully Autonomous UAV for Search and Rescue Operations.基于嵌入式视觉系统的全自主无人机无监督人体检测在搜索救援行动中的应用。
Sensors (Basel). 2019 Aug 14;19(16):3542. doi: 10.3390/s19163542.
10
Visual image design of the internet of things based on AI intelligence.基于人工智能的物联网视觉图像设计
Heliyon. 2023 Nov 25;9(12):e22845. doi: 10.1016/j.heliyon.2023.e22845. eCollection 2023 Dec.

引用本文的文献

1
Enhanced YOLO11 for lightweight and accurate drone-based maritime search and rescue object detection.用于基于无人机的轻量级且精确的海上搜索与救援目标检测的增强型YOLO11
PLoS One. 2025 Jul 1;20(7):e0321920. doi: 10.1371/journal.pone.0321920. eCollection 2025.
2
Optimizing RetinaNet anchors using differential evolution for improved object detection.使用差分进化优化RetinaNet锚框以改进目标检测
Sci Rep. 2025 Jun 20;15(1):20101. doi: 10.1038/s41598-025-02888-x.
3
An unpaired SAR-to-optical image translation method based on Schrödinger bridge network and multi-scale feature fusion.

本文引用的文献

1
UAV remote sensing applications in marine monitoring: Knowledge visualization and review.UAV 遥感在海洋监测中的应用:知识可视化与综述。
Sci Total Environ. 2022 Sep 10;838(Pt 1):155939. doi: 10.1016/j.scitotenv.2022.155939. Epub 2022 May 13.
2
Deep learning techniques to classify agricultural crops through UAV imagery: a review.通过无人机图像对农作物进行分类的深度学习技术综述
Neural Comput Appl. 2022;34(12):9511-9536. doi: 10.1007/s00521-022-07104-9. Epub 2022 Mar 5.
3
Toward Integrated Large-Scale Environmental Monitoring Using WSN/UAV/Crowdsensing: A Review of Applications, Signal Processing, and Future Perspectives.
一种基于薛定谔桥网络和多尺度特征融合的非配对合成孔径雷达(SAR)到光学图像翻译方法。
Sci Rep. 2024 Nov 7;14(1):27047. doi: 10.1038/s41598-024-75762-x.
利用 WSN/UAV/Crowdsensing 实现综合大规模环境监测:应用、信号处理及未来展望综述。
Sensors (Basel). 2022 Feb 25;22(5):1824. doi: 10.3390/s22051824.
4
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges.航空图像中的目标检测:大规模基准测试与挑战
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7778-7796. doi: 10.1109/TPAMI.2021.3117983. Epub 2022 Oct 4.
5
Object Detection With Deep Learning: A Review.基于深度学习的目标检测研究综述。
IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3212-3232. doi: 10.1109/TNNLS.2018.2876865. Epub 2019 Jan 28.
6
TextBoxes++: A Single-Shot Oriented Scene Text Detector.TextBoxes++:一种单阶段的面向场景的文本检测器。
IEEE Trans Image Process. 2018 Aug;27(8):3676-3690. doi: 10.1109/TIP.2018.2825107. Epub 2018 Apr 9.
7
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.