• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于随机制动的深度强化学习的无人水面航行器编队形成与路径跟踪控制

USV Formation and Path-Following Control via Deep Reinforcement Learning With Random Braking.

作者信息

Zhao Yujiao, Ma Yong, Hu Songlin

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5468-5478. doi: 10.1109/TNNLS.2021.3068762. Epub 2021 Nov 30.

DOI:10.1109/TNNLS.2021.3068762
PMID:33793404
Abstract

This article addresses the problem of path following for underactuated unmanned surface vessels (USVs) formation via a modified deep reinforcement learning with random braking (DRLRB). A formation control model based on deep reinforcement learning (DRL) is constructed to urge USVs to form a preset formation. Specifically, an efficient reward function is designed from the perspective of velocity and error distance of each USV related to the given formation, and then a novel random braking mechanism is formulated to prevent the training of the decision-making network from falling into the local optimum and failing to achieve the training objectives. Following that, a virtual leader-based path-following guidance system is developed for the USV formation problem. Wherein, with the aid of DRLRB, our proposed system can adjust formation automatically and flexibly even when some USVs deviate from the formation. Simulation verifies the effectiveness and superiority of our formation and path-following control strategy.

摘要

本文通过一种改进的带随机制动的深度强化学习(DRLRB)来解决欠驱动无人水面舰艇(USV)编队的路径跟踪问题。构建了基于深度强化学习(DRL)的编队控制模型,以促使无人水面舰艇形成预设编队。具体而言,从每个无人水面舰艇相对于给定编队的速度和误差距离的角度设计了一种有效的奖励函数,然后制定了一种新颖的随机制动机制,以防止决策网络的训练陷入局部最优并无法实现训练目标。在此基础上,针对无人水面舰艇编队问题开发了一种基于虚拟领航者的路径跟踪制导系统。其中,借助DRLRB,我们提出的系统即使在一些无人水面舰艇偏离编队时也能自动灵活地调整编队。仿真验证了我们的编队和路径跟踪控制策略的有效性和优越性。

相似文献

1
USV Formation and Path-Following Control via Deep Reinforcement Learning With Random Braking.基于随机制动的深度强化学习的无人水面航行器编队形成与路径跟踪控制
IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5468-5478. doi: 10.1109/TNNLS.2021.3068762. Epub 2021 Nov 30.
2
A Novel Reinforcement Learning Collision Avoidance Algorithm for USVs Based on Maneuvering Characteristics and COLREGs.基于机动特性和 COLREGs 的 USV 新型强化学习避碰算法。
Sensors (Basel). 2022 Mar 8;22(6):2099. doi: 10.3390/s22062099.
3
Constrained control using novel nonlinear mapping for underactuated unmanned surface vehicles with unknown sideslip angle.基于新型非线性映射的欠驱动无人水面艇未知侧滑角约束控制
ISA Trans. 2023 Oct;141:261-275. doi: 10.1016/j.isatra.2023.06.034. Epub 2023 Jul 4.
4
Distributed fixed-time formation tracking control for multiple underactuated USVs with lumped uncertainties and input saturation.具有集中不确定性和输入饱和的多艘欠驱动无人水面艇的分布式固定时间编队跟踪控制
ISA Trans. 2024 Nov;154:186-198. doi: 10.1016/j.isatra.2024.08.022. Epub 2024 Aug 27.
5
A finite-time path following scheme of unmanned surface vessels with an optimization strategy.一种具有优化策略的无人水面舰艇有限时间路径跟踪方案。
ISA Trans. 2024 Mar;146:61-74. doi: 10.1016/j.isatra.2024.01.016. Epub 2024 Jan 20.
6
Multi-under-Actuated Unmanned Surface Vessel Coordinated Path Tracking.多欠驱动无人水面艇协同路径跟踪
Sensors (Basel). 2020 Feb 6;20(3):864. doi: 10.3390/s20030864.
7
Underactuated USV path following mechanism based on the cascade method.基于级联方法的欠驱动 USV 路径跟踪机制。
Sci Rep. 2022 Jan 27;12(1):1461. doi: 10.1038/s41598-022-05456-9.
8
Dynamic Obstacle Avoidance for USVs Using Cross-Domain Deep Reinforcement Learning and Neural Network Model Predictive Controller.基于跨域深度强化学习和神经网络模型预测控制器的 USV 动态避障
Sensors (Basel). 2023 Mar 29;23(7):3572. doi: 10.3390/s23073572.
9
Adaptive Optimal Surrounding Control of Multiple Unmanned Surface Vessels via Actor-Critic Reinforcement Learning.基于智能体-评论家强化学习的多艘无人水面舰艇自适应最优周边控制
IEEE Trans Neural Netw Learn Syst. 2024 Oct 18;PP. doi: 10.1109/TNNLS.2024.3474289.
10
Line-of-sight-based global finite-time stable path following control of unmanned surface vehicles with actuator saturation.考虑执行器饱和的无人水面舰艇基于视线的全局有限时间稳定路径跟踪控制
ISA Trans. 2022 Jun;125:306-317. doi: 10.1016/j.isatra.2021.07.009. Epub 2021 Jul 8.

引用本文的文献

1
Event-triggered neuroadaptive predefined practical finite-time control for dynamic positioning vessels: A time-based generator approach.基于事件触发的神经自适应预定义实际有限时间控制在动力定位船舶中的应用:一种基于时间的生成器方法
Fundam Res. 2022 Oct 6;4(5):1254-1265. doi: 10.1016/j.fmre.2022.09.013. eCollection 2024 Sep.
2
IoT-Based Reinforcement Learning Using Probabilistic Model for Determining Extensive Exploration through Computational Intelligence for Next-Generation Techniques.基于物联网的强化学习使用概率模型通过计算智能确定广泛探索,以用于下一代技术。
Comput Intell Neurosci. 2023 Oct 10;2023:5113417. doi: 10.1155/2023/5113417. eCollection 2023.
3
Delay-Informed Intelligent Formation Control for UAV-Assisted IoT Application.
基于延迟感知的无人机辅助物联网应用智能编队控制。
Sensors (Basel). 2023 Jul 6;23(13):6190. doi: 10.3390/s23136190.