Suppr超能文献

基于多目标强化学习的无人水面艇路径规划算法。

Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning.

机构信息

MOE Key Laboratory of Image Information Processing and Intelligent Control, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China.

Wuhan Second Ship Design and Research Institute, Wuhan 430205, China.

出版信息

Comput Intell Neurosci. 2023 Feb 15;2023:2146314. doi: 10.1155/2023/2146314. eCollection 2023.

Abstract

It is challenging to perform path planning tasks in complex marine environments as the unmanned surface vessel approaches the goal while avoiding obstacles. However, the conflict between the two subtarget tasks of obstacle avoidance and goal approaching makes the path planning difficult. Thus, a path planning method for unmanned surface vessel based on multiobjective reinforcement learning is proposed under the complex environment with high randomness and multiple dynamic obstacles. Firstly, the path planning scene is set as the main scene, and the two subtarget scenes including obstacle avoidance and goal approaching are divided from it. The action selection strategy in each subtarget scene is trained through the double deep -network with prioritized experience replay. A multiobjective reinforcement learning framework based on ensemble learning is further designed for policy integration in the main scene. Finally, by selecting the strategy from subtarget scenes in the designed framework, an optimized action selection strategy is trained and used for the action decision of the agent in the main scene. Compared with traditional value-based reinforcement learning methods, the proposed method achieves a 93% success rate in path planning in simulation scenes. Furthermore, the average length of the paths planned by the proposed method is 3.28% and 1.97% shorter than that of PER-DDQN and dueling DQN, respectively.

摘要

在无人水面舰艇接近目标的同时避免障碍物的情况下,在复杂的海洋环境中执行路径规划任务具有挑战性。然而,避障和趋近目标这两个子目标任务之间的冲突使得路径规划变得困难。因此,提出了一种基于多目标强化学习的无人水面舰艇路径规划方法,用于解决具有高度随机性和多个动态障碍物的复杂环境问题。首先,将路径规划场景设置为主场景,并从其中划分出包括避障和趋近目标在内的两个子目标场景。通过具有优先级经验回放的双深度网络,在每个子目标场景中训练动作选择策略。进一步设计了一个基于集成学习的多目标强化学习框架,用于在主场景中进行策略整合。最后,通过从设计的框架中的子目标场景中选择策略,训练并使用优化的动作选择策略来进行主场景中代理的动作决策。与传统的基于值的强化学习方法相比,该方法在模拟场景中的路径规划成功率达到 93%。此外,与 PER-DDQN 和 duel-ing DQN 相比,所提出方法规划的路径的平均长度分别缩短了 3.28%和 1.97%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06d5/9946747/2325399cf96a/CIN2023-2146314.001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验