• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于拓扑地图的多SARSA分层路径规划方法。

A Hierarchical Path Planning Approach with Multi-SARSA Based on Topological Map.

作者信息

Wen Shiguang, Jiang Yufan, Cui Ben, Gao Ke, Wang Fei

机构信息

Faculty of Robot Science and Engineering, Northeastern University, Shenyang 110169, China.

College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.

出版信息

Sensors (Basel). 2022 Mar 18;22(6):2367. doi: 10.3390/s22062367.

DOI:10.3390/s22062367
PMID:35336535
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8954451/
Abstract

In this paper, a novel path planning algorithm with Reinforcement Learning is proposed based on the topological map. The proposed algorithm has a two-level structure. At the first level, the proposed method generates the topological area using the region dynamic growth algorithm based on the grid map. In the next level, the Multi-SARSA method divided into two layers is applied to find a near-optimal global planning path, in which the artificial potential field method, first of all, is used to initialize the first Q table for faster learning speed, and then the second Q table is initialized with the connected domain obtained by topological map, which provides the prior information. A combination of the two algorithms makes the algorithm easier to converge. Simulation experiments for path planning have been executed. The results indicate that the method proposed in this paper can find the optimal path with a shorter path length, which demonstrates the effectiveness of the presented method.

摘要

本文提出了一种基于拓扑地图的具有强化学习的新型路径规划算法。所提出的算法具有两级结构。在第一级,该方法基于网格地图使用区域动态增长算法生成拓扑区域。在第二级,应用分为两层的多智能体 SARSA 方法来寻找近似最优的全局规划路径,其中首先使用人工势场法初始化第一个 Q 表以加快学习速度,然后用拓扑地图获得的连通域初始化第二个 Q 表,该连通域提供了先验信息。两种算法的结合使算法更易于收敛。已执行路径规划的仿真实验。结果表明,本文提出的方法能够找到路径长度更短的最优路径,证明了所提方法的有效性。

相似文献

1
A Hierarchical Path Planning Approach with Multi-SARSA Based on Topological Map.一种基于拓扑地图的多SARSA分层路径规划方法。
Sensors (Basel). 2022 Mar 18;22(6):2367. doi: 10.3390/s22062367.
2
A path planning approach for mobile robots using short and safe Q-learning.基于短程且安全 Q-学习的移动机器人路径规划方法。
PLoS One. 2022 Sep 26;17(9):e0275100. doi: 10.1371/journal.pone.0275100. eCollection 2022.
3
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning.基于神经网络和分层强化学习的移动机器人路径规划
Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.
4
A Path-Planning Approach Based on Potential and Dynamic Q-Learning for Mobile Robots in Unknown Environment.基于势场和动态 Q 学习的未知环境下移动机器人路径规划方法。
Comput Intell Neurosci. 2022 Jun 2;2022:2540546. doi: 10.1155/2022/2540546. eCollection 2022.
5
Biologically Inspired Complete Coverage Path Planning Algorithm Based on Q-Learning.基于 Q-learning 的生物启发式全覆盖路径规划算法。
Sensors (Basel). 2023 May 11;23(10):4647. doi: 10.3390/s23104647.
6
A novel Q-learning algorithm based on improved whale optimization algorithm for path planning.基于改进鲸鱼优化算法的新型 Q 学习算法在路径规划中的应用。
PLoS One. 2022 Dec 27;17(12):e0279438. doi: 10.1371/journal.pone.0279438. eCollection 2022.
7
Path Planning for Wheeled Mobile Robot in Partially Known Uneven Terrain.轮式移动机器人在部分未知非均匀地形中的路径规划。
Sensors (Basel). 2022 Jul 12;22(14):5217. doi: 10.3390/s22145217.
8
Rapid global path planning algorithm for unmanned surface vehicles in large-scale and multi-island marine environments.面向大规模多岛屿海洋环境的无人水面舰艇快速全局路径规划算法
PeerJ Comput Sci. 2021 Jun 29;7:e612. doi: 10.7717/peerj-cs.612. eCollection 2021.
9
CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning.CLSQL:基于连续局部搜索策略的移动机器人路径规划改进Q学习算法
Sensors (Basel). 2022 Aug 8;22(15):5910. doi: 10.3390/s22155910.
10
Particle Swarm Algorithm Path-Planning Method for Mobile Robots Based on Artificial Potential Fields.基于人工势场的移动机器人粒子群算法路径规划方法。
Sensors (Basel). 2023 Jul 1;23(13):6082. doi: 10.3390/s23136082.

引用本文的文献

1
Mapless Path Planning for Mobile Robot Based on Improved Deep Deterministic Policy Gradient Algorithm.基于改进深度确定性策略梯度算法的移动机器人无地图路径规划
Sensors (Basel). 2024 Aug 30;24(17):5667. doi: 10.3390/s24175667.
2
A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots.一种基于改进软演员-评论家算法的移动机器人路径规划方法。
Biomimetics (Basel). 2023 Oct 10;8(6):481. doi: 10.3390/biomimetics8060481.
3
Implementation of a Real-Time Object Pick-and-Place System Based on a Changing Strategy for Rapidly-Exploring Random Tree.

本文引用的文献

1
A layered goal-oriented fuzzy motion planning strategy for mobile robot navigation.一种用于移动机器人导航的分层目标导向模糊运动规划策略。
IEEE Trans Syst Man Cybern B Cybern. 2005 Dec;35(6):1214-24. doi: 10.1109/tsmcb.2005.850177.
基于快速探索随机树的变策略的实时目标抓取放置系统的实现。
Sensors (Basel). 2023 May 16;23(10):4814. doi: 10.3390/s23104814.
4
Security Analysis of Cyber-Physical Systems Using Reinforcement Learning.基于强化学习的网络物理系统安全分析。
Sensors (Basel). 2023 Feb 2;23(3):1634. doi: 10.3390/s23031634.
5
CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning.CLSQL:基于连续局部搜索策略的移动机器人路径规划改进Q学习算法
Sensors (Basel). 2022 Aug 8;22(15):5910. doi: 10.3390/s22155910.