
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning

Authors

Yu Jinglun, Su Yuancheng, Liao Yifan

Affiliations

Chongqing University-University of Cincinnati Joint Co-op Institute, Chongqing University, Chongqing, China.

Publication

Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.

Abstract

Existing mobile robots struggle with several path-planning requirements: autonomous learning, fast convergence, and smooth planned paths. To address these problems, neural networks can be used to let the robot perceive its environment and extract features, fitting the environment to a state-action function. By mapping the current state to actions through Hierarchical Reinforcement Learning (HRL), the needs of mobile robots are met, and a path planning model for mobile robots can be constructed based on neural networks and HRL. In this article, the proposed algorithm is compared with other path planning algorithms and evaluated to obtain an optimal learning algorithm system. This optimal system is then tested in different environments and scenarios to determine optimal learning conditions, thereby verifying the effectiveness of the proposed algorithm. Deep Deterministic Policy Gradient (DDPG), a path planning algorithm for mobile robots based on neural networks and hierarchical reinforcement learning, outperformed the other algorithms in all aspects. Specifically, compared with Double Deep Q-Learning (DDQN), DDPG achieves a shorter path planning time and fewer path steps. When an influence value is introduced, the algorithm shortens convergence time by 91% compared with the Q-learning algorithm and improves the smoothness of the planned path by 79%. The algorithm also generalizes well across scenarios. These results are significant for research on the guidance, precise positioning, and path planning of mobile robots.
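To make the comparison baseline concrete, the sketch below shows tabular Q-learning (one of the algorithms the abstract compares against) applied to path planning on a small grid world. This is an illustrative toy example, not the authors' implementation: the grid size, obstacle layout, reward values, and hyperparameters are all assumptions, and the paper's DDPG/HRL model operates on neural-network function approximators rather than a Q-table.

```python
import random

GRID = 5                                      # 5x5 grid; states are (row, col)
GOAL = (4, 4)
OBSTACLES = {(2, 2), (3, 2)}                  # assumed obstacle cells
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

def step(state, action):
    """Apply one move; walls and obstacles block movement."""
    r = min(max(state[0] + action[0], 0), GRID - 1)
    c = min(max(state[1] + action[1], 0), GRID - 1)
    nxt = (r, c)
    if nxt in OBSTACLES:
        return state, -1.0, False             # bump penalty, stay in place
    if nxt == GOAL:
        return nxt, 10.0, True                # goal reward, episode ends
    return nxt, -0.1, False                   # step cost encourages short paths

def train(episodes=2000, alpha=0.5, gamma=0.95, eps=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    Q = {}
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(100):
            if rng.random() < eps:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: Q.get((s, i), 0.0))
            s2, reward, done = step(s, ACTIONS[a])
            best_next = max(Q.get((s2, i), 0.0) for i in range(4))
            old = Q.get((s, a), 0.0)
            # Q-learning update: bootstrap on the greedy value of s2
            Q[(s, a)] = old + alpha * (reward + gamma * best_next - old)
            s = s2
            if done:
                break
    return Q

def greedy_path(Q, max_steps=50):
    """Roll out the learned greedy policy from the start cell."""
    s = (0, 0)
    path = [s]
    for _ in range(max_steps):
        a = max(range(4), key=lambda i: Q.get((s, i), 0.0))
        s, _, done = step(s, ACTIONS[a])
        path.append(s)
        if done:
            break
    return path
```

After training, the greedy rollout traces a collision-free path from (0, 0) to the goal; the slow convergence of exactly this kind of tabular update is what the abstract's 91% convergence-time improvement is measured against.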


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf96/7561669/a8d5e2a8b3aa/fnbot-14-00063-g0001.jpg
