• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于学习的带有安全约束的月球车端到端路径规划。

Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints.

机构信息

School of Astronautics, Harbin Institute of Technology, Harbin 150002, China.

China Academy of Space Technology, Beijing 100094, China.

出版信息

Sensors (Basel). 2021 Jan 25;21(3):796. doi: 10.3390/s21030796.

DOI:10.3390/s21030796
PMID:33504073
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7866010/
Abstract

Path planning is an essential technology for lunar rover to achieve safe and efficient autonomous exploration mission, this paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. Firstly, a training environment integrating real lunar surface terrain data was built using the Gazebo simulation environment and a lunar rover simulator was created in it to simulate the real lunar surface environment and the lunar rover system. Then an end-to-end path planning algorithm based on deep reinforcement learning method is designed, including state space, action space, network structure, reward function considering slip behavior, and training method based on proximal policy optimization. In addition, to improve the generalization ability to different lunar surface topography and different scale environments, a variety of training scenarios were set up to train the network model using the idea of curriculum learning. The simulation results show that the proposed planning algorithm can successfully achieve the end-to-end path planning of the lunar rover, and the path generated by the proposed algorithm has a higher safety guarantee compared with the classical path planning algorithm.

摘要

路径规划是月球车实现安全高效自主探测任务的关键技术,本文提出了一种具有安全约束的基于学习的端到端路径规划算法。首先,使用 Gazebo 仿真环境构建了一个集成真实月面地形数据的训练环境,并在其中创建了一个月球车模拟器,以模拟真实的月面环境和月球车系统。然后,设计了一种基于深度强化学习方法的端到端路径规划算法,包括状态空间、动作空间、网络结构、考虑滑移行为的奖励函数以及基于近端策略优化的训练方法。此外,为了提高对不同月面地形和不同规模环境的泛化能力,采用课程学习的思想设置了多种训练场景来训练网络模型。仿真结果表明,所提出的规划算法能够成功实现月球车的端到端路径规划,并且与经典路径规划算法相比,所生成的路径具有更高的安全保障。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/7a3363aa4de2/sensors-21-00796-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/705cee2fb114/sensors-21-00796-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/fb2587239243/sensors-21-00796-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/bc756460e163/sensors-21-00796-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/9ecf1a44c1d6/sensors-21-00796-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/4ea844eb7a9f/sensors-21-00796-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/ce9226c3a5df/sensors-21-00796-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/99114dd375a7/sensors-21-00796-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/b5daeb08d952/sensors-21-00796-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/fd5ea1ddab39/sensors-21-00796-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/7a3363aa4de2/sensors-21-00796-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/705cee2fb114/sensors-21-00796-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/fb2587239243/sensors-21-00796-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/bc756460e163/sensors-21-00796-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/9ecf1a44c1d6/sensors-21-00796-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/4ea844eb7a9f/sensors-21-00796-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/ce9226c3a5df/sensors-21-00796-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/99114dd375a7/sensors-21-00796-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/b5daeb08d952/sensors-21-00796-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/fd5ea1ddab39/sensors-21-00796-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ebb8/7866010/7a3363aa4de2/sensors-21-00796-g010.jpg

相似文献

1
Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints.基于学习的带有安全约束的月球车端到端路径规划。
Sensors (Basel). 2021 Jan 25;21(3):796. doi: 10.3390/s21030796.
2
A Deep Learning Approach to Lunar Rover Global Path Planning Using Environmental Constraints and the Rover Internal Resource Status.一种利用环境约束和月球车内部资源状态进行月球车全局路径规划的深度学习方法。
Sensors (Basel). 2024 Jan 28;24(3):844. doi: 10.3390/s24030844.
3
Modeling of slip rate-dependent traversability for path planning of wheeled mobile robot in sandy terrain.沙地地形中轮式移动机器人路径规划的滑移率相关可通行性建模
Front Robot AI. 2024 Jan 25;11:1320261. doi: 10.3389/frobt.2024.1320261. eCollection 2024.
4
Value Iteration Networks with Double Estimator for Planetary Rover Path Planning.基于双估计量的价值迭代网络在行星探测车路径规划中的应用
Sensors (Basel). 2021 Dec 16;21(24):8418. doi: 10.3390/s21248418.
5
Deep Reinforcement Learning for Indoor Mobile Robot Path Planning.深度强化学习在室内移动机器人路径规划中的应用。
Sensors (Basel). 2020 Sep 25;20(19):5493. doi: 10.3390/s20195493.
6
Path Planning for Unmanned Surface Vehicles with Strong Generalization Ability Based on Improved Proximal Policy Optimization.基于改进近端策略优化的具有强泛化能力的无人水面艇路径规划
Sensors (Basel). 2023 Oct 31;23(21):8864. doi: 10.3390/s23218864.
7
An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning.基于深度强化学习的无人船自主路径规划模型。
Sensors (Basel). 2020 Jan 11;20(2):426. doi: 10.3390/s20020426.
8
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning.基于神经网络和分层强化学习的移动机器人路径规划
Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.
9
A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework.基于深度强化学习框架的无地图局部路径规划方法。
Sensors (Basel). 2023 Feb 10;23(4):2036. doi: 10.3390/s23042036.
10
Path planning of mobile robot based on improved TD3 algorithm in dynamic environment.动态环境下基于改进TD3算法的移动机器人路径规划
Heliyon. 2024 May 31;10(11):e32167. doi: 10.1016/j.heliyon.2024.e32167. eCollection 2024 Jun 15.

引用本文的文献

1
An Enhanced Artificial Lemming Algorithm and Its Application in UAV Path Planning.一种改进的人工旅鼠算法及其在无人机路径规划中的应用
Biomimetics (Basel). 2025 Jun 6;10(6):377. doi: 10.3390/biomimetics10060377.
2
Path Planning of Quadrupedal Robot Based on Improved RRT-Connect Algorithm.基于改进RRT-Connect算法的四足机器人路径规划
Sensors (Basel). 2025 Apr 18;25(8):2558. doi: 10.3390/s25082558.
3
Pseudo-Spectral Spatial Feature Extraction and Enhanced Fusion Image for Efficient Meter-Sized Lunar Impact Crater Automatic Detection in Digital Orthophoto Map.

本文引用的文献

1
A Two-Stage Method for Target Searching in the Path Planning for Mobile Robots.一种用于移动机器人路径规划中目标搜索的两阶段方法。
Sensors (Basel). 2020 Dec 3;20(23):6919. doi: 10.3390/s20236919.
2
Deep Reinforcement Learning for Indoor Mobile Robot Path Planning.深度强化学习在室内移动机器人路径规划中的应用。
Sensors (Basel). 2020 Sep 25;20(19):5493. doi: 10.3390/s20195493.
3
Deep Multi-Layer Perception Based Terrain Classification for Planetary Exploration Rovers.基于深度多层感知器的行星探测漫游车地形分类
用于数字正射影像图中高效米级月球撞击坑自动检测的伪谱空间特征提取与增强融合图像
Sensors (Basel). 2024 Aug 11;24(16):5206. doi: 10.3390/s24165206.
4
A Deep Learning Approach to Lunar Rover Global Path Planning Using Environmental Constraints and the Rover Internal Resource Status.一种利用环境约束和月球车内部资源状态进行月球车全局路径规划的深度学习方法。
Sensors (Basel). 2024 Jan 28;24(3):844. doi: 10.3390/s24030844.
5
Slip considered path planning of a novel angled-spoke based robot in a terrain mixture of granular media and rigid support.斯利普研究了一种新型斜辐条式机器人在颗粒介质与刚性支撑混合地形中的路径规划。
Sci Rep. 2023 Dec 11;13(1):21902. doi: 10.1038/s41598-023-49407-4.
6
Path Planning for Autonomous Mobile Robots: A Review.自主移动机器人路径规划:综述。
Sensors (Basel). 2021 Nov 26;21(23):7898. doi: 10.3390/s21237898.
7
A Grain-Scale Study of Mojave Mars Simulant (MMS-1).莫哈韦火星模拟物(MMS-1)的颗粒尺度研究。
Sensors (Basel). 2021 Jul 10;21(14):4730. doi: 10.3390/s21144730.
Sensors (Basel). 2019 Jul 13;19(14):3102. doi: 10.3390/s19143102.
4
China's present and future lunar exploration program.中国当前及未来的月球探测计划。
Science. 2019 Jul 19;365(6450):238-239. doi: 10.1126/science.aax9908.
5
Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning.基于视觉的机器人导航,通过结合无监督学习和分层强化学习。
Sensors (Basel). 2019 Apr 1;19(7):1576. doi: 10.3390/s19071576.
6
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.