• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进软演员-评论家算法的无人机实时路径规划

Real-time route planning of unmanned aerial vehicles based on improved soft actor-critic algorithm.

作者信息

Zhou Yuxiang, Shu Jiansheng, Zheng Xiaolong, Hao Hui, Song Huan

机构信息

Xi'an Research Institute of High Technology, Xi'an, China.

出版信息

Front Neurorobot. 2022 Dec 5;16:1025817. doi: 10.3389/fnbot.2022.1025817. eCollection 2022.

DOI:10.3389/fnbot.2022.1025817
PMID:36545396
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9762480/
Abstract

With the application and development of UAV technology and navigation and positioning technology, higher requirements are put forward for UAV maneuvering obstacle avoidance ability and real-time route planning. In this paper, for the problem of real-time UAV route planning in the unknown environment, we combine the ideas of artificial potential field method to modify the state observation and reward function, which solves the problem of sparse rewards of reinforcement learning algorithm, improves the convergence speed of the algorithm, and improves the generalization of the algorithm by step-by-step training based on the ideas of curriculum learning and transfer learning according to the difficulty of the task. The simulation results show that the improved SAC algorithm has fast convergence speed, good timeliness and strong generalization, and can better complete the UAV route planning task.

摘要

随着无人机技术以及导航定位技术的应用与发展,对无人机的机动避障能力和实时路径规划提出了更高的要求。针对未知环境下无人机实时路径规划问题,本文结合人工势场法的思想对状态观测和奖励函数进行修正,解决了强化学习算法奖励稀疏的问题,提高了算法的收敛速度,并根据任务难度基于课程学习和迁移学习的思想进行逐步训练,提升了算法的泛化能力。仿真结果表明,改进后的SAC算法收敛速度快、时效性好、泛化能力强,能够较好地完成无人机路径规划任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/3bc7192816be/fnbot-16-1025817-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/f3139b19fda4/fnbot-16-1025817-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/c6463c821b75/fnbot-16-1025817-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/77025d78c8ba/fnbot-16-1025817-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/f549bce8197b/fnbot-16-1025817-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/db0ee945123f/fnbot-16-1025817-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/37f686889752/fnbot-16-1025817-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/7ca8962a6a3a/fnbot-16-1025817-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/0a1ed63ee538/fnbot-16-1025817-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/65274f0e214d/fnbot-16-1025817-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/fea094921434/fnbot-16-1025817-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/66ca6859500b/fnbot-16-1025817-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/7030ce10cd7f/fnbot-16-1025817-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/3bc7192816be/fnbot-16-1025817-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/f3139b19fda4/fnbot-16-1025817-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/c6463c821b75/fnbot-16-1025817-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/77025d78c8ba/fnbot-16-1025817-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/f549bce8197b/fnbot-16-1025817-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/db0ee945123f/fnbot-16-1025817-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/37f686889752/fnbot-16-1025817-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/7ca8962a6a3a/fnbot-16-1025817-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/0a1ed63ee538/fnbot-16-1025817-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/65274f0e214d/fnbot-16-1025817-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/fea094921434/fnbot-16-1025817-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/66ca6859500b/fnbot-16-1025817-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/7030ce10cd7f/fnbot-16-1025817-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1720/9762480/3bc7192816be/fnbot-16-1025817-g013.jpg

相似文献

1
Real-time route planning of unmanned aerial vehicles based on improved soft actor-critic algorithm.基于改进软演员-评论家算法的无人机实时路径规划
Front Neurorobot. 2022 Dec 5;16:1025817. doi: 10.3389/fnbot.2022.1025817. eCollection 2022.
2
Path Planning for Unmanned Surface Vehicles with Strong Generalization Ability Based on Improved Proximal Policy Optimization.基于改进近端策略优化的具有强泛化能力的无人水面艇路径规划
Sensors (Basel). 2023 Oct 31;23(21):8864. doi: 10.3390/s23218864.
3
Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles.基于强化学习的多无人机安全决策方法研究
Front Neurorobot. 2023 Jan 10;16:1105480. doi: 10.3389/fnbot.2022.1105480. eCollection 2022.
4
Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance.多智能体联合近端策略优化算法控制协同固定翼无人机避障研究
Sensors (Basel). 2020 Aug 13;20(16):4546. doi: 10.3390/s20164546.
5
Path Planning Research of a UAV Base Station Searching for Disaster Victims' Location Information Based on Deep Reinforcement Learning.基于深度强化学习的无人机基站搜索灾难受害者位置信息路径规划研究
Entropy (Basel). 2022 Dec 2;24(12):1767. doi: 10.3390/e24121767.
6
Autonomous localized path planning algorithm for UAVs based on TD3 strategy.基于TD3策略的无人机自主局部路径规划算法
Sci Rep. 2024 Jan 8;14(1):763. doi: 10.1038/s41598-024-51349-4.
7
The Intelligent Path Planning System of Agricultural Robot via Reinforcement Learning.农业机器人的强化学习智能路径规划系统。
Sensors (Basel). 2022 Jun 7;22(12):4316. doi: 10.3390/s22124316.
8
A Real-Time Path Planning Method for Urban Low-Altitude Logistics UAVs.一种用于城市低空物流无人机的实时路径规划方法
Sensors (Basel). 2023 Aug 28;23(17):7472. doi: 10.3390/s23177472.
9
Sensor-Oriented Path Planning for Multiregion Surveillance with a Single Lightweight UAV SAR.基于单架轻型无人机合成孔径雷达的多区域监测的面向传感器路径规划
Sensors (Basel). 2018 Feb 11;18(2):548. doi: 10.3390/s18020548.
10
End-to-End AUV Motion Planning Method Based on Soft Actor-Critic.基于软动作 - 批评家的端到端 AUV 运动规划方法。
Sensors (Basel). 2021 Sep 1;21(17):5893. doi: 10.3390/s21175893.

本文引用的文献

1
A Survey on Curriculum Learning.课程学习调查
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4555-4576. doi: 10.1109/TPAMI.2021.3069908. Epub 2022 Aug 4.
2
The Path Planning of Mobile Robot by Neural Networks and Hierarchical Reinforcement Learning.基于神经网络和分层强化学习的移动机器人路径规划
Front Neurorobot. 2020 Oct 2;14:63. doi: 10.3389/fnbot.2020.00063. eCollection 2020.
3
Deep Reinforcement Learning for Indoor Mobile Robot Path Planning.深度强化学习在室内移动机器人路径规划中的应用。
Sensors (Basel). 2020 Sep 25;20(19):5493. doi: 10.3390/s20195493.
4
Efficient path planning for UAV formation via comprehensively improved particle swarm optimization.基于全面改进粒子群优化算法的无人机编队高效路径规划
ISA Trans. 2020 Feb;97:415-430. doi: 10.1016/j.isatra.2019.08.018. Epub 2019 Aug 8.