具有幅值和速率饱和的欠驱动飞艇航迹跟踪控制

Path Following Control for Underactuated Airships with Magnitude and Rate Saturation.

作者信息

Gou Huabei, Guo Xiao, Lou Wenjie, Ou Jiajun, Yuan Jiace

机构信息

School of Aeronautic Science and Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China.

Frontier Institute of Science and Technology Innovation, Beijing University of Aeronautics and Astronautics, Beijing 100191, China.

出版信息

Sensors (Basel). 2020 Dec 15;20(24):7176. doi: 10.3390/s20247176.

DOI:10.3390/s20247176

PMID:33333882

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7765289/

Abstract

This paper proposes a reinforcement learning (RL) based path following strategy for underactuated airships with magnitude and rate saturation. The Markov decision process (MDP) model for the control problem is established. Then an error bounded line-of-sight (LOS) guidance law is investigated to restrain the state space. Subsequently, a proximal policy optimization (PPO) algorithm is employed to approximate the optimal action policy through trial and error. Since the optimal action policy is generated from the action space, the magnitude and rate saturation can be avoided. The simulation results, involving circular, general, broken-line, and anti-wind path following tasks, demonstrate that the proposed control scheme can transfer to new tasks without adaptation, and possesses satisfying real-time performance and robustness.

摘要

本文针对具有幅值和速率饱和的欠驱动飞艇，提出了一种基于强化学习（RL）的路径跟踪策略。建立了控制问题的马尔可夫决策过程（MDP）模型。然后研究了一种误差有界视线（LOS）制导律来限制状态空间。随后，采用近端策略优化（PPO）算法通过试错来逼近最优动作策略。由于最优动作策略是从动作空间生成的，因此可以避免幅值和速率饱和。涉及圆形、一般、折线和抗风路径跟踪任务的仿真结果表明，所提出的控制方案无需调整即可转移到新任务中，并且具有令人满意的实时性能和鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3e5/7765289/8c44c92211ab/sensors-20-07176-g001.jpg

相似文献

Path Following Control for Underactuated Airships with Magnitude and Rate Saturation.具有幅值和速率饱和的欠驱动飞艇航迹跟踪控制

Sensors (Basel). 2020 Dec 15;20(24):7176. doi: 10.3390/s20247176.

Robust Neuro-Optimal Control of Underactuated Snake Robots With Experience Replay.具有经验回放的欠驱动蛇形机器人的鲁棒神经最优控制。

IEEE Trans Neural Netw Learn Syst. 2018 Jan;29(1):208-217. doi: 10.1109/TNNLS.2017.2768820.

Improved adaptive integral line-of-sight guidance law and adaptive fuzzy path following control for underactuated MSV.欠驱动 MSV 的改进自适应积分视线路径跟踪制导律与自适应模糊跟踪控制

ISA Trans. 2019 Nov;94:151-163. doi: 10.1016/j.isatra.2019.04.010. Epub 2019 Apr 26.

Constrained control using novel nonlinear mapping for underactuated unmanned surface vehicles with unknown sideslip angle.基于新型非线性映射的欠驱动无人水面艇未知侧滑角约束控制

ISA Trans. 2023 Oct;141:261-275. doi: 10.1016/j.isatra.2023.06.034. Epub 2023 Jul 4.

Adaptive bounded neural network control for coordinated path-following of networked underactuated autonomous surface vehicles under time-varying state-dependent cyber-attack.时变状态相关网络攻击下欠驱动自主水面舰艇网络协同路径跟踪的自适应有界神经网络控制

ISA Trans. 2020 Sep;104:212-221. doi: 10.1016/j.isatra.2018.12.051. Epub 2019 Feb 1.

Comparing Deep Reinforcement Learning Algorithms' Ability to Safely Navigate Challenging Waters.比较深度强化学习算法在具有挑战性水域中安全导航的能力。

Front Robot AI. 2021 Sep 13;8:738113. doi: 10.3389/frobt.2021.738113. eCollection 2021.

Underactuated USV path following mechanism based on the cascade method.基于级联方法的欠驱动 USV 路径跟踪机制。

Sci Rep. 2022 Jan 27;12(1):1461. doi: 10.1038/s41598-022-05456-9.

Hierarchical approximate policy iteration with binary-tree state space decomposition.基于二叉树状态空间分解的分层近似策略迭代

IEEE Trans Neural Netw. 2011 Dec;22(12):1863-77. doi: 10.1109/TNN.2011.2168422. Epub 2011 Oct 10.

An Improved ELOS Guidance Law for Path Following of Underactuated Unmanned Surface Vehicles.一种用于欠驱动无人水面艇路径跟踪的改进型期望视线制导律

Sensors (Basel). 2024 Aug 20;24(16):5384. doi: 10.3390/s24165384.

An Improved Distributed Sampling PPO Algorithm Based on Beta Policy for Continuous Global Path Planning Scheme.基于贝塔策略的改进分布式采样 PPO 算法在连续全局路径规划方案中的应用。

Sensors (Basel). 2023 Jul 2;23(13):6101. doi: 10.3390/s23136101.

本文引用的文献

Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning.基于离策略强化学习的具有输入饱和的多智能体系统最优同步控制

IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):85-96. doi: 10.1109/TNNLS.2018.2832025. Epub 2018 May 24.

Fuzzy Finite-Time Command Filtered Control of Nonlinear Systems With Input Saturation.具有输入饱和的非线性系统的模糊有限时间命令滤波控制。

IEEE Trans Cybern. 2018 Aug;48(8):2378-2387. doi: 10.1109/TCYB.2017.2738648. Epub 2017 Aug 22.

Output feedback boundary control of an axially moving system with input saturation constraint.具有输入饱和约束的轴向运动系统的输出反馈边界控制

ISA Trans. 2017 May;68:22-32. doi: 10.1016/j.isatra.2017.02.009. Epub 2017 Mar 1.

Adaptive integral LOS path following for an unmanned airship with uncertainties based on robust RBFNN backstepping.基于鲁棒径向基函数神经网络反步控制的不确定无人飞艇自适应积分视线路径跟踪

ISA Trans. 2016 Nov;65:210-219. doi: 10.1016/j.isatra.2016.09.008. Epub 2016 Sep 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

具有幅值和速率饱和的欠驱动飞艇航迹跟踪控制

Path Following Control for Underactuated Airships with Magnitude and Rate Saturation.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献