Suppr超能文献

具有幅值和速率饱和的欠驱动飞艇航迹跟踪控制

Path Following Control for Underactuated Airships with Magnitude and Rate Saturation.

作者信息

Gou Huabei, Guo Xiao, Lou Wenjie, Ou Jiajun, Yuan Jiace

机构信息

School of Aeronautic Science and Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China.

Frontier Institute of Science and Technology Innovation, Beijing University of Aeronautics and Astronautics, Beijing 100191, China.

出版信息

Sensors (Basel). 2020 Dec 15;20(24):7176. doi: 10.3390/s20247176.

Abstract

This paper proposes a reinforcement learning (RL) based path following strategy for underactuated airships with magnitude and rate saturation. The Markov decision process (MDP) model for the control problem is established. Then an error bounded line-of-sight (LOS) guidance law is investigated to restrain the state space. Subsequently, a proximal policy optimization (PPO) algorithm is employed to approximate the optimal action policy through trial and error. Since the optimal action policy is generated from the action space, the magnitude and rate saturation can be avoided. The simulation results, involving circular, general, broken-line, and anti-wind path following tasks, demonstrate that the proposed control scheme can transfer to new tasks without adaptation, and possesses satisfying real-time performance and robustness.

摘要

本文针对具有幅值和速率饱和的欠驱动飞艇,提出了一种基于强化学习(RL)的路径跟踪策略。建立了控制问题的马尔可夫决策过程(MDP)模型。然后研究了一种误差有界视线(LOS)制导律来限制状态空间。随后,采用近端策略优化(PPO)算法通过试错来逼近最优动作策略。由于最优动作策略是从动作空间生成的,因此可以避免幅值和速率饱和。涉及圆形、一般、折线和抗风路径跟踪任务的仿真结果表明,所提出的控制方案无需调整即可转移到新任务中,并且具有令人满意的实时性能和鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3e5/7765289/8c44c92211ab/sensors-20-07176-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验