基于同步强化学习的自平衡自行车机器人组合控制算法

Combined control algorithm based on synchronous reinforcement learning for a self-balancing bicycle robot.

作者信息

Guo Lei, Lin Hongyu, Jiang Jiale, Song Yuan, Gan Dongming

机构信息

School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China.

School of Engineering Technology, Purdue University, West Lafayette, IN, USA.

出版信息

ISA Trans. 2024 Feb;145:479-492. doi: 10.1016/j.isatra.2023.11.032. Epub 2023 Nov 23.

DOI:10.1016/j.isatra.2023.11.032

PMID:38007371

Abstract

In this paper, balance control of a bicycle robot is studied without either a trail or a mechanical regulator when the robot moves in an approximately rectilinear motion. Based on the principle of moment balance, an input nonaffine nonlinear dynamics model of the bicycle robot is established. A driving velocity condition is proposed to maintain the robot balance. The nonaffine nonlinear system is transformed into an affine nonlinear system by defining the equivalent control. Subsequently, a feedback linearization controller is designed for the equivalent control. We design a combined control algorithm of synchronous policy iteration based on the actor-critic architecture. The actor neural network (NN) is designed based on the feedback linearization control law. Weight tuning laws for the critic and actor NNs are proposed. The system closed-loop stability and convergence of the NN weights are guaranteed based on the Lyapunov analysis. The optimality of the equivalent control policy is guaranteed. To satisfy the driving velocity condition, the values of the steering angle and driving velocity are determined based on the optimal equivalent control. The effectiveness of the proposed algorithm is verified through simulations and real experiments.

摘要

本文研究了自行车机器人在近似直线运动时，无拖尾或机械调节器情况下的平衡控制。基于力矩平衡原理，建立了自行车机器人的输入非仿射非线性动力学模型。提出了一个驱动速度条件以维持机器人平衡。通过定义等效控制，将非仿射非线性系统转化为仿射非线性系统。随后，为等效控制设计了反馈线性化控制器。我们基于行为-评判架构设计了一种同步策略迭代的组合控制算法。行为神经网络（NN）基于反馈线性化控制律进行设计。提出了评判和行为神经网络的权重调整律。基于李雅普诺夫分析保证了系统闭环稳定性和神经网络权重的收敛性。保证了等效控制策略的最优性。为满足驱动速度条件，基于最优等效控制确定转向角和驱动速度的值。通过仿真和实际实验验证了所提算法的有效性。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于同步强化学习的自平衡自行车机器人组合控制算法

Combined control algorithm based on synchronous reinforcement learning for a self-balancing bicycle robot.

作者信息

机构信息

出版信息

相似文献

基于同步强化学习的自平衡自行车机器人组合控制算法

Combined control algorithm based on synchronous reinforcement learning for a self-balancing bicycle robot.

作者信息

机构信息

出版信息

相似文献