Dong Lu, Zhong Xiangnan, Sun Changyin, He Haibo
IEEE Trans Neural Netw Learn Syst. 2017 Aug;28(8):1941-1952. doi: 10.1109/TNNLS.2016.2586303. Epub 2016 Aug 31.
In this paper, an event-triggered near optimal control structure is developed for nonlinear continuous-time systems with control constraints. Due to the saturating actuators, a nonquadratic cost function is introduced and the Hamilton-Jacobi-Bellman (HJB) equation for constrained nonlinear continuous-time systems is formulated. In order to solve the HJB equation, an actor-critic framework is presented. The critic network is used to approximate the cost function and the action network is used to estimate the optimal control law. In addition, in the proposed method, the control signal is transmitted in an aperiodic manner to reduce the computational and the transmission cost. Both the networks are only updated at the trigger instants decided by the event-triggered condition. Detailed Lyapunov analysis is provided to guarantee that the closed-loop event-triggered system is ultimately bounded. Three case studies are used to demonstrate the effectiveness of the proposed method.
本文针对具有控制约束的非线性连续时间系统,开发了一种事件触发的近最优控制结构。由于执行器饱和,引入了非二次代价函数,并建立了约束非线性连续时间系统的汉密尔顿-雅可比-贝尔曼(HJB)方程。为了求解HJB方程,提出了一种 actor-critic 框架。评论家网络用于逼近代价函数,动作网络用于估计最优控制律。此外,在所提出的方法中,控制信号以非周期方式传输,以降低计算和传输成本。两个网络仅在由事件触发条件决定的触发时刻进行更新。提供了详细的李雅普诺夫分析,以确保闭环事件触发系统最终有界。通过三个案例研究来证明所提方法的有效性。