Adaptive optimal control of highly dissipative nonlinear spatially distributed processes with neuro-dynamic programming.

Publication information

IEEE Trans Neural Netw Learn Syst. 2015 Apr;26(4):684-96. doi: 10.1109/TNNLS.2014.2320744.

DOI:10.1109/TNNLS.2014.2320744

Abstract

Highly dissipative nonlinear partial differential equations (PDEs) are widely used to describe the dynamics of industrial spatially distributed processes (SDPs). In this paper, we consider the optimal control problem for general highly dissipative SDPs and propose an adaptive optimal control approach based on neuro-dynamic programming (NDP). First, Karhunen-Loève decomposition is employed to compute empirical eigenfunctions (EEFs) of the SDP using the method of snapshots. These EEFs, together with the singular perturbation technique, are then used to obtain a finite-dimensional slow subsystem of ordinary differential equations that accurately describes the dominant dynamics of the PDE system. Subsequently, the optimal control problem is reformulated on the basis of the slow subsystem and converted into the problem of solving a Hamilton-Jacobi-Bellman (HJB) equation. The HJB equation is a nonlinear PDE that has proven to be impossible to solve analytically. Thus, an adaptive optimal control method is developed via NDP that solves the HJB equation online, using a neural network (NN) to approximate the value function; an online NN weight tuning law is proposed that does not require an initial stabilizing control policy. Moreover, by taking the NN estimation error into account, we prove that the original closed-loop PDE system under the adaptive optimal control policy is semiglobally uniformly ultimately bounded. Finally, the developed method is tested on a nonlinear diffusion-convection-reaction process and applied to the temperature cooling fin of a high-speed aerospace vehicle; the results demonstrate its effectiveness.
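
The first step named in the abstract, computing EEFs by Karhunen-Loève decomposition with the method of snapshots, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `snapshot_eefs`, the uniform one-dimensional grid, the rectangle-rule inner product, the energy threshold, and the synthetic two-mode test data are all assumptions made here for illustration only.

```python
import numpy as np


def snapshot_eefs(snapshots, dx, energy=0.99):
    """Empirical eigenfunctions via the method of snapshots (illustrative sketch).

    snapshots : array of shape (n_x, K), one spatial profile per column
    dx        : uniform grid spacing (assumed), used as the quadrature weight
    energy    : fraction of snapshot energy the retained modes must capture
    """
    n_x, K = snapshots.shape
    # K x K correlation matrix of snapshot inner products, C_ij = <y_i, y_j> / K
    C = (snapshots.T @ snapshots) * dx / K
    eigvals, eigvecs = np.linalg.eigh(C)      # symmetric eigenproblem, ascending order
    order = np.argsort(eigvals)[::-1]         # re-sort to descending order
    eigvals = np.clip(eigvals[order], 0.0, None)  # remove tiny negative round-off
    eigvecs = eigvecs[:, order]
    # Smallest number of modes whose cumulative energy reaches the threshold
    cum = np.cumsum(eigvals) / np.sum(eigvals)
    n_modes = min(int(np.searchsorted(cum, energy)) + 1, K)
    # Each EEF is a linear combination of snapshots, scaled to unit discrete L2 norm
    eefs = snapshots @ eigvecs[:, :n_modes] / np.sqrt(K * eigvals[:n_modes])
    return eefs, eigvals[:n_modes]


# Synthetic snapshot data (hypothetical): two decaying spatial modes on [0, pi]
x = np.linspace(0.0, np.pi, 201)
dx = x[1] - x[0]
t = np.linspace(0.0, 2.0, 50)
snapshots = (np.outer(np.sin(x), np.exp(-t))
             + 0.1 * np.outer(np.sin(2 * x), np.exp(-4 * t)))

eefs, eigvals = snapshot_eefs(snapshots, dx, energy=0.99)
print("retained modes:", eefs.shape[1])
```

In the approach described above, the retained EEFs would then serve as basis functions for projecting the PDE onto a finite-dimensional slow subsystem; that projection and the singular perturbation argument are not shown here.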

Similar articles

Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network.

IEEE Trans Syst Man Cybern B Cybern. 2012 Dec;42(6):1538-49. doi: 10.1109/TSMCB.2012.2194781. Epub 2012 May 10.

Adaptive neural control design for nonlinear distributed parameter systems with persistent bounded disturbances.

IEEE Trans Neural Netw. 2009 Oct;20(10):1630-44. doi: 10.1109/TNN.2009.2028887. Epub 2009 Sep 9.

Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems.

IEEE Trans Cybern. 2014 Dec;44(12):2834-47. doi: 10.1109/TCYB.2014.2357896.

Data-Driven H∞ Control for Nonlinear Distributed Parameter Systems.

IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2949-61. doi: 10.1109/TNNLS.2015.2461023. Epub 2015 Aug 11.

Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming.

IEEE Trans Neural Netw Learn Syst. 2013 Mar;24(3):471-84. doi: 10.1109/TNNLS.2012.2234133.

Output Feedback-Based Boundary Control of Uncertain Coupled Semilinear Parabolic PDE Using Neurodynamic Programming.

IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):1263-1274. doi: 10.1109/TNNLS.2017.2669941. Epub 2017 Mar 6.

Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control.

IEEE Trans Neural Netw Learn Syst. 2012 Dec;23(12):1884-95. doi: 10.1109/TNNLS.2012.2217349.

Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation.

IEEE Trans Neural Netw. 2011 Nov;22(11):1757-69. doi: 10.1109/TNN.2011.2160968. Epub 2011 Sep 29.

Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence.

Neural Netw. 2009 Jul-Aug;22(5-6):851-60. doi: 10.1016/j.neunet.2009.06.014. Epub 2009 Jul 1.

Cited by

The Algorithms of Distributed Learning and Distributed Estimation about Intelligent Wireless Sensor Network.

Sensors (Basel). 2020 Feb 27;20(5):1302. doi: 10.3390/s20051302.
