Suppr超能文献

用于一类非线性系统最优控制综合的单网络自适应评判器(SNAC)架构。

A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems.

作者信息

Padhi Radhakant, Unnikrishnan Nishant, Wang Xiaohua, Balakrishnan S N

机构信息

Department of Aerospace Engineering, Indian Institute of Science, Bangalore, India.

出版信息

Neural Netw. 2006 Dec;19(10):1648-60. doi: 10.1016/j.neunet.2006.08.010. Epub 2006 Oct 11.

Abstract

Even though dynamic programming offers an optimal control solution in a state feedback form, the method is overwhelmed by computational and storage requirements. Approximate dynamic programming implemented with an Adaptive Critic (AC) neural network structure has evolved as a powerful alternative technique that obviates the need for excessive computations and storage requirements in solving optimal control problems. In this paper, an improvement to the AC architecture, called the "Single Network Adaptive Critic (SNAC)" is presented. This approach is applicable to a wide class of nonlinear systems where the optimal control (stationary) equation can be explicitly expressed in terms of the state and costate variables. The selection of this terminology is guided by the fact that it eliminates the use of one neural network (namely the action network) that is part of a typical dual network AC setup. As a consequence, the SNAC architecture offers three potential advantages: a simpler architecture, lesser computational load and elimination of the approximation error associated with the eliminated network. In order to demonstrate these benefits and the control synthesis technique using SNAC, two problems have been solved with the AC and SNAC approaches and their computational performances are compared. One of these problems is a real-life Micro-Electro-Mechanical-system (MEMS) problem, which demonstrates that the SNAC technique is applicable to complex engineering systems.

摘要

尽管动态规划以状态反馈形式提供了最优控制解决方案,但该方法受到计算和存储需求的困扰。采用自适应评判器(AC)神经网络结构实现的近似动态规划已发展成为一种强大的替代技术,它在解决最优控制问题时无需过多的计算和存储需求。本文提出了对AC架构的一种改进,称为“单网络自适应评判器(SNAC)”。这种方法适用于广泛的一类非线性系统,在这类系统中,最优控制(稳态)方程可以根据状态和协态变量明确表示。选择这个术语的依据是它省去了典型双网络AC设置中作为一部分的一个神经网络(即动作网络)的使用。因此,SNAC架构具有三个潜在优势:架构更简单、计算负载更小以及消除了与被省去网络相关的近似误差。为了证明这些优势以及使用SNAC的控制综合技术,我们用AC和SNAC方法解决了两个问题,并比较了它们的计算性能。其中一个问题是一个实际的微机电系统(MEMS)问题,这表明SNAC技术适用于复杂的工程系统。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验