自适应评论家框架中的完全概率控制设计。

Fully probabilistic control design in an adaptive critic framework.

机构信息

Faculty of Engineering Technology, Al-Balsa Applied University, Jordan.

出版信息

Neural Netw. 2011 Dec;24(10):1128-35. doi: 10.1016/j.neunet.2011.06.006. Epub 2011 Jun 22.

DOI:10.1016/j.neunet.2011.06.006

Abstract

Optimal stochastic controller pushes the closed-loop behavior as close as possible to the desired one. The fully probabilistic design (FPD) uses probabilistic description of the desired closed loop and minimizes Kullback-Leibler divergence of the closed-loop description to the desired one. Practical exploitation of the fully probabilistic design control theory continues to be hindered by the computational complexities involved in numerically solving the associated stochastic dynamic programming problem; in particular, very hard multivariate integration and an approximate interpolation of the involved multivariate functions. This paper proposes a new fully probabilistic control algorithm that uses the adaptive critic methods to circumvent the need for explicitly evaluating the optimal value function, thereby dramatically reducing computational requirements. This is a main contribution of this paper.

摘要

最优随机控制器尽可能地使闭环行为接近期望行为。完全概率设计（FPD）使用期望闭环的概率描述，并最小化闭环描述与期望的 Kullback-Leibler 散度。完全概率设计控制理论的实际应用仍然受到相关随机动态规划问题数值求解所涉及的计算复杂性的阻碍；特别是非常困难的多元积分和所涉及的多元函数的近似插值。本文提出了一种新的完全概率控制算法，该算法使用自适应评价方法来避免需要显式评估最优值函数，从而大大降低计算要求。这是本文的主要贡献。

相似文献

Fully probabilistic control design in an adaptive critic framework.自适应评论家框架中的完全概率控制设计。

Neural Netw. 2011 Dec;24(10):1128-35. doi: 10.1016/j.neunet.2011.06.006. Epub 2011 Jun 22.

Probabilistic DHP adaptive critic for nonlinear stochastic control systems.概率 DHP 自适应评论家非线性随机控制系统。

Neural Netw. 2013 Jun;42:74-82. doi: 10.1016/j.neunet.2013.01.014. Epub 2013 Feb 4.

Fully probabilistic control for stochastic nonlinear control systems with input dependent noise.具有输入相关噪声的随机非线性控制系统的完全概率控制。

Neural Netw. 2015 Mar;63:199-207. doi: 10.1016/j.neunet.2014.12.004. Epub 2014 Dec 17.

L2- L(infinity) control of nonlinear fuzzy Itô stochastic delay systems via dynamic output feedback.基于动态输出反馈的非线性模糊伊藤随机时滞系统的 $L_2 - L_{\infty}$ 控制

IEEE Trans Syst Man Cybern B Cybern. 2009 Oct;39(5):1308-15. doi: 10.1109/TSMCB.2008.2012350. Epub 2009 Mar 24.

Adaptive NN output-feedback stabilization for a class of stochastic nonlinear strict-feedback systems.一类随机非线性严格反馈系统的自适应神经网络输出反馈镇定

ISA Trans. 2009 Oct;48(4):468-75. doi: 10.1016/j.isatra.2009.05.004. Epub 2009 Jun 26.

A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems.用于一类非线性系统最优控制综合的单网络自适应评判器（SNAC）架构。

Neural Netw. 2006 Dec;19(10):1648-60. doi: 10.1016/j.neunet.2006.08.010. Epub 2006 Oct 11.

A boundedness result for the direct heuristic dynamic programming.直接启发式动态规划的有界性结果。

Neural Netw. 2012 Aug;32:229-35. doi: 10.1016/j.neunet.2012.02.005. Epub 2012 Feb 14.

Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems.针对部分未知非线性系统的连续时间直接自适应最优控制的神经网络方法。

Neural Netw. 2009 Apr;22(3):237-46. doi: 10.1016/j.neunet.2009.03.008. Epub 2009 Mar 26.

Adaptive critic learning techniques for engine torque and air-fuel ratio control.用于发动机扭矩和空燃比控制的自适应评判学习技术。

IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):988-93. doi: 10.1109/TSMCB.2008.922019.

Error bounds of adaptive dynamic programming algorithms for solving undiscounted optimal control problems.自适应动态规划算法求解非折扣最优控制问题的误差界。

IEEE Trans Neural Netw Learn Syst. 2015 Jun;26(6):1323-34. doi: 10.1109/TNNLS.2015.2402203. Epub 2015 Mar 3.

引用本文的文献

Tracking Control for Output Probability Density Function of Stochastic Systems Using FPD Method.基于有限脉冲响应（FPD）方法的随机系统输出概率密度函数跟踪控制

Entropy (Basel). 2023 Jan 17;25(2):186. doi: 10.3390/e25020186.

Identification of novel key genes and potential candidate small molecule drugs in diabetic kidney disease using comprehensive bioinformatics analysis.运用综合生物信息学分析鉴定糖尿病肾病中的新型关键基因和潜在候选小分子药物

Front Genet. 2022 Aug 12;13:934555. doi: 10.3389/fgene.2022.934555. eCollection 2022.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

自适应评论家框架中的完全概率控制设计。

Fully probabilistic control design in an adaptive critic framework.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献