• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经批评学习与加速价值迭代的非线性模型预测控制。

Neural critic learning with accelerated value iteration for nonlinear model predictive control.

机构信息

Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing 100124, China; Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing 100124, China; Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing 100124, China.

出版信息

Neural Netw. 2024 Aug;176:106364. doi: 10.1016/j.neunet.2024.106364. Epub 2024 May 6.

DOI:10.1016/j.neunet.2024.106364
PMID:38754288
Abstract

In practical industrial processes, the receding optimization solution of nonlinear model predictive control (NMPC) is always a very knotty problem. Based on adaptive dynamic programming, the accelerated value iteration predictive control (AVI-PC) algorithm is developed in this paper. Integrating iteration learning with the receding horizon mechanism of NMPC, a novel receding optimization solution pattern is exploited to resolve the optimal control law in each prediction horizon. Besides, the basic architecture and the specific form of the AVI-PC algorithm are demonstrated, including the relationship among the iterative learning process, the prediction process, and the control process. On this basis, the convergence and admissibility conditions are established, and the relevant properties are comprehensively analyzed when the accelerated factor satisfies the established conditions. Furthermore, the accelerated value iterative function is approximated through the single critic network constructed by utilizing the multiple linear regression method. Finally, the plentiful simulation experiments are conducted from various perspectives to verify the effectiveness and progressiveness of the AVI-PC algorithm.

摘要

在实际工业过程中,非线性模型预测控制(NMPC)的滚动优化解一直是一个非常棘手的问题。本文基于自适应动态规划,开发了加速值迭代预测控制(AVI-PC)算法。通过将迭代学习与 NMPC 的滚动时域机制相结合,提出了一种新的滚动优化解决方案模式,用于解决每个预测时域中的最优控制律。此外,还展示了 AVI-PC 算法的基本架构和具体形式,包括迭代学习过程、预测过程和控制过程之间的关系。在此基础上,建立了收敛性和可容许性条件,并在加速因子满足所建立条件时综合分析了相关性质。进一步地,通过利用多元线性回归方法构建的单个评论家网络对加速值迭代函数进行了逼近。最后,从多个角度进行了大量的仿真实验,验证了 AVI-PC 算法的有效性和先进性。

相似文献

1
Neural critic learning with accelerated value iteration for nonlinear model predictive control.神经批评学习与加速价值迭代的非线性模型预测控制。
Neural Netw. 2024 Aug;176:106364. doi: 10.1016/j.neunet.2024.106364. Epub 2024 May 6.
2
Neural Q-learning for discrete-time nonlinear zero-sum games with adjustable convergence rate.具有可调收敛速度的离散时间非线性零和博弈的神经 Q 学习。
Neural Netw. 2024 Jul;175:106274. doi: 10.1016/j.neunet.2024.106274. Epub 2024 Mar 27.
3
Improved value iteration for neural-network-based stochastic optimal control design.基于神经网络的随机最优控制设计的改进价值迭代。
Neural Netw. 2020 Apr;124:280-295. doi: 10.1016/j.neunet.2020.01.004. Epub 2020 Jan 28.
4
Novel optimal trajectory tracking for nonlinear affine systems with an advanced critic learning structure.具有先进评价学习结构的非线性仿射系统的新型最优轨迹跟踪。
Neural Netw. 2022 Oct;154:131-140. doi: 10.1016/j.neunet.2022.07.019. Epub 2022 Jul 16.
5
Optimal H tracking control of nonlinear systems with zero-equilibrium-free via novel adaptive critic designs.通过新颖的自适应评价设计实现具有零平衡点的非线性系统的最优 H 跟踪控制。
Neural Netw. 2023 Jul;164:105-114. doi: 10.1016/j.neunet.2023.04.021. Epub 2023 Apr 20.
6
Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Admissibility and Termination Analysis.离散时间局部值迭代自适应动态规划:可容许性和终止分析。
IEEE Trans Neural Netw Learn Syst. 2017 Nov;28(11):2490-2502. doi: 10.1109/TNNLS.2016.2593743.
7
Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems.值迭代自适应动态规划在离散时间非线性系统最优控制中的应用。
IEEE Trans Cybern. 2016 Mar;46(3):840-53. doi: 10.1109/TCYB.2015.2492242. Epub 2015 Nov 2.
8
Error bounds of adaptive dynamic programming algorithms for solving undiscounted optimal control problems.自适应动态规划算法求解非折扣最优控制问题的误差界。
IEEE Trans Neural Netw Learn Syst. 2015 Jun;26(6):1323-34. doi: 10.1109/TNNLS.2015.2402203. Epub 2015 Mar 3.
9
Adaptive optimal control of affine nonlinear systems via identifier-critic neural network approximation with relaxed PE conditions.基于放松的 PE 条件的辨识 - 评论神经网络逼近的仿射非线性系统自适应最优控制。
Neural Netw. 2023 Oct;167:588-600. doi: 10.1016/j.neunet.2023.08.044. Epub 2023 Sep 1.
10
Neural-network-based discounted optimal control via an integrated value iteration with accuracy guarantee.基于神经网络的具有精度保证的整合价值迭代折扣最优控制。
Neural Netw. 2021 Dec;144:176-186. doi: 10.1016/j.neunet.2021.08.025. Epub 2021 Aug 28.