• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于动作依赖启发式动态规划的具有积分作用的在线离散时间LQR控制器设计,用于斗轮堆取料机的运行过程

Online discrete-time LQR controller design with integral action for bulk Bucket Wheel Reclaimer operational processes via Action-Dependent Heuristic Dynamic Programming.

作者信息

de Moura José Pinheiro, Rego Patrícia Helena Moraes, da Fonseca Neto João Viana

机构信息

UEMA, Brazil.

UEMA, Brazil.

出版信息

ISA Trans. 2019 Jul;90:294-310. doi: 10.1016/j.isatra.2019.01.010. Epub 2019 Jan 30.

DOI:10.1016/j.isatra.2019.01.010
PMID:30732992
Abstract

In this paper, a novel approach for online design of optimal control systems applied to the bulk resumption process by bucket wheel reclaimer (BWR) is presented. This approach is based on reinforcement learning paradigms, more specifically Action Dependent Heuristic Dynamic Programming (ADHDP), that learn online in real-time the Discrete Linear Quadratic Regulator (DLQR) optimal control solution with integral action. Due to the geometric irregularities of the storage yard stacks and variation in physical and chemical characteristics of the stacked material, the flow control of solid bulks by bucket wheel reclaimer requires methods that are suitable with the high degree of imprecision of process variables and environment uncertainties. The resumption of bulk solids is carried out by dividing the stack into layers, each layer is approximately 4 m high, and the layers are divided into workbenches up to 12 m in length. To take up a workbench several translation steps are required (penetration in the stack), with the translation step varying from 0 to 1 m. In order to maintain the desired ore flow throughout the process, the BWR lance speed must be periodically adjusted. The main advantage of the proposed control method is that besides the decision rule is fully independent of plant model, the gains of the resulting controller are self-adjustable. The control system was designed in such a way that the ADHDP-based DLQR controller with integral action would act in real-time in the plant control, using only the input and output signals and states measured along the system trajectory.

摘要

本文提出了一种应用于斗轮堆取料机(BWR)散料恢复过程的最优控制系统在线设计新方法。该方法基于强化学习范式,更具体地说是基于动作相关启发式动态规划(ADHDP),它能实时在线学习具有积分作用的离散线性二次调节器(DLQR)最优控制解。由于堆场料堆的几何不规则性以及堆存物料物理和化学特性的变化,斗轮堆取料机对固体散料的流量控制需要适合过程变量高度不精确性和环境不确定性的方法。散料的恢复是通过将料堆分层进行的,每层大约4米高,并且这些层被划分成长度达12米的工作台。为了占据一个工作台需要几个平移步骤(深入料堆),平移步长从0到1米不等。为了在整个过程中保持所需的矿石流量,必须定期调整BWR喷枪速度。所提出的控制方法的主要优点是,除了决策规则完全独立于工厂模型外,所得控制器的增益是可自我调整的。控制系统的设计方式是,基于ADHDP的具有积分作用的DLQR控制器将仅使用沿系统轨迹测量的输入、输出信号和状态在工厂控制中实时起作用。

相似文献

1
Online discrete-time LQR controller design with integral action for bulk Bucket Wheel Reclaimer operational processes via Action-Dependent Heuristic Dynamic Programming.基于动作依赖启发式动态规划的具有积分作用的在线离散时间LQR控制器设计,用于斗轮堆取料机的运行过程
ISA Trans. 2019 Jul;90:294-310. doi: 10.1016/j.isatra.2019.01.010. Epub 2019 Jan 30.
2
Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming.基于自适应动态规划的吸气式高超音速飞行器跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2017 Mar;28(3):584-598. doi: 10.1109/TNNLS.2016.2516948. Epub 2016 Feb 3.
3
Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming.基于作用相关启发式动态规划的盾构机土压平衡最优控制。
ISA Trans. 2019 Nov;94:28-35. doi: 10.1016/j.isatra.2019.04.007. Epub 2019 Apr 18.
4
Output Feedback Q-Learning Control for the Discrete-Time Linear Quadratic Regulator Problem.离散时间线性二次调节器问题的输出反馈Q学习控制
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1523-1536. doi: 10.1109/TNNLS.2018.2870075. Epub 2018 Oct 8.
5
Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators.基于在线逼近器的仿射非线性离散时间系统强化学习控制器设计
IEEE Trans Syst Man Cybern B Cybern. 2012 Apr;42(2):377-90. doi: 10.1109/TSMCB.2011.2166384. Epub 2011 Sep 23.
6
Optimal Tracking Control of Unknown Discrete-Time Linear Systems Using Input-Output Measured Data.基于输入-输出实测数据的未知离散时间线性系统的最优跟踪控制。
IEEE Trans Cybern. 2015 Dec;45(12):2770-9. doi: 10.1109/TCYB.2014.2384016. Epub 2015 Jan 6.
7
Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof.使用近似动态规划的离散时间非线性HJB解:收敛性证明
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):943-9. doi: 10.1109/TSMCB.2008.926614.
8
Adaptive Event-Triggered Control Based on Heuristic Dynamic Programming for Nonlinear Discrete-Time Systems.基于启发式动态规划的非线性离散时间系统自适应事件触发控制。
IEEE Trans Neural Netw Learn Syst. 2017 Jul;28(7):1594-1605. doi: 10.1109/TNNLS.2016.2541020. Epub 2016 Apr 8.
9
Reinforcement Learning-Based Linear Quadratic Regulation of Continuous-Time Systems Using Dynamic Output Feedback.基于强化学习的连续时间系统动态输出反馈线性二次调节
IEEE Trans Cybern. 2019 Jan 3. doi: 10.1109/TCYB.2018.2886735.
10
An optimal PID controller via LQR for standard second order plus time delay systems.一种用于标准二阶加时滞系统的基于线性二次调节器的最优比例积分微分(PID)控制器。
ISA Trans. 2016 Jan;60:244-253. doi: 10.1016/j.isatra.2015.11.020. Epub 2015 Dec 4.