Trajectory Tracking within a Hierarchical Primitive-Based Learning Approach.

Author information

Radac Mircea-Bogdan

Affiliation

Department of Automation and Applied Informatics, Politehnica University of Timisoara, 300223 Timisoara, Romania.

Publication information

Entropy (Basel). 2022 Jun 28;24(7):889. doi: 10.3390/e24070889.

DOI:10.3390/e24070889

PMID:35885112

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9321877/

Abstract

A hierarchical learning control framework (HLF) has been validated on two affordable laboratory control systems: an active temperature control system (ATCS) and an electrical rheostatic braking system (EBS). The proposed HLF is data-driven and model-free, and is applicable to general control tracking tasks, which are omnipresent. At the lowermost level, L1, virtual state-feedback control is learned from input-output data using the recently proposed virtual state-feedback reference tuning (VSFRT) principle. L1 ensures linear reference model tracking (or matching) and thus indirect linearization of the closed-loop control system (CLCS). On top of L1, experiment-driven model-free iterative learning control (EDMFILC) is then applied to learn pairs of reference inputs and controlled outputs, termed primitives. The primitives' signals at the L2 level encode the CLCS dynamics, which are not explicitly used in the learning phase. Data reusability is applied to derive monotonic and safely guaranteed learning convergence. The primitives learned at the L2 level are finally used at the uppermost level, L3, where a decomposition/recomposition operation enables prediction of the optimal reference input that ensures optimal tracking of a previously unseen trajectory, without relearning by repetition as was required at level L2. Hence, the HLF enables control systems to generalize their tracking behavior to new scenarios by extrapolating their current knowledge base. The proposed HLF endows the CLCSs with learning, memorization and generalization features which are specific to intelligent organisms. This may be considered an advance toward intelligent, generalizable and adaptive control systems.
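The L3 decomposition/recomposition idea can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes the closed loop already behaves linearly (the property L1 is meant to provide) and stands in for it with a hypothetical first-order system `simulate`. The primitive library (`R`, `Y`), the Gaussian-bump references, the test trajectory `y_d`, and all numeric values are illustrative assumptions. In the paper the primitives are learned by EDMFILC at L2; here they are simply recorded responses. The unseen trajectory is decomposed onto the primitive outputs by least squares, and the same coefficients recompose the primitive inputs into a new reference.

```python
import numpy as np

def simulate(r, a=0.9, b=0.1):
    """Stand-in linear closed loop (assumed): y[k+1] = a*y[k] + b*r[k], y[0] = 0."""
    y = np.zeros(len(r))
    for k in range(len(r) - 1):
        y[k + 1] = a * y[k] + b * r[k]
    return y

N = 200
t = np.arange(N)

# Hypothetical primitive library: each column of R is a reference input
# (a shifted Gaussian bump) and the matching column of Y is the recorded
# closed-loop output for that input.
centers = np.arange(10, N, 10)
R = np.stack([np.exp(-0.5 * ((t - c0) / 8.0) ** 2) for c0 in centers], axis=1)
Y = np.stack([simulate(R[:, i]) for i in range(R.shape[1])], axis=1)

# Previously unseen desired trajectory (illustrative choice).
y_d = np.sin(2 * np.pi * t / N) ** 2

# Decomposition: express y_d as a weighted sum of primitive outputs.
c, *_ = np.linalg.lstsq(Y, y_d, rcond=None)

# Recomposition: apply the same weights to the primitive inputs.
r_new = R @ c

# Because the loop is linear with zero initial state, the response to r_new
# equals the same weighted sum of primitive outputs.
y_new = simulate(r_new)
print("RMS tracking error:", np.sqrt(np.mean((y_d - y_new) ** 2)))
```

Only the recorded pairs enter the computation of `r_new`; the model is used here just to generate data and to check the result. The tracking quality depends on how well the primitive outputs span the new trajectory.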

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ee5/9321877/2809ea79d31a/entropy-24-00889-g001.jpg

Similar articles

Model-Free Primitive-Based Iterative Learning Control Approach to Trajectory Tracking of MIMO Systems With Experimental Validation.

IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2925-38. doi: 10.1109/TNNLS.2015.2460258. Epub 2015 Aug 13.

Data-driven model reference control of MIMO vertical tank systems with model-free VRFT and Q-Learning.

ISA Trans. 2018 Feb;73:227-238. doi: 10.1016/j.isatra.2018.01.014. Epub 2018 Jan 8.

Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method.

IEEE Trans Neural Netw. 2011 Dec;22(12):2226-36. doi: 10.1109/TNN.2011.2168538. Epub 2011 Oct 13.

Model-Free Q-Learning for the Tracking Problem of Linear Discrete-Time Systems.

IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3191-3201. doi: 10.1109/TNNLS.2022.3195357. Epub 2024 Feb 29.

Optimal Tracking Control of Unknown Discrete-Time Linear Systems Using Input-Output Measured Data.

IEEE Trans Cybern. 2015 Dec;45(12):2770-9. doi: 10.1109/TCYB.2014.2384016. Epub 2015 Jan 6.

Learning from adaptive neural dynamic surface control of strict-feedback systems.

IEEE Trans Neural Netw Learn Syst. 2015 Jun;26(6):1247-59. doi: 10.1109/TNNLS.2014.2335749. Epub 2014 Jul 22.

Model-Free Adaptive Control for Unknown MIMO Nonaffine Nonlinear Discrete-Time Systems With Experimental Validation.

IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1727-1739. doi: 10.1109/TNNLS.2020.3043711. Epub 2022 Apr 4.

A Data-Driven ILC Framework for a Class of Nonlinear Discrete-Time Systems.

IEEE Trans Cybern. 2022 Jul;52(7):6143-6157. doi: 10.1109/TCYB.2020.3029596. Epub 2022 Jul 4.

Iterative learning-based decentralized adaptive tracker for large-scale systems: a digital redesign approach.

ISA Trans. 2011 Jul;50(3):344-56. doi: 10.1016/j.isatra.2011.01.007. Epub 2011 Feb 18.

Cited by

Near real-time online reinforcement learning with synchronous or asynchronous updates.

Sci Rep. 2025 May 17;15(1):17158. doi: 10.1038/s41598-025-00492-7.

Myoelectric Control in Rehabilitative and Assistive Soft Exoskeletons: A Comprehensive Review of Trends, Challenges, and Integration with Soft Robotic Devices.

Biomimetics (Basel). 2025 Apr 1;10(4):214. doi: 10.3390/biomimetics10040214.
