• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于探索-利用的新型自适应律,用于智能无模型控制方法。

A Novel Exploration-Exploitation-Based Adaptive Law for Intelligent Model-Free Control Approaches.

作者信息

Tutsoy Onder, Barkana Duygun Erol, Balikci Kemal

出版信息

IEEE Trans Cybern. 2023 Jan;53(1):329-337. doi: 10.1109/TCYB.2021.3091680. Epub 2022 Dec 23.

DOI:10.1109/TCYB.2021.3091680
PMID:34398780
Abstract

Model-free control approaches require advanced exploration-exploitation policies to achieve practical tasks such as learning to bipedal robot walk in unstructured environments. In this article, we first construct a comprehensive exploration-exploitation policy that carries quality knowledge about the long-term predictor and the control policy, and the control signal of the model-free algorithms. Therefore, the developed model-free algorithm continues exploration by adjusting its unknown parameters until the desired learning and control are accomplished. Second, we provide an utterly model-free adaptive law enriched with the exploration-exploitation policy and derived step-by-step using the exact analogy of the model-based solution. The obtained adaptive control law considers the control signal saturation and the control signal (input) delay. Performed Lyapunov stability analysis ensures the convergence of the adaptive law that can also be used for intelligent control approaches. Third, we implement the adaptive algorithm in real time on a challenging benchmark system: a fourth-order, coupled dynamics, input saturated, and time-delayed underactuated manipulator. The results show that the proposed adaptive algorithm explores larger state-action spaces and treats the vanishing gradient problem in both learning and control. Also, we notice from the results that the learning and control properties of the adaptive algorithm are optimized as required.

摘要

无模型控制方法需要先进的探索-利用策略来完成诸如学习让双足机器人在非结构化环境中行走等实际任务。在本文中,我们首先构建了一种综合的探索-利用策略,该策略包含有关长期预测器和控制策略的质量知识以及无模型算法的控制信号。因此,所开发的无模型算法通过调整其未知参数持续进行探索,直到实现期望的学习和控制。其次,我们提供了一种完全无模型的自适应律,该自适应律丰富了探索-利用策略,并使用基于模型的解决方案的精确类比逐步推导得出。所获得的自适应控制律考虑了控制信号饱和以及控制信号(输入)延迟。进行的李雅普诺夫稳定性分析确保了自适应律的收敛,该自适应律也可用于智能控制方法。第三,我们在一个具有挑战性的基准系统上实时实现了自适应算法:一个四阶、耦合动力学、输入饱和且具有时间延迟的欠驱动机械手。结果表明,所提出的自适应算法探索了更大的状态-动作空间,并在学习和控制中处理了梯度消失问题。此外,我们从结果中注意到,自适应算法的学习和控制特性根据要求得到了优化。

相似文献

1
A Novel Exploration-Exploitation-Based Adaptive Law for Intelligent Model-Free Control Approaches.一种基于探索-利用的新型自适应律,用于智能无模型控制方法。
IEEE Trans Cybern. 2023 Jan;53(1):329-337. doi: 10.1109/TCYB.2021.3091680. Epub 2022 Dec 23.
2
Model free adaptive control of the under-actuated robot manipulator with the chaotic dynamics.具有混沌动力学的欠驱动机器人机械臂的无模型自适应控制
ISA Trans. 2021 Dec;118:106-115. doi: 10.1016/j.isatra.2021.02.006. Epub 2021 Feb 8.
3
Fuzzy-neural-network inherited sliding-mode control for robot manipulator including actuator dynamics.带执行器动态的机器人操纵器的模糊神经网络遗传滑模控制。
IEEE Trans Neural Netw Learn Syst. 2013 Feb;24(2):274-87. doi: 10.1109/TNNLS.2012.2228230.
4
Design of a completely model free adaptive control in the presence of parametric, non-parametric uncertainties and random control signal delay.设计一种完全无模型的自适应控制方法,用于处理参数不确定性、非参数不确定性和随机控制信号延迟问题。
ISA Trans. 2018 May;76:67-77. doi: 10.1016/j.isatra.2018.03.002. Epub 2018 Mar 14.
5
Improved intelligent clonal optimizer based on adaptive parameter strategy.基于自适应参数策略的改进型智能克隆优化器。
Math Biosci Eng. 2022 Jul 21;19(10):10275-10315. doi: 10.3934/mbe.2022481.
6
Adaptive Neural-Network Controller for an Uncertain Rigid Manipulator With Input Saturation and Full-Order State Constraint.具有输入饱和和满阶状态约束的不确定刚性机械手的自适应神经网络控制器。
IEEE Trans Cybern. 2022 May;52(5):2907-2915. doi: 10.1109/TCYB.2020.3022084. Epub 2022 May 19.
7
Composite-Learning-Based Adaptive Neural Control for Dual-Arm Robots With Relative Motion.基于组合学习的双臂机器人相对运动自适应神经控制。
IEEE Trans Neural Netw Learn Syst. 2022 Mar;33(3):1010-1021. doi: 10.1109/TNNLS.2020.3037795. Epub 2022 Feb 28.
8
Adaptive fuzzy neural network control design via a T-S fuzzy model for a robot manipulator including actuator dynamics.基于T-S模糊模型的机器人机械手自适应模糊神经网络控制设计,包括执行器动力学。
IEEE Trans Syst Man Cybern B Cybern. 2008 Oct;38(5):1326-46. doi: 10.1109/TSMCB.2008.925749.
9
Sliding-mode controller synthesis of robotic manipulator based on a new modified reaching law.基于新改进趋近律的机器人滑模控制器综合
Math Biosci Eng. 2022 Apr 22;19(6):6362-6378. doi: 10.3934/mbe.2022298.
10
A fuzzy adaptive learning control network with on-line structure and parameter learning.一种具有在线结构和参数学习的模糊自适应学习控制网络。
Int J Neural Syst. 1996 Nov;7(5):569-90. doi: 10.1142/s0129065796000567.

引用本文的文献

1
Enhanced extreme learning machine via competitive learning SSA (CL-SSA) for load capacity factor prediction.基于竞争学习麻雀搜索算法(CL-SSA)的增强型极限学习机用于预测承载能力系数
Heliyon. 2025 Jan 11;11(2):e41892. doi: 10.1016/j.heliyon.2025.e41892. eCollection 2025 Jan 30.
2
Bidirectional recurrent learning of inverse dynamic models for robots with elastic joints: a real-time real-world implementation.具有弹性关节的机器人逆动力学模型的双向递归学习:实时真实世界实现
Front Neurorobot. 2023 Jun 16;17:1166911. doi: 10.3389/fnbot.2023.1166911. eCollection 2023.
3
A Concurrent Mission-Planning Methodology for Robotic Swarms Using Collaborative Motion-Control Strategies.
一种采用协作运动控制策略的机器人集群并发任务规划方法。
J Intell Robot Syst. 2023;108(2):15. doi: 10.1007/s10846-023-01881-8. Epub 2023 May 30.
4
A Survey on UAV Computing Platforms: A Hardware Reliability Perspective.从硬件可靠性角度对无人机计算平台的调查
Sensors (Basel). 2022 Aug 21;22(16):6286. doi: 10.3390/s22166286.
5
A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction.一种用于人机交互的混合近端策略优化(PAC)强化学习算法
Front Robot AI. 2022 Mar 9;9:797213. doi: 10.3389/frobt.2022.797213. eCollection 2022.