• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习整定的约束机器人视觉伺服模型预测控制。

Model predictive control for constrained robot manipulator visual servoing tuned by reinforcement learning.

机构信息

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Nantong street, Harbin 150001, China.

School of Electrical, Electronic, and Computer Engineering, The University of Western Australia, Crawley, WA 6009, Australia.

出版信息

Math Biosci Eng. 2023 Apr 10;20(6):10495-10513. doi: 10.3934/mbe.2023463.

DOI:10.3934/mbe.2023463
PMID:37322945
Abstract

For constrained image-based visual servoing (IBVS) of robot manipulators, a model predictive control (MPC) strategy tuned by reinforcement learning (RL) is proposed in this study. First, model predictive control is used to transform the image-based visual servo task into a nonlinear optimization problem while taking system constraints into consideration. In the design of the model predictive controller, a depth-independent visual servo model is presented as the predictive model. Next, a suitable model predictive control objective function weight matrix is trained and obtained by a deep-deterministic-policy-gradient-based (DDPG) RL algorithm. Then, the proposed controller gives the sequential joint signals, so that the robot manipulator can respond to the desired state quickly. Finally, appropriate comparative simulation experiments are developed to illustrate the efficacy and stability of the suggested strategy.

摘要

针对机器人的基于图像的视觉伺服(IBVS)的约束问题,本研究提出了一种通过强化学习(RL)调整的模型预测控制(MPC)策略。首先,模型预测控制用于将基于图像的视觉伺服任务转换为非线性优化问题,同时考虑系统约束。在模型预测控制器的设计中,提出了一种深度独立的视觉伺服模型作为预测模型。接下来,通过基于深度确定性策略梯度(DDPG)的 RL 算法,训练和获得合适的模型预测控制目标函数权重矩阵。然后,所提出的控制器给出了连续的关节信号,以便机器人能够快速响应期望状态。最后,进行了适当的对比仿真实验,以说明所提出策略的有效性和稳定性。

相似文献

1
Model predictive control for constrained robot manipulator visual servoing tuned by reinforcement learning.基于强化学习整定的约束机器人视觉伺服模型预测控制。
Math Biosci Eng. 2023 Apr 10;20(6):10495-10513. doi: 10.3934/mbe.2023463.
2
Tracking moving target for 6 degree-of-freedom robot manipulator with adaptive visual servoing based on deep reinforcement learning PID controller.基于深度强化学习PID控制器的自适应视觉伺服控制实现6自由度机器人机械手的运动目标跟踪
Rev Sci Instrum. 2022 Apr 1;93(4):045108. doi: 10.1063/5.0087561.
3
Hierarchical multiloop MPC scheme for robot manipulators with nonlinear disturbance observer.基于非线性干扰观测器的机器人机械手分层多回路模型预测控制方案
Math Biosci Eng. 2022 Aug 29;19(12):12601-12616. doi: 10.3934/mbe.2022588.
4
Robot manipulator visual servoing based on image moments and improved firefly optimization algorithm-based extreme learning machine.基于图像矩和改进的基于萤火虫优化算法的极限学习机的机器人操纵器视觉伺服控制
ISA Trans. 2023 Dec;143:188-204. doi: 10.1016/j.isatra.2023.10.010. Epub 2023 Oct 18.
5
Stable neurovisual servoing for robot manipulators.用于机器人操纵器的稳定神经视觉伺服控制
IEEE Trans Neural Netw. 2006 Jul;17(4):953-965. doi: 10.1109/TNN.2006.875993.
6
Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.基于强化学习和逆强化学习的蛇形机器人节能与损伤恢复蠕动步态设计。
Neural Netw. 2020 Sep;129:323-333. doi: 10.1016/j.neunet.2020.05.029. Epub 2020 Jun 16.
7
A data-driven acceleration-level scheme for image-based visual servoing of manipulators with unknown structure.一种用于结构未知的机器人基于图像视觉伺服的数据驱动加速水平方案。
Front Neurorobot. 2024 Mar 20;18:1380430. doi: 10.3389/fnbot.2024.1380430. eCollection 2024.
8
Variable Admittance Control Based on Fuzzy Reinforcement Learning for Minimally Invasive Surgery Manipulator.基于模糊强化学习的微创手术机械臂变阻抗控制。
Sensors (Basel). 2017 Apr 12;17(4):844. doi: 10.3390/s17040844.
9
Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning.基于深度强化学习的改进深度确定性策略梯度算法的机器人抓取方法优化
Rev Sci Instrum. 2021 Feb 1;92(2):025114. doi: 10.1063/5.0034101.
10
CLFs-based optimization control for a class of constrained visual servoing systems.基于分类学习滤波器的一类约束视觉伺服系统优化控制
ISA Trans. 2017 Mar;67:507-514. doi: 10.1016/j.isatra.2016.11.018. Epub 2016 Dec 7.