基于深度强化学习PID控制器的自适应视觉伺服控制实现6自由度机器人机械手的运动目标跟踪

Tracking moving target for 6 degree-of-freedom robot manipulator with adaptive visual servoing based on deep reinforcement learning PID controller.

作者信息

Wang Fei, Ren Baiming, Liu Yue, Cui Ben

机构信息

Faculty of Robot Science and Engineering, Northeastern University, NO. 195, Innovation Road, Hunnan District, Shenyang, People's Republic of China.

College of Information Science and Engineering, Northeastern University, NO. 3-11, Wenhua Road, Heping District, Shenyang, People's Republic of China.

出版信息

Rev Sci Instrum. 2022 Apr 1;93(4):045108. doi: 10.1063/5.0087561.

DOI:10.1063/5.0087561

PMID:35489878

Abstract

In this paper, an image-based visual servoing (IBVS) controller with a 6 degree-of-freedom robotic manipulator that tracks moving objects is investigated using the proposed Deep Q-Networks and proportional-integral-derivative (DQN-PID) controller. First, the classical IBVS controller and the problem of feature loss and large steady-state error for tracking moving targets are introduced. Then, a DQN-PID based IBVS method is proposed to solve the problem of feature loss and large steady-state error and improve the servo precision, as the existing methods are hard to use for solve the problems. Specifically, the IBVS method is inherited by our controller to build the tracking model, and a value-based reinforcement learning method is proposed as an adaptive law for dynamically tuning the PID parameters in the discrete space, which can track the moving target and keep the servo feature in the field of the camera. Finally, compared with the different existing methods, the DQN-PID based IBVS method has merits of higher accuracy and more stable tracking, or generalization.

摘要

本文利用所提出的深度Q网络与比例积分微分（DQN-PID）控制器，对具有6自由度机器人操纵器的基于图像的视觉伺服（IBVS）控制器进行了研究，该控制器用于跟踪移动目标。首先，介绍了经典的IBVS控制器以及跟踪移动目标时的特征丢失和稳态误差大的问题。然后，提出了一种基于DQN-PID的IBVS方法来解决特征丢失和稳态误差大的问题，并提高伺服精度，因为现有方法难以解决这些问题。具体而言，我们的控制器继承了IBVS方法来建立跟踪模型，并提出了一种基于值的强化学习方法作为在离散空间中动态调整PID参数的自适应律，该方法可以跟踪移动目标并在相机视野内保持伺服特征。最后，与不同的现有方法相比，基于DQN-PID的IBVS方法具有更高的精度、更稳定的跟踪性能或泛化能力。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于深度强化学习PID控制器的自适应视觉伺服控制实现6自由度机器人机械手的运动目标跟踪

Tracking moving target for 6 degree-of-freedom robot manipulator with adaptive visual servoing based on deep reinforcement learning PID controller.

作者信息

机构信息

出版信息

相似文献

基于深度强化学习PID控制器的自适应视觉伺服控制实现6自由度机器人机械手的运动目标跟踪

Tracking moving target for 6 degree-of-freedom robot manipulator with adaptive visual servoing based on deep reinforcement learning PID controller.

作者信息

机构信息

出版信息

相似文献