• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自适应滑模干扰观测器和深度强化学习的微定位器运动控制

Adaptive Sliding Mode Disturbance Observer and Deep Reinforcement Learning Based Motion Control for Micropositioners.

作者信息

Liang Shiyun, Xi Ruidong, Xiao Xiao, Yang Zhixin

机构信息

State Key Laboratory of Internet of Things for Smart City and Department of Electromechanical Engineering, University of Macau, Macau 999078, China.

Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen 518055, China.

出版信息

Micromachines (Basel). 2022 Mar 17;13(3):458. doi: 10.3390/mi13030458.

DOI:10.3390/mi13030458
PMID:35334749
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8955352/
Abstract

The motion control of high-precision electromechanitcal systems, such as micropositioners, is challenging in terms of the inherent high nonlinearity, the sensitivity to external interference, and the complexity of accurate identification of the model parameters. To cope with these problems, this work investigates a disturbance observer-based deep reinforcement learning control strategy to realize high robustness and precise tracking performance. Reinforcement learning has shown great potential as optimal control scheme, however, its application in micropositioning systems is still rare. Therefore, embedded with the integral differential compensator (ID), deep deterministic policy gradient (DDPG) is utilized in this work with the ability to not only decrease the state error but also improve the transient response speed. In addition, an adaptive sliding mode disturbance observer (ASMDO) is proposed to further eliminate the collective effect caused by the lumped disturbances. The micropositioner controlled by the proposed algorithm can track the target path precisely with less than 1 μm error in simulations and actual experiments, which shows the sterling performance and the accuracy improvement of the controller.

摘要

诸如微定位器等高精度机电系统的运动控制,在固有高度非线性、对外部干扰的敏感性以及模型参数精确辨识的复杂性方面都具有挑战性。为应对这些问题,本文研究了一种基于干扰观测器的深度强化学习控制策略,以实现高鲁棒性和精确跟踪性能。强化学习作为一种最优控制方案已展现出巨大潜力,然而,其在微定位系统中的应用仍然很少。因此,本文将深度确定性策略梯度(DDPG)与积分微分补偿器(ID)相结合,不仅能够减小状态误差,还能提高瞬态响应速度。此外,还提出了一种自适应滑模干扰观测器(ASMDO),以进一步消除集总干扰引起的综合影响。在仿真和实际实验中,由所提算法控制的微定位器能够精确跟踪目标路径,误差小于1μm,这表明了该控制器出色的性能和精度提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/076f7fb65853/micromachines-13-00458-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/41d5ada4e4e7/micromachines-13-00458-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/dbea1f9bf58d/micromachines-13-00458-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/a499cefe72c5/micromachines-13-00458-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/fed45ad9faa3/micromachines-13-00458-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/5f86605bc08c/micromachines-13-00458-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/65b0360568d5/micromachines-13-00458-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/751ff91c17c7/micromachines-13-00458-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/d6cd0e41bde9/micromachines-13-00458-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/17f88571eae9/micromachines-13-00458-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/8e53b703089f/micromachines-13-00458-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/6f486f5cb5bd/micromachines-13-00458-g011a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/88e8a7d38789/micromachines-13-00458-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/f436ea3e84e6/micromachines-13-00458-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/076f7fb65853/micromachines-13-00458-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/41d5ada4e4e7/micromachines-13-00458-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/dbea1f9bf58d/micromachines-13-00458-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/a499cefe72c5/micromachines-13-00458-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/fed45ad9faa3/micromachines-13-00458-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/5f86605bc08c/micromachines-13-00458-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/65b0360568d5/micromachines-13-00458-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/751ff91c17c7/micromachines-13-00458-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/d6cd0e41bde9/micromachines-13-00458-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/17f88571eae9/micromachines-13-00458-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/8e53b703089f/micromachines-13-00458-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/6f486f5cb5bd/micromachines-13-00458-g011a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/88e8a7d38789/micromachines-13-00458-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/f436ea3e84e6/micromachines-13-00458-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fdc/8955352/076f7fb65853/micromachines-13-00458-g014.jpg

相似文献

1
Adaptive Sliding Mode Disturbance Observer and Deep Reinforcement Learning Based Motion Control for Micropositioners.基于自适应滑模干扰观测器和深度强化学习的微定位器运动控制
Micromachines (Basel). 2022 Mar 17;13(3):458. doi: 10.3390/mi13030458.
2
Adaptive fuzzy PID cross coupled control for multi-axis motion system based on sliding mode disturbance observation.基于滑模干扰观测的多轴运动系统自适应模糊PID交叉耦合控制
Sci Prog. 2021 Apr-Jun;104(2):368504211011847. doi: 10.1177/00368504211011847.
3
Adaptive sliding mode current control with sliding mode disturbance observer for PMSM drives.基于滑模干扰观测器的永磁同步电机驱动器自适应滑模电流控制
ISA Trans. 2019 May;88:113-126. doi: 10.1016/j.isatra.2018.11.039. Epub 2018 Nov 30.
4
Robust Control Strategy of Gradient Magnetic Drive for Microrobots Based on Extended State Observer.基于扩张状态观测器的微机器人梯度磁驱动鲁棒控制策略
Cyborg Bionic Syst. 2022 Oct 21;2022:9835014. doi: 10.34133/2022/9835014. eCollection 2022.
5
Research on coupling control of multiple permanent magnet synchronous motors based on NAISMC and SMDO.基于 NAISMC 和 SMDO 的多永磁同步电机耦合控制研究。
PLoS One. 2023 Oct 20;18(10):e0292913. doi: 10.1371/journal.pone.0292913. eCollection 2023.
6
Path Following Control for Unmanned Surface Vehicles: A Reinforcement Learning-Based Method With Experimental Validation.无人水面舰艇的路径跟踪控制:一种基于强化学习的方法及实验验证
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):18237-18250. doi: 10.1109/TNNLS.2023.3313312. Epub 2024 Dec 2.
7
Fast integral terminal sliding mode control with a novel disturbance observer based on iterative learning for speed control of PMSM.基于迭代学习的新型干扰观测器的快速积分终端滑模控制在永磁同步电机速度控制中的应用。
ISA Trans. 2023 Mar;134:460-471. doi: 10.1016/j.isatra.2022.07.029. Epub 2022 Aug 23.
8
Adaptive sliding mode disturbance rejection control with prescribed performance for robotic manipulators.具有规定性能的机器人机械臂自适应滑模干扰抑制控制
ISA Trans. 2019 Aug;91:41-51. doi: 10.1016/j.isatra.2019.01.017. Epub 2019 Feb 4.
9
Adaptive Sliding-Mode Disturbance Observer-Based Finite-Time Control for Unmanned Aerial Manipulator With Prescribed Performance.基于自适应滑模干扰观测器的具有规定性能的无人空中机械手有限时间控制
IEEE Trans Cybern. 2023 May;53(5):3263-3276. doi: 10.1109/TCYB.2022.3168030. Epub 2023 Apr 21.
10
Sliding mode output feedback control based on tracking error observer with disturbance estimator.基于带有干扰估计器的跟踪误差观测器的滑模输出反馈控制
ISA Trans. 2014 Jul;53(4):1061-72. doi: 10.1016/j.isatra.2014.04.001. Epub 2014 May 1.

引用本文的文献

1
An enhanced deep deterministic policy gradient algorithm for intelligent control of robotic arms.一种用于机器人手臂智能控制的增强型深度确定性策略梯度算法。
Front Neuroinform. 2023 Jan 23;17:1096053. doi: 10.3389/fninf.2023.1096053. eCollection 2023.
2
Development of Magnetically Levitated Rotary Table for Repetitive Trajectory Tracking.磁悬浮旋转工作台的开发用于重复轨迹跟踪。
Sensors (Basel). 2022 Jun 3;22(11):4270. doi: 10.3390/s22114270.

本文引用的文献

1
Positioning Accuracy in Holographic Optical Traps.全息光镊中的定位精度。
Micromachines (Basel). 2021 May 15;12(5):559. doi: 10.3390/mi12050559.
2
Optimization of a Spin-Orbit Torque Switching Scheme Based on Micromagnetic Simulations and Reinforcement Learning.基于微磁模拟和强化学习的自旋轨道扭矩切换方案优化
Micromachines (Basel). 2021 Apr 15;12(4):443. doi: 10.3390/mi12040443.
3
Friction Compensation Control of Electromechanical Actuator Based on Neural Network Adaptive Sliding Mode.基于神经网络自适应滑模的机电作动器摩擦补偿控制
Sensors (Basel). 2021 Feb 22;21(4):1508. doi: 10.3390/s21041508.
4
Adaptive Fuzzy Sliding Mode Control for a Micro Gyroscope with Backstepping Controller.基于反步控制器的微陀螺仪自适应模糊滑模控制
Micromachines (Basel). 2020 Oct 29;11(11):968. doi: 10.3390/mi11110968.
5
A New Methodology for a Retrofitted Self-tuned Controller with Open-Source FPGA.基于开源 FPGA 的改进型自整定控制器的新方法
Sensors (Basel). 2020 Oct 29;20(21):6155. doi: 10.3390/s20216155.
6
A PID-Type Fuzzy Logic Controller-Based Approach for Motion Control Applications.一种基于PID型模糊逻辑控制器的运动控制应用方法。
Sensors (Basel). 2020 Sep 17;20(18):5323. doi: 10.3390/s20185323.
7
Autonomous robotic nanofabrication with reinforcement learning.基于强化学习的自主机器人纳米制造
Sci Adv. 2020 Sep 2;6(36). doi: 10.1126/sciadv.abb6987. Print 2020 Sep.
8
Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems.基于事件触发强化学习的完全未知连续时间非线性系统自适应跟踪控制
IEEE Trans Cybern. 2020 Jul;50(7):3231-3242. doi: 10.1109/TCYB.2019.2903108. Epub 2019 Mar 29.
9
Abdominal-Waving Control of Tethered Bumblebees Based on Sarsa With Transformed Reward.基于带变换奖励的 Sarsa 的有绳大黄蜂腹部摆动控制。
IEEE Trans Cybern. 2019 Aug;49(8):3064-3073. doi: 10.1109/TCYB.2018.2838595. Epub 2018 Jun 22.
10
Optimal and Autonomous Control Using Reinforcement Learning: A Survey.基于强化学习的最优和自主控制:综述。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2042-2062. doi: 10.1109/TNNLS.2017.2773458.