• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于SAC最优参数寻优算法的机器人恒力磨削控制器

Constant force grinding controller for robots based on SAC optimal parameter finding algorithm.

作者信息

Rei Chosei, Wang Qichao, Chen Linlin, Yan Xinhua, Zhang Peng, Fu Liwei, Wang Chong, Liu Xinghui

机构信息

Nobot Intelligent Equipment (Shandong) Co., Ltd, Liaocheng, Shandong, China.

School of Mechanical and Automotive Engineering, Liaocheng University, Liaocheng, Shandong, China.

出版信息

Sci Rep. 2024 Jun 19;14(1):14127. doi: 10.1038/s41598-024-63384-2.

DOI:10.1038/s41598-024-63384-2
PMID:38898063
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11724894/
Abstract

Since conventional PID (Proportional-Integral-Derivative) controllers hardly control the robot to stabilize for constant force grinding under changing environmental conditions, it is necessary to add a compensation term to conventional PID controllers. An optimal parameter finding algorithm based on SAC (Soft-Actor-Critic) is proposed to solve the problem that the compensation term parameters are difficult to obtain, including training state action and normalization preprocessing, reward function design, and targeted deep neural network design. The algorithm is used to find the optimal controller compensation term parameters and applied to the PID controller to complete the compensation through the inverse kinematics of the robot to achieve constant force grinding control. To verify the algorithm's feasibility, a simulation model of a grinding robot with sensible force information is established, and the simulation results show that the controller trained with the algorithm can achieve constant force grinding of the robot. Finally, the robot constant force grinding experimental system platform is built for testing, which verifies the control effect of the optimal parameter finding algorithm on the robot constant force grinding and has specific environmental adaptability.

摘要

由于传统的比例-积分-微分(PID)控制器在不断变化的环境条件下几乎无法控制机器人实现恒力磨削的稳定,因此有必要在传统PID控制器中添加一个补偿项。提出了一种基于软演员-评论家(SAC)的最优参数寻优算法,以解决补偿项参数难以获取的问题,包括训练状态动作和归一化预处理、奖励函数设计以及目标深度神经网络设计。该算法用于寻找最优控制器补偿项参数,并应用于PID控制器,通过机器人的逆运动学完成补偿,以实现恒力磨削控制。为验证该算法的可行性,建立了具有力敏信息的磨削机器人仿真模型,仿真结果表明,用该算法训练的控制器能够实现机器人的恒力磨削。最后,搭建了机器人恒力磨削实验系统平台进行测试,验证了最优参数寻优算法对机器人恒力磨削的控制效果,且具有特定的环境适应性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/2c47ca02e38a/41598_2024_63384_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/b05b7e3fe0c4/41598_2024_63384_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/6383e014f371/41598_2024_63384_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/2c47ca02e38a/41598_2024_63384_Fig16_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/b05b7e3fe0c4/41598_2024_63384_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/6383e014f371/41598_2024_63384_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/995f/11724894/2c47ca02e38a/41598_2024_63384_Fig16_HTML.jpg

相似文献

1
Constant force grinding controller for robots based on SAC optimal parameter finding algorithm.基于SAC最优参数寻优算法的机器人恒力磨削控制器
Sci Rep. 2024 Jun 19;14(1):14127. doi: 10.1038/s41598-024-63384-2.
2
A controller of robot constant force grinding based on proximal policy optimization algorithm.基于近端策略优化算法的机器人恒力磨削控制器。
PLoS One. 2025 May 7;20(5):e0319440. doi: 10.1371/journal.pone.0319440. eCollection 2025.
3
Research on Surface Tracking and Constant Force Control of a Grinding Robot.研磨机器人的表面跟踪与恒力控制研究。
Sensors (Basel). 2023 May 12;23(10):4702. doi: 10.3390/s23104702.
4
Hybrid controller with neural network PID/FOPID operations for two-link rigid robot manipulator based on the zebra optimization algorithm.基于斑马优化算法的两连杆刚性机器人机械臂神经网络PID/FOPID操作混合控制器
Front Robot AI. 2024 Jun 14;11:1386968. doi: 10.3389/frobt.2024.1386968. eCollection 2024.
5
Coot optimization algorithm-tuned neural network-enhanced PID controllers for robust trajectory tracking of three-link rigid robot manipulator.用于三连杆刚性机器人机械手鲁棒轨迹跟踪的布谷鸟优化算法调整的神经网络增强型PID控制器。
Heliyon. 2024 Jun 17;10(13):e32661. doi: 10.1016/j.heliyon.2024.e32661. eCollection 2024 Jul 15.
6
UAV Trajectory Tracking Using Proportional-Integral-Derivative-Type-2 Fuzzy Logic Controller with Genetic Algorithm Parameter Tuning.基于遗传算法参数调整的比例积分微分二型模糊逻辑控制器的无人机轨迹跟踪
Sensors (Basel). 2024 Oct 17;24(20):6678. doi: 10.3390/s24206678.
7
PD Control Compensation Based on a Cascade Neural Network Applied to a Robot Manipulator.基于级联神经网络的PD控制补偿在机器人操纵器中的应用
Front Neurorobot. 2020 Dec 3;14:577749. doi: 10.3389/fnbot.2020.577749. eCollection 2020.
8
A Hybrid Controller for Musculoskeletal Robots Targeting Lifting Tasks in Industrial Metaverse.
IEEE Trans Cybern. 2024 May;54(5):2708-2719. doi: 10.1109/TCYB.2024.3358739. Epub 2024 Apr 16.
9
Particle Swarm Optimization aided PID gait controller design for a humanoid robot.粒子群优化辅助 PID 步态控制器设计的仿人机器人。
ISA Trans. 2021 Aug;114:306-330. doi: 10.1016/j.isatra.2020.12.033. Epub 2020 Dec 19.
10
Simultaneous Control and Guidance of an AUV Based on Soft Actor-Critic.基于软演员-评论家的自主水下航行器同步控制与制导
Sensors (Basel). 2022 Aug 14;22(16):6072. doi: 10.3390/s22166072.

引用本文的文献

1
A Robot Floating Grinding and Rust Removal Approach Based on Composite Force-Position Fuzzy Control.一种基于复合力位模糊控制的机器人浮动磨削与除锈方法。
Sensors (Basel). 2025 Mar 31;25(7):2204. doi: 10.3390/s25072204.

本文引用的文献

1
The neurobiology of deep reinforcement learning.深度强化学习的神经生物学
Curr Biol. 2020 Jun 8;30(11):R629-R632. doi: 10.1016/j.cub.2020.04.021.