• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于尖峰神经网络的机器人新型混合学习控制系统。

A new hybrid learning control system for robots based on spiking neural networks.

机构信息

School of Engineering, University Of Kent, UK.

Faculty Of Mechanical Engineering, University Of Tabriz, Tabriz, Iran.

出版信息

Neural Netw. 2024 Dec;180:106656. doi: 10.1016/j.neunet.2024.106656. Epub 2024 Aug 22.

DOI:10.1016/j.neunet.2024.106656
PMID:39208462
Abstract

This paper presents a new hybrid learning and control method that can tune their parameters based on reinforcement learning. In the new proposed method, nonlinear controllers are considered multi-input multi-output functions and then the functions are replaced with SNNs with reinforcement learning algorithms. Dopamine-modulated spike-timing-dependent plasticity (STDP) is used for reinforcement learning and manipulating the synaptic weights between the input and output of neuronal groups (for parameter adjustment). Details of the method are presented and some case studies are done on nonlinear controllers such as Fractional Order PID (FOPID) and Feedback Linearization. The structure and the dynamic equations for learning are presented, and the proposed algorithm is tested on robots and results are compared with other works. Moreover, to demonstrate the effectiveness of SNNFOPID, we conducted rigorous testing on a variety of systems including a two-wheel mobile robot, a double inverted pendulum, and a four-link manipulator robot. The results revealed impressively low errors of 0.01 m, 0.03 rad, and 0.03 rad for each system, respectively. The method is tested on another controller named Feedback Linearization, which provides acceptable results. Results show that the new method has better performance in terms of Integral Absolute Error (IAE) and is highly useful in hardware implementation due to its low energy consumption, high speed, and accuracy. The duration necessary for achieving full and stable proficiency in the control of various robotic systems using SNNFOPD, and SNNFL on an Asus Core i5 system within Simulink's Simscape environment is as follows: - Two-link robot manipulator with SNNFOPID: 19.85656 hours - Two-link robot manipulator with SNNFL: 0.45828 hours - Double inverted pendulum with SNNFOPID: 3.455 hours - Mobile robot with SNNFOPID: 3.71948 hours - Four-link robot manipulator with SNNFOPID: 16.6789 hours. This method can be generalized to other controllers and systems like robots.

摘要

本文提出了一种新的混合学习和控制方法,该方法可以基于强化学习来调整其参数。在新提出的方法中,非线性控制器被视为多输入多输出函数,然后这些函数被用具有强化学习算法的 SNN 替换。多巴胺调制的尖峰时间依赖可塑性(STDP)用于强化学习,并操纵神经元群体的输入和输出之间的突触权重(用于参数调整)。本文介绍了该方法的细节,并对非线性控制器(如分数阶 PID(FOPID)和反馈线性化)进行了一些案例研究。介绍了学习的结构和动态方程,并用机器人对提出的算法进行了测试,并将结果与其他工作进行了比较。此外,为了展示 SNNFOPID 的有效性,我们在各种系统(包括两轮移动机器人、双倒立摆和四连杆机械手机器人)上进行了严格的测试。结果令人印象深刻,每个系统的误差分别为 0.01 米、0.03 弧度和 0.03 弧度。该方法在另一个名为反馈线性化的控制器上进行了测试,结果令人满意。结果表明,该新方法在积分绝对误差(IAE)方面具有更好的性能,并且由于其低能耗、高速和准确性,在硬件实现中非常有用。在华硕 Core i5 系统内的 Simulink 的 Simscape 环境中,使用 SNNFOPD 和 SNNFL 实现对各种机器人系统的完全和稳定控制所需的时间如下:- 带有 SNNFOPID 的两连杆机器人:19.85656 小时- 带有 SNNFL 的两连杆机器人:0.45828 小时- 带有 SNNFOPID 的双倒立摆:3.455 小时- 带有 SNNFOPID 的移动机器人:3.71948 小时- 带有 SNNFOPID 的四连杆机器人:16.6789 小时。该方法可以推广到其他控制器和系统,如机器人。

相似文献

1
A new hybrid learning control system for robots based on spiking neural networks.基于尖峰神经网络的机器人新型混合学习控制系统。
Neural Netw. 2024 Dec;180:106656. doi: 10.1016/j.neunet.2024.106656. Epub 2024 Aug 22.
2
Hybrid controller with neural network PID/FOPID operations for two-link rigid robot manipulator based on the zebra optimization algorithm.基于斑马优化算法的两连杆刚性机器人机械臂神经网络PID/FOPID操作混合控制器
Front Robot AI. 2024 Jun 14;11:1386968. doi: 10.3389/frobt.2024.1386968. eCollection 2024.
3
Indirect and direct training of spiking neural networks for end-to-end control of a lane-keeping vehicle.用于车道保持车辆端到端控制的尖峰神经网络的间接和直接训练。
Neural Netw. 2020 Jan;121:21-36. doi: 10.1016/j.neunet.2019.05.019. Epub 2019 Jul 9.
4
An STDP training algorithm for a spiking neural network with dynamic threshold neurons.一种具有动态阈值神经元的尖峰神经网络的 STDP 训练算法。
Int J Neural Syst. 2010 Dec;20(6):463-80. doi: 10.1142/S0129065710002553.
5
A biologically plausible supervised learning method for spiking neural networks using the symmetric STDP rule.基于对称 STDP 规则的尖峰神经网络的生物合理有监督学习方法。
Neural Netw. 2020 Jan;121:387-395. doi: 10.1016/j.neunet.2019.09.007. Epub 2019 Sep 27.
6
Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity.通过调节尖峰时间依赖性突触可塑性进行强化学习。
Neural Comput. 2007 Jun;19(6):1468-502. doi: 10.1162/neco.2007.19.6.1468.
7
PD plus error-dependent integral nonlinear controllers for robot manipulators with an uncertain Jacobian matrix.具有不确定雅可比矩阵的机器人操纵器的 PD 加误差相关积分非线性控制器。
ISA Trans. 2012 Nov;51(6):792-800. doi: 10.1016/j.isatra.2012.06.003. Epub 2012 Jul 18.
8
Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses.具有随机和确定性突触的尖峰神经网络中的强化学习。
Neural Comput. 2019 Dec;31(12):2368-2389. doi: 10.1162/neco_a_01238. Epub 2019 Oct 15.
9
Performance analysis of two-degree of freedom fractional order PID controllers for robotic manipulator with payload.两自由度分数阶 PID 控制器在带负载机器人机械手中的性能分析。
ISA Trans. 2015 Sep;58:279-91. doi: 10.1016/j.isatra.2015.03.013. Epub 2015 Apr 18.
10
Coot optimization algorithm-tuned neural network-enhanced PID controllers for robust trajectory tracking of three-link rigid robot manipulator.用于三连杆刚性机器人机械手鲁棒轨迹跟踪的布谷鸟优化算法调整的神经网络增强型PID控制器。
Heliyon. 2024 Jun 17;10(13):e32661. doi: 10.1016/j.heliyon.2024.e32661. eCollection 2024 Jul 15.

引用本文的文献

1
Dynamic Cascade Spiking Neural Network Supervisory Controller for a Nonplanar Twelve-Rotor UAV.用于非平面十二旋翼无人机的动态级联脉冲神经网络监督控制器
Sensors (Basel). 2025 Feb 14;25(4):1177. doi: 10.3390/s25041177.