• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有经验回放的欠驱动蛇形机器人的鲁棒神经最优控制。

Robust Neuro-Optimal Control of Underactuated Snake Robots With Experience Replay.

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Jan;29(1):208-217. doi: 10.1109/TNNLS.2017.2768820.

DOI:10.1109/TNNLS.2017.2768820
PMID:29300697
Abstract

In this paper, the problem of path following for underactuated snake robots is investigated by using approximate dynamic programming and neural networks (NNs). The lateral undulatory gait of a snake robot is stabilized in a virtual holonomic constraint manifold through a partial feedback linearizing control law. Based on a dynamic compensator and Line-of-Sight guidance law, the path-following problem is transformed to a regulation problem of a nonlinear system with uncertainties. Subsequently, it is solved by an infinite horizon optimal control scheme using a single critic NN. A novel fluctuating learning algorithm is derived to approximate the associated cost function online and relax the initial stabilizing control requirement. The approximate optimal control input is derived by solving a modified Hamilton-Jacobi-Bellman equation. The conventional persistence of excitation condition is relaxed by using experience replay technique. The proposed control scheme ensures that all states of the snake robot are uniformly ultimate bounded which is analyzed by using the Lyapunov approach, and the tracking error asymptotically converges to a residual set. Simulation results are presented to verify the effectiveness of the proposed method.

摘要

本文采用近似动态规划和神经网络(NNs)研究了欠驱动蛇形机器人的路径跟踪问题。通过部分反馈线性化控制律,在虚拟完整约束流形中稳定蛇形机器人的横向波动步态。基于动态补偿器和视线制导律,将路径跟踪问题转化为具有不确定性的非线性系统的调节问题。随后,使用单个批评者 NN 通过无限 horizon 最优控制方案解决。导出了一种新颖的波动学习算法,用于在线近似相关成本函数,并放宽初始稳定控制要求。通过求解修正的 Hamilton-Jacobi-Bellman 方程推导出近似最优控制输入。通过使用经验回放技术放宽传统的持续激励条件。所提出的控制方案确保蛇形机器人的所有状态都是一致的有界的,这是通过 Lyapunov 方法进行分析的,并且跟踪误差渐近收敛于残差集。仿真结果验证了所提出方法的有效性。

相似文献

1
Robust Neuro-Optimal Control of Underactuated Snake Robots With Experience Replay.具有经验回放的欠驱动蛇形机器人的鲁棒神经最优控制。
IEEE Trans Neural Netw Learn Syst. 2018 Jan;29(1):208-217. doi: 10.1109/TNNLS.2017.2768820.
2
Adaptive Critic Designs for Event-Triggered Robust Control of Nonlinear Systems With Unknown Dynamics.自适应评论家设计在具有未知动态的非线性系统的事件触发鲁棒控制。
IEEE Trans Cybern. 2019 Jun;49(6):2255-2267. doi: 10.1109/TCYB.2018.2823199. Epub 2018 Apr 17.
3
Neural network robust tracking control with adaptive critic framework for uncertain nonlinear systems.基于自适应 critic 框架的不确定非线性系统神经网络鲁棒跟踪控制。
Neural Netw. 2018 Jan;97:11-18. doi: 10.1016/j.neunet.2017.09.005. Epub 2017 Sep 21.
4
Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update.基于时间的策略更新的未知内部动态仿射非线性离散时间系统的在线最优控制
IEEE Trans Neural Netw Learn Syst. 2012 Jul;23(7):1118-29. doi: 10.1109/TNNLS.2012.2196708.
5
Adaptive Dynamic Programming-Based Event-Triggered Robust Control for Multiplayer Nonzero-Sum Games With Unknown Dynamics.基于自适应动态规划的具有未知动态的多人非零和博弈的事件触发鲁棒控制
IEEE Trans Cybern. 2023 Aug;53(8):5151-5164. doi: 10.1109/TCYB.2022.3175650. Epub 2023 Jul 18.
6
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics.具有未知动态的非零和博弈系统最优控制的经验回放。
IEEE Trans Cybern. 2016 Mar;46(3):854-65. doi: 10.1109/TCYB.2015.2488680. Epub 2015 Oct 26.
7
Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances.自学习鲁棒最优控制用于具有失配干扰的连续时间非线性系统。
Neural Netw. 2018 Mar;99:19-30. doi: 10.1016/j.neunet.2017.11.022. Epub 2017 Dec 13.
8
Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints.基于强化学习的输入不确定约束未知非线性系统最优镇定。
IEEE Trans Neural Netw Learn Syst. 2020 Oct;31(10):4330-4340. doi: 10.1109/TNNLS.2019.2954983. Epub 2019 Dec 27.
9
Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems.基于神经网络的连续不确定非线性系统最优鲁棒保性能控制的在线 HJB 解法。
IEEE Trans Cybern. 2014 Dec;44(12):2834-47. doi: 10.1109/TCYB.2014.2357896.
10
Robust Neurooptimal Control for a Robot via Adaptive Dynamic Programming.通过自适应动态规划实现机器人的鲁棒神经最优控制。
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2584-2594. doi: 10.1109/TNNLS.2020.3006850. Epub 2021 Jun 2.