• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于人类优化控制器专业知识转移的互补学习方法。

A complementary learning approach for expertise transference of human-optimized controllers.

机构信息

School of Aerospace, Transport and Manufacturing, Cranfield University, Bedford MK43 0AL, UK.

出版信息

Neural Netw. 2022 Jan;145:33-41. doi: 10.1016/j.neunet.2021.10.009. Epub 2021 Oct 21.

DOI:10.1016/j.neunet.2021.10.009
PMID:34715533
Abstract

In this paper, a complementary learning scheme for experience transference of unknown continuous-time linear systems is proposed. The algorithm is inspired in the complementary learning properties that exhibit the hippocampus and neocortex learning systems via the striatum. The hippocampus is modelled as pattern-separated data of a human optimized controller. The neocortex is modelled as a Q-reinforcement learning algorithm which improves the hippocampus control policy. The complementary learning (striatum) is designed as an inverse reinforcement learning algorithm which relates the hippocampus and neocortex learning models to seek and transfer the weights of the hidden expert's utility function. Convergence of the proposed approach is analysed using Lyapunov recursions. Simulations are given to verify the proposed approach.

摘要

本文提出了一种用于未知连续时间线性系统经验迁移的补充学习方案。该算法的灵感来源于纹状体中展示海马体和新皮层学习系统的互补学习特性。海马体被建模为人类优化控制器的模式分离数据。新皮层被建模为 Q 强化学习算法,该算法改进了海马体控制策略。互补学习(纹状体)被设计为一种逆强化学习算法,它将海马体和新皮层学习模型联系起来,以寻求和转移隐藏专家效用函数的权重。使用 Lyapunov 递归分析了所提出方法的收敛性。给出了仿真结果以验证所提出的方法。

相似文献

1
A complementary learning approach for expertise transference of human-optimized controllers.一种用于人类优化控制器专业知识转移的互补学习方法。
Neural Netw. 2022 Jan;145:33-41. doi: 10.1016/j.neunet.2021.10.009. Epub 2021 Oct 21.
2
Hippocampus experience inference for safety critical control of unknown multi-agent linear systems.针对未知多智能体线性系统的安全关键控制的海马体经验推断。
ISA Trans. 2023 Jun;137:646-655. doi: 10.1016/j.isatra.2022.12.011. Epub 2022 Dec 16.
3
A complementary learning systems approach to temporal difference learning.一种互补学习系统方法对时间差分学习。
Neural Netw. 2020 Feb;122:218-230. doi: 10.1016/j.neunet.2019.10.011. Epub 2019 Oct 26.
4
Drone's Objective Inference Using Policy Error Inverse Reinforcement Learning.基于策略误差逆强化学习的无人机目标推理
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1329-1340. doi: 10.1109/TNNLS.2023.3333551. Epub 2025 Jan 7.
5
Inverse Reinforcement Learning in Tracking Control Based on Inverse Optimal Control.基于逆最优控制的跟踪控制中的逆强化学习。
IEEE Trans Cybern. 2022 Oct;52(10):10570-10581. doi: 10.1109/TCYB.2021.3062856. Epub 2022 Sep 19.
6
Inverse Reinforcement Learning for Trajectory Imitation Using Static Output Feedback Control.基于静态输出反馈控制的轨迹模仿逆强化学习
IEEE Trans Cybern. 2024 Mar;54(3):1695-1707. doi: 10.1109/TCYB.2023.3241015. Epub 2024 Feb 9.
7
Inverse Reinforcement Q-Learning Through Expert Imitation for Discrete-Time Systems.基于专家模仿的离散时间系统逆强化Q学习
IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2386-2399. doi: 10.1109/TNNLS.2021.3106635. Epub 2023 May 2.
8
Minibatch Recursive Least Squares Q-Learning.小批量递归最小二乘 Q 学习。
Comput Intell Neurosci. 2021 Oct 8;2021:5370281. doi: 10.1155/2021/5370281. eCollection 2021.
9
Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.基于强化学习和逆强化学习的蛇形机器人节能与损伤恢复蠕动步态设计。
Neural Netw. 2020 Sep;129:323-333. doi: 10.1016/j.neunet.2020.05.029. Epub 2020 Jun 16.
10
Kernel-based least squares policy iteration for reinforcement learning.用于强化学习的基于核的最小二乘策略迭代
IEEE Trans Neural Netw. 2007 Jul;18(4):973-92. doi: 10.1109/TNN.2007.899161.

引用本文的文献

1
Machine Learning-Based Surgical Planning for Neurosurgery: Artificial Intelligent Approaches to the Cranium.基于机器学习的神经外科手术规划:颅骨的人工智能方法
Front Surg. 2022 Apr 29;9:863633. doi: 10.3389/fsurg.2022.863633. eCollection 2022.