• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于重复的模仿学习任务自适应方法。

Repetition-Based Approach for Task Adaptation in Imitation Learning.

机构信息

Graduate School of Engineering and Science, Shibaura Institute of Technology, Tokyo 135-8548, Japan.

Department of Information and Communications Engineering, Shibaura Institute of Technology, Tokyo 135-8548, Japan.

出版信息

Sensors (Basel). 2022 Sep 14;22(18):6959. doi: 10.3390/s22186959.

DOI:10.3390/s22186959
PMID:36146306
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9502931/
Abstract

Transfer learning is an effective approach for adapting an autonomous agent to a new target task by transferring knowledge learned from the previously learned source task. The major problem with traditional transfer learning is that it only focuses on optimizing learning performance on the target task. Thus, the performance on the target task may be improved in exchange for the deterioration of the source task's performance, resulting in an agent that is not able to revisit the earlier task. Therefore, transfer learning methods are still far from being comparable with the learning capability of humans, as humans can perform well on both source and new target tasks. In order to address this limitation, a task adaptation method for imitation learning is proposed in this paper. Being inspired by the idea of repetition learning in neuroscience, the proposed adaptation method enables the agent to repeatedly review the learned knowledge of the source task, while learning the new knowledge of the target task. This ensures that the learning performance on the target task is high, while the deterioration of the learning performance on the source task is small. A comprehensive evaluation over several simulated tasks with varying difficulty levels shows that the proposed method can provide high and consistent performance on both source and target tasks, outperforming existing transfer learning methods.

摘要

迁移学习是一种有效的方法,可以通过从先前学习的源任务中转移知识来适应新的目标任务。传统迁移学习的主要问题是它只关注于优化目标任务上的学习性能。因此,目标任务的性能可能会提高,而源任务的性能可能会恶化,导致代理无法重新访问早期任务。因此,迁移学习方法仍然远远不能与人类的学习能力相媲美,因为人类可以在源任务和新的目标任务上都表现出色。为了解决这个限制,本文提出了一种模仿学习的任务自适应方法。受神经科学中重复学习思想的启发,所提出的自适应方法使代理能够反复复习源任务的所学知识,同时学习目标任务的新知识。这确保了目标任务的学习性能较高,而源任务的学习性能恶化较小。在几个具有不同难度级别的模拟任务上的综合评估表明,所提出的方法可以在源任务和目标任务上提供高且一致的性能,优于现有的迁移学习方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/d08b28b5cef4/sensors-22-06959-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/14f383f0d070/sensors-22-06959-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/3d26e1f3ad21/sensors-22-06959-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/83d91019a9c9/sensors-22-06959-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/2d885cc6d3b2/sensors-22-06959-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/26223a203535/sensors-22-06959-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/e6cfc74a0459/sensors-22-06959-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/d08b28b5cef4/sensors-22-06959-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/14f383f0d070/sensors-22-06959-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/3d26e1f3ad21/sensors-22-06959-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/83d91019a9c9/sensors-22-06959-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/2d885cc6d3b2/sensors-22-06959-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/26223a203535/sensors-22-06959-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/e6cfc74a0459/sensors-22-06959-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5812/9502931/d08b28b5cef4/sensors-22-06959-g007.jpg

相似文献

1
Repetition-Based Approach for Task Adaptation in Imitation Learning.基于重复的模仿学习任务自适应方法。
Sensors (Basel). 2022 Sep 14;22(18):6959. doi: 10.3390/s22186959.
2
Domain Adaptation for Imitation Learning Using Generative Adversarial Network.基于生成对抗网络的模仿学习的领域自适应。
Sensors (Basel). 2021 Jul 9;21(14):4718. doi: 10.3390/s21144718.
3
Distributional generative adversarial imitation learning with reproducing kernel generalization.基于再生核泛化的分布生成对抗模仿学习。
Neural Netw. 2023 Aug;165:43-59. doi: 10.1016/j.neunet.2023.05.027. Epub 2023 May 25.
4
Domain Shift Preservation for Zero-Shot Domain Adaptation.零样本域适应中的域转移保留
IEEE Trans Image Process. 2021;30:5505-5517. doi: 10.1109/TIP.2021.3084354. Epub 2021 Jun 11.
5
Task's Choice: Pruning-Based Feature Sharing (PBFS) for Multi-Task Learning.任务选择:用于多任务学习的基于剪枝的特征共享(PBFS)
Entropy (Basel). 2022 Mar 21;24(3):432. doi: 10.3390/e24030432.
6
A Bayesian Developmental Approach to Robotic Goal-Based Imitation Learning.一种基于贝叶斯发展方法的机器人目标导向模仿学习
PLoS One. 2015 Nov 4;10(11):e0141965. doi: 10.1371/journal.pone.0141965. eCollection 2015.
7
Addressing implicit bias in adversarial imitation learning with mutual information.利用互信息解决对抗性模仿学习中的隐性偏差。
Neural Netw. 2023 Oct;167:847-864. doi: 10.1016/j.neunet.2023.08.058. Epub 2023 Sep 4.
8
Model expertise does not influence automatic imitation.模型专业知识不会影响自动模仿。
Exp Brain Res. 2022 Apr;240(4):1267-1277. doi: 10.1007/s00221-022-06338-2. Epub 2022 Feb 25.
9
Overcoming Long-Term Catastrophic Forgetting Through Adversarial Neural Pruning and Synaptic Consolidation.通过对抗性神经修剪和突触巩固来克服长期灾难性遗忘。
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4243-4256. doi: 10.1109/TNNLS.2021.3056201. Epub 2022 Aug 31.
10
BAGAIL: Multi-modal imitation learning from imbalanced demonstrations.贝加尔:基于不平衡演示的多模态模仿学习。
Neural Netw. 2024 Jun;174:106251. doi: 10.1016/j.neunet.2024.106251. Epub 2024 Mar 19.

本文引用的文献

1
A Transfer Learning Approach with a Convolutional Neural Network for the Classification of Lung Carcinoma.一种基于卷积神经网络的迁移学习方法用于肺癌分类
Healthcare (Basel). 2022 Jun 8;10(6):1058. doi: 10.3390/healthcare10061058.
2
Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning.机器人学习:深度强化学习、模仿学习、迁移学习。
Sensors (Basel). 2021 Feb 11;21(4):1278. doi: 10.3390/s21041278.
3
CNN-based transfer learning-BiLSTM network: A novel approach for COVID-19 infection detection.
基于卷积神经网络的迁移学习-双向长短期记忆网络:一种用于新冠病毒感染检测的新方法。
Appl Soft Comput. 2021 Jan;98:106912. doi: 10.1016/j.asoc.2020.106912. Epub 2020 Nov 18.
4
Deep Transfer Learning Based Classification Model for COVID-19 Disease.基于深度迁移学习的新冠肺炎疾病分类模型
Ing Rech Biomed. 2022 Apr;43(2):87-92. doi: 10.1016/j.irbm.2020.05.003. Epub 2020 May 20.
5
Effects of Repetition Learning on Associative Recognition Over Time: Role of the Hippocampus and Prefrontal Cortex.重复学习对联想识别随时间变化的影响:海马体和前额叶皮质的作用。
Front Hum Neurosci. 2018 Jul 11;12:277. doi: 10.3389/fnhum.2018.00277. eCollection 2018.
6
Memory: a contribution to experimental psychology.《记忆:对实验心理学的一项贡献》
Ann Neurosci. 2013 Oct;20(4):155-6. doi: 10.5214/ans.0972.7531.200408.