• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用状态-动作-预测自组织映射学习直观物理和单次模仿。

Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing Maps.

机构信息

Department of Bioengineering Sciences, Weihenstephan-Triesdorf University of Applied Sciences, Freising D-85354, Germany.

Computational Intelligence and Machine Learning Group, Department of Biophysics, University of Regensburg, Regensburg D-93053, Germany.

出版信息

Comput Intell Neurosci. 2021 Nov 12;2021:5590445. doi: 10.1155/2021/5590445. eCollection 2021.

DOI:10.1155/2021/5590445
PMID:34804145
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8604601/
Abstract

Human learning and intelligence work differently from the supervised pattern recognition approach adopted in most deep learning architectures. Humans seem to learn rich representations by exploration and imitation, build causal models of the world, and use both to flexibly solve new tasks. We suggest a simple but effective unsupervised model which develops such characteristics. The agent learns to represent the dynamical physical properties of its environment by intrinsically motivated exploration and performs inference on this representation to reach goals. For this, a set of self-organizing maps which represent state-action pairs is combined with a causal model for sequence prediction. The proposed system is evaluated in the environment. After an initial phase of playful exploration, the agent can execute kinematic simulations of the environment's future and use those for action planning. We demonstrate its performance on a set of several related, but different one-shot imitation tasks, which the agent flexibly solves in an active inference style.

摘要

人类的学习和智能与大多数深度学习架构所采用的监督模式识别方法不同。人类似乎通过探索和模仿来学习丰富的表示,建立对世界的因果模型,并利用这两者灵活地解决新任务。我们提出了一个简单但有效的无监督模型,该模型具有这些特点。该智能体通过内在激励的探索来学习表示环境的动态物理属性,并在此表示上进行推理以达到目标。为此,一组表示状态-动作对的自组织图与用于序列预测的因果模型相结合。所提出的系统在 环境中进行了评估。在初始的游戏探索阶段之后,智能体可以对环境未来的运动学进行模拟,并将其用于动作规划。我们在一组几个相关但不同的一次性模仿任务中展示了它的性能,智能体以主动推理的方式灵活地解决了这些任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/0b9d1057542f/CIN2021-5590445.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/7330e79fda60/CIN2021-5590445.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/8fe651838620/CIN2021-5590445.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/3d74316a371b/CIN2021-5590445.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/f46b984e55f2/CIN2021-5590445.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/0b9d1057542f/CIN2021-5590445.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/7330e79fda60/CIN2021-5590445.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/8fe651838620/CIN2021-5590445.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/3d74316a371b/CIN2021-5590445.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/f46b984e55f2/CIN2021-5590445.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/236e/8604601/0b9d1057542f/CIN2021-5590445.005.jpg

相似文献

1
Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing Maps.使用状态-动作-预测自组织映射学习直观物理和单次模仿。
Comput Intell Neurosci. 2021 Nov 12;2021:5590445. doi: 10.1155/2021/5590445. eCollection 2021.
2
A Bayesian Developmental Approach to Robotic Goal-Based Imitation Learning.一种基于贝叶斯发展方法的机器人目标导向模仿学习
PLoS One. 2015 Nov 4;10(11):e0141965. doi: 10.1371/journal.pone.0141965. eCollection 2015.
3
An active inference model of hierarchical action understanding, learning and imitation.一个分层动作理解、学习和模仿的主动推理模型。
Phys Life Rev. 2023 Sep;46:92-118. doi: 10.1016/j.plrev.2023.05.012. Epub 2023 Jun 5.
4
Physical reasoning is the missing link between action goals and kinematics: A comment on "An active inference model of hierarchical action understanding, learning, and imitation" by Proietti et al.物理推理是行动目标与运动学之间缺失的环节:对Proietti等人的《层次化行动理解、学习与模仿的主动推理模型》的评论
Phys Life Rev. 2024 Mar;48:198-200. doi: 10.1016/j.plrev.2023.08.017. Epub 2023 Sep 1.
5
Imitation and mirror systems in robots through Deep Modality Blending Networks.机器人中的模仿和镜像系统通过深度模态混合网络。
Neural Netw. 2022 Feb;146:22-35. doi: 10.1016/j.neunet.2021.11.004. Epub 2021 Nov 16.
6
Bayesian Disturbance Injection: Robust imitation learning of flexible policies for robot manipulation.贝叶斯干扰注入:用于机器人操作的灵活策略的稳健模仿学习。
Neural Netw. 2023 Jan;158:42-58. doi: 10.1016/j.neunet.2022.11.008. Epub 2022 Nov 11.
7
Intuitive physics learning in a deep-learning model inspired by developmental psychology.受发展心理学启发的深度学习模型中的直观物理学习。
Nat Hum Behav. 2022 Sep;6(9):1257-1267. doi: 10.1038/s41562-022-01394-8. Epub 2022 Jul 11.
8
A neural signature for combined action observation and motor imagery? An fNIRS study into prefrontal activation, automatic imitation, and self-other perceptions.联合动作观察和运动想象的神经特征?一项前额叶激活、自动模仿和自我-他人感知的功能性近红外光谱研究。
Brain Behav. 2022 Feb;12(2):e2407. doi: 10.1002/brb3.2407. Epub 2022 Jan 7.
9
One-Shot Imitation Drone Filming of Human Motion Videos.单次模仿无人机拍摄的人类运动视频。
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5335-5348. doi: 10.1109/TPAMI.2021.3067359. Epub 2022 Aug 4.
10
Developmental Approach for Behavior Learning Using Primitive Motion Skills.发展基于基本动作技能的行为学习方法。
Int J Neural Syst. 2018 May;28(4):1750038. doi: 10.1142/S0129065717500381. Epub 2017 Aug 6.

本文引用的文献

1
IntPhys 2019: A Benchmark for Visual Intuitive Physics Understanding.IntPhys2019:视觉直观物理理解基准
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5016-5025. doi: 10.1109/TPAMI.2021.3083839. Epub 2022 Aug 4.
2
Computational models of motivated frontal function.动机性额叶功能的计算模型。
Handb Clin Neurol. 2019;163:317-332. doi: 10.1016/B978-0-12-804281-6.00017-3.
3
Prefrontal cortex as a meta-reinforcement learning system.前额皮质作为一个元强化学习系统。
Nat Neurosci. 2018 Jun;21(6):860-868. doi: 10.1038/s41593-018-0147-8. Epub 2018 May 14.
4
Building machines that learn and think like people.建造像人一样学习和思考的机器。
Behav Brain Sci. 2017 Jan;40:e253. doi: 10.1017/S0140525X16001837. Epub 2016 Nov 24.
5
Human-level concept learning through probabilistic program induction.通过概率编程归纳实现人类水平的概念学习。
Science. 2015 Dec 11;350(6266):1332-8. doi: 10.1126/science.aab3050.
6
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
7
Model-based hierarchical reinforcement learning and human action control.基于模型的分层强化学习与人类行为控制。
Philos Trans R Soc Lond B Biol Sci. 2014 Nov 5;369(1655). doi: 10.1098/rstb.2013.0480.
8
Simulation as an engine of physical scene understanding.模拟作为物理场景理解的引擎。
Proc Natl Acad Sci U S A. 2013 Nov 5;110(45):18327-32. doi: 10.1073/pnas.1306572110. Epub 2013 Oct 21.
9
Kinematic mental simulations in abduction and deduction.内省和演绎中的运动心理模拟。
Proc Natl Acad Sci U S A. 2013 Oct 15;110(42):16766-71. doi: 10.1073/pnas.1316275110. Epub 2013 Sep 30.
10
Generating adaptive behaviour within a memory-prediction framework.在记忆预测框架内生成自适应行为。
PLoS One. 2012;7(1):e29264. doi: 10.1371/journal.pone.0029264. Epub 2012 Jan 17.