An Embodied Agent Learning Affordances With Intrinsic Motivations and Solving Extrinsic Tasks With Attention and One-Step Planning.

Authors

Baldassarre Gianluca, Lord William, Granato Giovanni, Santucci Vieri Giuliano

Affiliations

Laboratory of Computational Embodied Neuroscience, Institute of Cognitive Sciences and Technologies, National Research Council of Italy, Rome, Italy.

School of Engineering Sciences, KTH Royal Institute of Technology, Stockholm, Sweden.

Publication information

Front Neurorobot. 2019 Jul 26;13:45. doi: 10.3389/fnbot.2019.00045. eCollection 2019.

DOI: 10.3389/fnbot.2019.00045
PMID: 31402859
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6676802/

Abstract

We propose an architecture for the open-ended learning and control of embodied agents. The architecture learns action affordances and forward models based on intrinsic motivations and can later use the acquired knowledge to solve extrinsic tasks by decomposing them into sub-tasks, each solved with one-step planning. An affordance is here operationalized as the agent's estimate of the probability of success of an action performed on a given object. The focus of the work is on the overall architecture while single sensorimotor components are simplified. A key element of the architecture is the use of "active vision" that plays two functions, namely to focus on single objects and to factorize visual information into the object appearance and object position. These processes serve both the acquisition and use of object-related affordances, and the decomposition of extrinsic goals (tasks) into multiple sub-goals (sub-tasks). The architecture gives novel contributions on three problems: (a) the learning of affordances based on intrinsic motivations; (b) the use of active vision to decompose complex extrinsic tasks; (c) the possible role of affordances within planning systems endowed with models of the world. The architecture is tested in a simulated stylized 2D scenario in which objects need to be moved or "manipulated" in order to accomplish new desired overall configurations of the objects (extrinsic goals). The results show the utility of using intrinsic motivations to support affordance learning; the utility of active vision to solve composite tasks; and the possible utility of affordances for solving utility-based planning problems.
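
The abstract operationalizes an affordance as the agent's estimate of the probability that an action succeeds on a given object, and suggests affordances may help in utility-based planning. The following is a minimal sketch of that idea, not the authors' implementation. It assumes the estimate is a simple running success frequency per (object, action) pair and that a one-step planner picks the action with the highest product of estimated success probability and utility. The class `AffordanceTable`, the function `plan_one_step`, the `prior` parameter, and the object and action names are all hypothetical.

```python
from collections import defaultdict


class AffordanceTable:
    """Hypothetical store of success-probability estimates per (object, action)."""

    def __init__(self, prior=0.5):
        # Assumed default estimate for pairs that have never been tried.
        self.prior = prior
        self.successes = defaultdict(int)
        self.trials = defaultdict(int)

    def update(self, obj, action, success):
        """Record the outcome of one attempt of `action` on `obj`."""
        key = (obj, action)
        self.trials[key] += 1
        self.successes[key] += int(success)

    def estimate(self, obj, action):
        """Return the estimated probability that `action` succeeds on `obj`."""
        key = (obj, action)
        n = self.trials.get(key, 0)
        if n == 0:
            return self.prior
        return self.successes[key] / n


def plan_one_step(table, obj, utilities):
    """Pick the action with the highest (estimated success) * (utility).

    `utilities` maps each candidate action to the value of its outcome;
    this scoring rule is an assumption made for illustration.
    """
    return max(utilities, key=lambda a: table.estimate(obj, a) * utilities[a])


# Example: "push" succeeded 3 of 4 times, "lift" 1 of 4 times.
table = AffordanceTable()
for outcome in (True, True, False, True):
    table.update("cube", "push", outcome)
for outcome in (False, False, False, True):
    table.update("cube", "lift", outcome)

best = plan_one_step(table, "cube", {"push": 1.0, "lift": 2.0})
```

In this example the lower-utility action can still be selected when its estimated success probability is high enough, which is one way an affordance estimate could bias a planner toward actions that are likely to work.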
