• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越模仿:通过将概念作为认知程序来学习,实现机器人的零样本任务迁移。

Beyond imitation: Zero-shot task transfer on robots by learning concepts as cognitive programs.

机构信息

Vicarious AI, CA, USA.

出版信息

Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aav3150.

DOI:10.1126/scirobotics.aav3150
PMID:33137758
Abstract

Humans can infer concepts from image pairs and apply those in the physical world in a completely different setting, enabling tasks like IKEA assembly from diagrams. If robots could represent and infer high-level concepts, then it would notably improve their ability to understand our intent and to transfer tasks between different environments. To that end, we introduce a computational framework that replicates aspects of human concept learning. Concepts are represented as programs on a computer architecture consisting of a visual perception system, working memory, and action controller. The instruction set of this cognitive computer has commands for parsing a visual scene, directing gaze and attention, imagining new objects, manipulating the contents of a visual working memory, and controlling arm movement. Inferring a concept corresponds to inducing a program that can transform the input to the output. Some concepts require the use of imagination and recursion. Previously learned concepts simplify the learning of subsequent, more elaborate concepts and create a hierarchy of abstractions. We demonstrate how a robot can use these abstractions to interpret novel concepts presented to it as schematic images and then apply those concepts in very different situations. By bringing cognitive science ideas on mental imagery, perceptual symbols, embodied cognition, and deictic mechanisms into the realm of machine learning, our work brings us closer to the goal of building robots that have interpretable representations and common sense.

摘要

人类可以从图像对中推断概念,并将这些概念应用于完全不同的物理环境中,从而实现从图表中组装宜家家具等任务。如果机器人能够表示和推断高级概念,那么它们将显著提高理解我们意图和在不同环境中转移任务的能力。为此,我们引入了一个计算框架,该框架复制了人类概念学习的某些方面。概念被表示为计算机体系结构上的程序,该体系结构由视觉感知系统、工作记忆和动作控制器组成。这种认知计算机的指令集具有解析视觉场景、引导注视和注意力、想象新物体、操作视觉工作记忆的内容以及控制手臂运动的命令。推断一个概念相当于诱导一个可以将输入转换为输出的程序。有些概念需要想象力和递归的运用。之前学习的概念简化了后续更复杂概念的学习,并创建了一个抽象层次结构。我们展示了机器人如何使用这些抽象来解释以示意图像呈现给它的新概念,然后在非常不同的情况下应用这些概念。通过将认知科学关于心理意象、知觉符号、具身认知和指示机制的思想引入机器学习领域,我们的工作使我们更接近构建具有可解释表示和常识的机器人的目标。

相似文献

1
Beyond imitation: Zero-shot task transfer on robots by learning concepts as cognitive programs.超越模仿:通过将概念作为认知程序来学习,实现机器人的零样本任务迁移。
Sci Robot. 2019 Jan 16;4(26). doi: 10.1126/scirobotics.aav3150.
2
Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars.递归神经程序:一种用于学习组合部分-整体层次结构和图像语法的可微框架。
PNAS Nexus. 2023 Oct 14;2(11):pgad337. doi: 10.1093/pnasnexus/pgad337. eCollection 2023 Nov.
3
Social Cognition for Human-Robot Symbiosis-Challenges and Building Blocks.人机共生的社会认知——挑战与基石
Front Neurorobot. 2018 Jul 11;12:34. doi: 10.3389/fnbot.2018.00034. eCollection 2018.
4
The developmental origins of naïve psychology in infancy.婴儿期朴素心理学的发展起源。
Adv Child Dev Behav. 2009;37:55-104. doi: 10.1016/s0065-2407(09)03702-1.
5
Humanoid Cognitive Robots That Learn by Imitating: Implications for Consciousness Studies.通过模仿学习的类人认知机器人:对意识研究的启示
Front Robot AI. 2018 Jan 26;5:1. doi: 10.3389/frobt.2018.00001. eCollection 2018.
6
Representation in natural and artificial agents: an embodied cognitive science perspective.自然与人工智能体中的表征:一种具身认知科学视角
Z Naturforsch C J Biosci. 1998 Jul-Aug;53(7-8):480-503. doi: 10.1515/znc-1998-7-804.
7
Learning Actions From Natural Language Instructions Using an ON-World Embodied Cognitive Architecture.使用基于现实世界的具身认知架构从自然语言指令中学习动作
Front Neurorobot. 2021 May 13;15:626380. doi: 10.3389/fnbot.2021.626380. eCollection 2021.
8
Performance of a Computational Model of the Mammalian Olfactory System哺乳动物嗅觉系统计算模型的性能
9
Transforming task representations to perform novel tasks.将任务表示转化为执行新任务。
Proc Natl Acad Sci U S A. 2020 Dec 29;117(52):32970-32981. doi: 10.1073/pnas.2008852117. Epub 2020 Dec 10.
10
Mental imagery in animals: Learning, memory, and decision-making in the face of missing information.动物中的心理意象:面对缺失信息时的学习、记忆与决策
Learn Behav. 2019 Sep;47(3):193-216. doi: 10.3758/s13420-019-00386-5.

引用本文的文献

1
A detailed theory of thalamic and cortical microcircuits for predictive visual inference.一种用于预测性视觉推理的丘脑和皮层微电路的详细理论。
Sci Adv. 2025 Feb 7;11(6):eadr6698. doi: 10.1126/sciadv.adr6698. Epub 2025 Feb 5.
2
Neurodynamical Computing at the Information Boundaries of Intelligent Systems.智能系统信息边界处的神经动力学计算。
Cognit Comput. 2024;16(5):1-13. doi: 10.1007/s12559-022-10081-9. Epub 2022 Dec 27.
3
Learning Soft Millirobot Multimodal Locomotion with Sim-to-Real Transfer.通过模拟到现实的迁移学习软微型机器人的多模态运动。
Adv Sci (Weinh). 2024 Aug;11(30):e2308881. doi: 10.1002/advs.202308881. Epub 2024 Jun 18.
4
Stochastic neuro-fuzzy system implemented in memristor crossbar arrays.在忆阻器交叉阵列中实现的随机神经模糊系统。
Sci Adv. 2024 Mar 22;10(12):eadl3135. doi: 10.1126/sciadv.adl3135.
5
Clone-structured graph representations enable flexible learning and vicarious evaluation of cognitive maps.克隆结构的图表示能够灵活地学习和替代评估认知图。
Nat Commun. 2021 Apr 22;12(1):2392. doi: 10.1038/s41467-021-22559-5.
6
AI, visual imagery, and a case study on the challenges posed by human intelligence tests.人工智能、视觉意象,以及对人类智力测验所带来的挑战的案例研究。
Proc Natl Acad Sci U S A. 2020 Nov 24;117(47):29390-29397. doi: 10.1073/pnas.1912335117.
7
From CAPTCHA to Commonsense: How Brain Can Teach Us About Artificial Intelligence.从验证码到常识:大脑如何教会我们关于人工智能的知识。
Front Comput Neurosci. 2020 Oct 22;14:554097. doi: 10.3389/fncom.2020.554097. eCollection 2020.
8
Zero-shot learning and its applications from autonomous vehicles to COVID-19 diagnosis: A review.零样本学习及其从自动驾驶车辆到新冠病毒诊断的应用:综述
Intell Based Med. 2020 Dec;3:100005. doi: 10.1016/j.ibmed.2020.100005. Epub 2020 Oct 2.