• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过自然交互和大语言模型对人形机器人行为进行增量学习。

Incremental learning of humanoid robot behavior from natural interaction and large language models.

作者信息

Bärmann Leonard, Kartmann Rainer, Peller-Konrad Fabian, Niehues Jan, Waibel Alex, Asfour Tamim

机构信息

Institute for Anthropomatics and Robotics (IAR), Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany.

出版信息

Front Robot AI. 2024 Oct 10;11:1455375. doi: 10.3389/frobt.2024.1455375. eCollection 2024.

DOI:10.3389/frobt.2024.1455375
PMID:39449715
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11499633/
Abstract

Natural-language dialog is key for an intuitive human-robot interaction. It can be used not only to express humans' intents but also to communicate instructions for improvement if a robot does not understand a command correctly. It is of great importance to let robots learn from such interaction experiences in an incremental way to allow them to improve their behaviors or avoid mistakes in the future. In this paper, we propose a system to achieve such incremental learning of complex high-level behavior from natural interaction and demonstrate its implementation on a humanoid robot. Our system deploys large language models (LLMs) for high-level orchestration of the robot's behavior based on the idea of enabling the LLM to generate Python statements in an interactive console to invoke both robot perception and action. Human instructions, environment observations, and execution results are fed back to the LLM, thus informing the generation of the next statement. Since an LLM can misunderstand (potentially ambiguous) user instructions, we introduce incremental learning from the interaction, which enables the system to learn from its mistakes. For that purpose, the LLM can call another LLM responsible for code-level improvements in the current interaction based on human feedback. Subsequently, we store the improved interaction in the robot's memory so that it can later be retrieved on semantically similar requests. We integrate the system in the robot cognitive architecture of the humanoid robot ARMAR-6 and evaluate our methods both quantitatively (in simulation) and qualitatively (in simulation and real-world) by demonstrating generalized incrementally learned knowledge.

摘要

自然语言对话是直观的人机交互的关键。它不仅可以用来表达人类的意图,还可以在机器人没有正确理解命令时传达改进的指令。让机器人以增量方式从这种交互经验中学习,以便它们在未来改进行为或避免错误,这非常重要。在本文中,我们提出了一个系统,以实现从自然交互中对复杂高级行为的这种增量学习,并在人形机器人上展示其实现。我们的系统基于使大语言模型(LLMs)在交互式控制台中生成Python语句以调用机器人感知和动作的想法,部署大语言模型用于机器人行为的高级编排。人类指令、环境观察和执行结果被反馈给大语言模型,从而为下一条语句的生成提供信息。由于大语言模型可能误解(潜在模糊的)用户指令,我们引入了从交互中进行增量学习,这使系统能够从错误中学习。为此,大语言模型可以调用另一个大语言模型,该模型负责根据人类反馈对当前交互进行代码级改进。随后,我们将改进后的交互存储在机器人的内存中,以便以后在语义相似的请求中检索。我们将该系统集成到人形机器人ARMAR-6的机器人认知架构中,并通过展示广义的增量学习知识,在定量(在模拟中)和定性(在模拟和现实世界中)两方面评估我们的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/6e955b455c7f/frobt-11-1455375-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/73abf534a274/frobt-11-1455375-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/8b323ec85788/frobt-11-1455375-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/5075c90635f4/frobt-11-1455375-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/487cc2a98edb/frobt-11-1455375-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/6e955b455c7f/frobt-11-1455375-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/73abf534a274/frobt-11-1455375-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/8b323ec85788/frobt-11-1455375-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/5075c90635f4/frobt-11-1455375-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/487cc2a98edb/frobt-11-1455375-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5c4/11499633/6e955b455c7f/frobt-11-1455375-g005.jpg

相似文献

1
Incremental learning of humanoid robot behavior from natural interaction and large language models.通过自然交互和大语言模型对人形机器人行为进行增量学习。
Front Robot AI. 2024 Oct 10;11:1455375. doi: 10.3389/frobt.2024.1455375. eCollection 2024.
2
Learning Actions From Natural Language Instructions Using an ON-World Embodied Cognitive Architecture.使用基于现实世界的具身认知架构从自然语言指令中学习动作
Front Neurorobot. 2021 May 13;15:626380. doi: 10.3389/fnbot.2021.626380. eCollection 2021.
3
Real-time emotion generation in human-robot dialogue using large language models.使用大语言模型在人机对话中进行实时情感生成
Front Robot AI. 2023 Dec 1;10:1271610. doi: 10.3389/frobt.2023.1271610. eCollection 2023.
4
Interactive and incremental learning of spatial object relations from human demonstrations.从人类演示中交互式增量学习空间对象关系
Front Robot AI. 2023 May 18;10:1151303. doi: 10.3389/frobt.2023.1151303. eCollection 2023.
5
Self-Explaining Social Robots: An Explainable Behavior Generation Architecture for Human-Robot Interaction.自我解释的社交机器人:一种用于人机交互的可解释行为生成架构。
Front Artif Intell. 2022 Apr 29;5:866920. doi: 10.3389/frai.2022.866920. eCollection 2022.
6
Teaching NICO How to Grasp: An Empirical Study on Crossmodal Social Interaction as a Key Factor for Robots Learning From Humans.教NICO如何抓取:关于跨模态社会互动作为机器人向人类学习的关键因素的实证研究。
Front Neurorobot. 2020 Jun 9;14:28. doi: 10.3389/fnbot.2020.00028. eCollection 2020.
7
iCub-HRI: A Software Framework for Complex Human-Robot Interaction Scenarios on the iCub Humanoid Robot.iCub-HRI:用于iCub人形机器人复杂人机交互场景的软件框架。
Front Robot AI. 2018 Mar 12;5:22. doi: 10.3389/frobt.2018.00022. eCollection 2018.
8
Facing the FACS-Using AI to Evaluate and Control Facial Action Units in Humanoid Robot Face Development.面向FACS——在人形机器人面部开发中利用人工智能评估和控制面部动作单元
Front Robot AI. 2022 Jun 14;9:887645. doi: 10.3389/frobt.2022.887645. eCollection 2022.
9
Older adults' communication with an interactive humanoid robot : Expectations and experiences of older adults in verbal and nonverbal communication with a socially interactive humanoid robot: a mixed methods design in Germany.老年人与互动人形机器人的交流:德国一项关于老年人与社交互动人形机器人进行言语和非言语交流的期望和体验的混合方法设计。
Z Gerontol Geriatr. 2024 Aug;57(5):371-375. doi: 10.1007/s00391-023-02268-y. Epub 2024 Jan 5.
10
A Study on the Effectiveness of IT Application Education for Older Adults by Interaction Method of Humanoid Robots.《基于仿人机器人交互方法的老年人信息技术应用教育效果研究》
Int J Environ Res Public Health. 2022 Sep 2;19(17):10988. doi: 10.3390/ijerph191710988.

本文引用的文献

1
Interactive and incremental learning of spatial object relations from human demonstrations.从人类演示中交互式增量学习空间对象关系
Front Robot AI. 2023 May 18;10:1151303. doi: 10.3389/frobt.2023.1151303. eCollection 2023.