• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

A formal methods approach to interpretable reinforcement learning for robotic planning.

作者信息

Li Xiao, Serlin Zachary, Yang Guang, Belta Calin

机构信息

Department of Mechanical Engineering, Boston University, Boston, MA, USA.

Division of Systems Engineering, Boston University, Boston, MA, USA.

出版信息

Sci Robot. 2019 Dec 18;4(37). doi: 10.1126/scirobotics.aay6276.

DOI:10.1126/scirobotics.aay6276
PMID:33137718
Abstract

Growing interest in reinforcement learning approaches to robotic planning and control raises concerns of predictability and safety of robot behaviors realized solely through learned control policies. In addition, formally defining reward functions for complex tasks is challenging, and faulty rewards are prone to exploitation by the learning agent. Here, we propose a formal methods approach to reinforcement learning that (i) provides a formal specification language that integrates high-level, rich, task specifications with a priori, domain-specific knowledge; (ii) makes the reward generation process easily interpretable; (iii) guides the policy generation process according to the specification; and (iv) guarantees the satisfaction of the (critical) safety component of the specification. The main ingredients of our computational framework are a predicate temporal logic specifically tailored for robotic tasks and an automaton-guided, safe reinforcement learning algorithm based on control barrier functions. Although the proposed framework is quite general, we motivate it and illustrate it experimentally for a robotic cooking task, in which two manipulators worked together to make hot dogs.

摘要

相似文献

1
A formal methods approach to interpretable reinforcement learning for robotic planning.
Sci Robot. 2019 Dec 18;4(37). doi: 10.1126/scirobotics.aay6276.
2
Safe reinforcement learning under temporal logic with reward design and quantum action selection.基于奖励设计和量子动作选择的时序逻辑下的安全强化学习。
Sci Rep. 2023 Feb 2;13(1):1925. doi: 10.1038/s41598-023-28582-4.
3
Koopman Operator-Based Knowledge-Guided Reinforcement Learning for Safe Human-Robot Interaction.基于库普曼算子的知识引导强化学习用于安全人机交互
Front Robot AI. 2022 Jun 16;9:779194. doi: 10.3389/frobt.2022.779194. eCollection 2022.
4
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
5
Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types.维持型、进取型、回避型和成就目标类型的内在奖励。
Front Neurorobot. 2018 Oct 9;12:63. doi: 10.3389/fnbot.2018.00063. eCollection 2018.
6
A learning-based synthesis approach of reward asynchronous probabilistic games against the linear temporal logic winning condition.一种基于学习的针对线性时态逻辑获胜条件的奖励异步概率博弈合成方法。
PeerJ Comput Sci. 2022 Sep 5;8:e1094. doi: 10.7717/peerj-cs.1094. eCollection 2022.
7
The Task Decomposition and Dedicated Reward-System-Based Reinforcement Learning Algorithm for Pick-and-Place.基于任务分解和专用奖励系统的抓取与放置强化学习算法
Biomimetics (Basel). 2023 Jun 6;8(2):240. doi: 10.3390/biomimetics8020240.
8
From Semantics to Execution: Integrating Action Planning With Reinforcement Learning for Robotic Causal Problem-Solving.从语义到执行:将动作规划与强化学习相结合以解决机器人因果问题
Front Robot AI. 2019 Nov 26;6:123. doi: 10.3389/frobt.2019.00123. eCollection 2019.
9
Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.用于序列任务的多通道交互式强化学习
Front Robot AI. 2020 Sep 24;7:97. doi: 10.3389/frobt.2020.00097. eCollection 2020.
10
Fuzzy reinforcement learning control for compliance tasks of robotic manipulators.用于机器人操纵器柔顺任务的模糊强化学习控制
IEEE Trans Syst Man Cybern B Cybern. 2002;32(1):107-13. doi: 10.1109/3477.979965.

引用本文的文献

1
A High-Efficient Reinforcement Learning Approach for Dexterous Manipulation.一种用于灵巧操作的高效强化学习方法。
Biomimetics (Basel). 2023 Jun 16;8(2):264. doi: 10.3390/biomimetics8020264.
2
Safe reinforcement learning under temporal logic with reward design and quantum action selection.基于奖励设计和量子动作选择的时序逻辑下的安全强化学习。
Sci Rep. 2023 Feb 2;13(1):1925. doi: 10.1038/s41598-023-28582-4.