• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

连续多目标问题的分层决策与控制:具有动作延迟的策略评估

Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.

作者信息

Zhu Jiangcheng, Zhu Jun, Wang Zhepei, Guo Shan, Xu Chao

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 Feb;30(2):464-473. doi: 10.1109/TNNLS.2018.2844466. Epub 2018 Jul 2.

DOI:10.1109/TNNLS.2018.2844466
PMID:29994732
Abstract

This paper proposes a hierarchical decision-making and control algorithm for the shepherd game, the seventh mission in the International Aerial Robotics Competition (IARC). In this game, the agent (a multirotor aerial robot) is required to contact targets (ground vehicles) sequentially and drive them to a certain boundary to earn score. During the game of 10 min, the agent should be fully autonomous without any human interference. Regarding the lower-level controller and dynamics of the agent, each action takes a duration of time to accomplish. Denoted as an action delay, in this paper, this action duration is nonconstant and is related to the final reward. Therefore, the challenging point is making the agent "aware of time" when applying a certain action. We solve this problem by two approaches: deep Q-networks and lookup table. The action delay predictor in the decision-level is fitted by a lower-level controller. Through simulations by the example of the shepherd game, the effectiveness and efficiency of this approach are validated. This paper helps our team winning the first prize in IARC 2017, and keeps the best record of this mission since it was released in 2013.

摘要

本文针对国际空中机器人竞赛(IARC)的第七项任务——牧羊游戏,提出了一种分层决策与控制算法。在该游戏中,智能体(多旋翼空中机器人)需要依次接触目标(地面车辆)并将它们驱赶到特定边界以获取分数。在10分钟的游戏过程中,智能体应完全自主,不受任何人为干扰。考虑到智能体的低级控制器和动力学,每个动作都需要一定的时间来完成。在本文中,这个动作持续时间被称为动作延迟,它是不固定的,并且与最终奖励有关。因此,具有挑战性的一点是让智能体在执行某个动作时“意识到时间”。我们通过两种方法解决这个问题:深度Q网络和查找表。决策层中的动作延迟预测器由低级控制器拟合。通过以牧羊游戏为例进行仿真,验证了该方法的有效性和效率。本文帮助我们团队在2017年IARC中获得一等奖,并保持了该任务自2013年发布以来的最佳记录。

相似文献

1
Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.连续多目标问题的分层决策与控制:具有动作延迟的策略评估
IEEE Trans Neural Netw Learn Syst. 2019 Feb;30(2):464-473. doi: 10.1109/TNNLS.2018.2844466. Epub 2018 Jul 2.
2
Autonomous Navigation of a Team of Unmanned Surface Vehicles for Intercepting Intruders on a Region Boundary.一组无人水面航行器在区域边界拦截入侵者的自主导航
Sensors (Basel). 2021 Jan 4;21(1):297. doi: 10.3390/s21010297.
3
A Robotic Cognitive Architecture for Slope and Dam Inspections.用于边坡和大坝检查的机器人认知架构。
Sensors (Basel). 2020 Aug 15;20(16):4579. doi: 10.3390/s20164579.
4
Robust Satisficing Decision Making for Unmanned Aerial Vehicle Complex Missions under Severe Uncertainty.严重不确定性下无人机复杂任务的稳健满意决策
PLoS One. 2016 Nov 11;11(11):e0166448. doi: 10.1371/journal.pone.0166448. eCollection 2016.
5
Experimental evaluation of HJB optimal controllers for the attitude dynamics of a multirotor aerial vehicle.多旋翼飞行器姿态动力学的 HJB 最优控制器的实验评估。
ISA Trans. 2018 Jun;77:188-200. doi: 10.1016/j.isatra.2018.04.003. Epub 2018 Apr 17.
6
Vision-Based Target Finding and Inspection of a Ground Target Using a Multirotor UAV System.基于视觉的地面目标搜索与使用多旋翼无人机系统对地面目标进行检测
Sensors (Basel). 2017 Dec 17;17(12):2929. doi: 10.3390/s17122929.
7
General Framework for the Optimization of the Human-Robot Collaboration Decision-Making Process Through the Ability to Change Performance Metrics.通过改变性能指标的能力优化人机协作决策过程的通用框架。
Front Robot AI. 2021 Oct 25;8:736644. doi: 10.3389/frobt.2021.736644. eCollection 2021.
8
Hierarchical Deep Reinforcement Learning for Continuous Action Control.用于连续动作控制的分层深度强化学习
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5174-5184. doi: 10.1109/TNNLS.2018.2805379. Epub 2018 Mar 9.
9
Resilient Autonomous Control of Distributed Multiagent Systems in Contested Environments.在竞争环境中分布式多智能体系统的弹性自主控制
IEEE Trans Cybern. 2019 Nov;49(11):3957-3967. doi: 10.1109/TCYB.2018.2856089. Epub 2018 Aug 17.
10
Heterogeneous Multi-Robot System for Mapping Environmental Variables of Greenhouses.用于绘制温室环境变量的异构多机器人系统。
Sensors (Basel). 2016 Jul 1;16(7):1018. doi: 10.3390/s16071018.

引用本文的文献

1
Systematic Mapping Literature Review of Mobile Robotics Competitions.系统地梳理移动机器人竞赛的文献综述。
Sensors (Basel). 2022 Mar 10;22(6):2160. doi: 10.3390/s22062160.