机器人集群性能的离策略评估：用于评估对控制机器人的有限状态机进行潜在修改的重要性采样

Off-Policy Evaluation of the Performance of a Robot Swarm: Importance Sampling to Assess Potential Modifications to the Finite-State Machine That Controls the Robots.

作者信息

Pagnozzi Federico, Birattari Mauro

机构信息

IRIDIA, Université libre de Bruxelles, Brussels, Belgium.

出版信息

Front Robot AI. 2021 Apr 29;8:625125. doi: 10.3389/frobt.2021.625125. eCollection 2021.

DOI:10.3389/frobt.2021.625125

PMID:33996923

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8117342/

Abstract

Due to the decentralized, loosely coupled nature of a swarm and to the lack of a general design methodology, the development of control software for robot swarms is typically an iterative process. Control software is generally modified and refined repeatedly, either manually or automatically, until satisfactory results are obtained. In this paper, we propose a technique based on off-policy evaluation to estimate how the performance of an instance of control software-implemented as a probabilistic finite-state machine-would be impacted by modifying the structure and the value of the parameters. The proposed technique is particularly appealing when coupled with automatic design methods belonging to the AutoMoDe family, as it can exploit the data generated during the design process. The technique can be used either to reduce the complexity of the control software generated, improving therefore its readability, or to evaluate perturbations of the parameters, which could help in prioritizing the exploration of the neighborhood of the current solution within an iterative improvement algorithm. To evaluate the technique, we apply it to control software generated with an AutoMoDe method, . In a first experiment, we use the proposed technique to estimate the impact of removing a state from a probabilistic finite-state machine. In a second experiment, we use it to predict the impact of changing the value of the parameters. The results show that the technique is promising and significantly better than a naive estimation. We discuss the limitations of the current implementation of the technique, and we sketch possible improvements, extensions, and generalizations.

摘要

由于群体的分散性、松散耦合性以及缺乏通用的设计方法，机器人群体控制软件的开发通常是一个迭代过程。控制软件通常需要手动或自动反复修改和完善，直到获得满意的结果。在本文中，我们提出了一种基于离策略评估的技术，以估计作为概率有限状态机实现的控制软件实例的性能将如何受到修改结构和参数值的影响。当与属于AutoMoDe家族的自动设计方法相结合时，所提出的技术特别有吸引力，因为它可以利用设计过程中生成的数据。该技术既可以用于降低生成的控制软件的复杂性，从而提高其可读性，也可以用于评估参数的扰动，这有助于在迭代改进算法中确定当前解决方案邻域探索的优先级。为了评估该技术，我们将其应用于用AutoMoDe方法生成的控制软件。在第一个实验中，我们使用所提出的技术来估计从概率有限状态机中移除一个状态的影响。在第二个实验中，我们用它来预测改变参数值的影响。结果表明，该技术很有前景，并且明显优于简单估计。我们讨论了该技术当前实现的局限性，并概述了可能的改进、扩展和推广。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6342/8117342/215a9aebf23d/frobt-08-625125-g001.jpg

相似文献

Off-Policy Evaluation of the Performance of a Robot Swarm: Importance Sampling to Assess Potential Modifications to the Finite-State Machine That Controls the Robots.机器人集群性能的离策略评估：用于评估对控制机器人的有限状态机进行潜在修改的重要性采样

Front Robot AI. 2021 Apr 29;8:625125. doi: 10.3389/frobt.2021.625125. eCollection 2021.

Iterative improvement in the automatic modular design of robot swarms.机器人集群自动模块化设计的迭代改进。

PeerJ Comput Sci. 2020 Dec 7;6:e322. doi: 10.7717/peerj-cs.322. eCollection 2020.

Concurrent design of control software and configuration of hardware for robot swarms under economic constraints.经济约束下机器人集群控制软件的协同设计与硬件配置

PeerJ Comput Sci. 2019 Sep 30;5:e221. doi: 10.7717/peerj-cs.221. eCollection 2019.

Automatic modular design of robot swarms using behavior trees as a control architecture.使用行为树作为控制架构的机器人集群自动模块化设计。

PeerJ Comput Sci. 2020 Nov 9;6:e314. doi: 10.7717/peerj-cs.314. eCollection 2020.

Recent trends in robot learning and evolution for swarm robotics.群体机器人技术中机器人学习与进化的最新趋势。

Front Robot AI. 2023 Apr 24;10:1134841. doi: 10.3389/frobt.2023.1134841. eCollection 2023.

Towards an integrated automatic design process for robot swarms.迈向机器人群体的集成自动设计流程。

Open Res Eur. 2022 Nov 4;1:112. doi: 10.12688/openreseurope.14025.2. eCollection 2021.

A Concurrent Mission-Planning Methodology for Robotic Swarms Using Collaborative Motion-Control Strategies.一种采用协作运动控制策略的机器人集群并发任务规划方法。

J Intell Robot Syst. 2023;108(2):15. doi: 10.1007/s10846-023-01881-8. Epub 2023 May 30.

Decentralized Control for Swarm Robots That Can Effectively Execute Spatially Distributed Tasks.用于能有效执行空间分布式任务的群体机器人的分散式控制。

Artif Life. 2020 Spring;26(2):242-259. doi: 10.1162/artl_a_00317. Epub 2020 Apr 9.

Information Exchange Design Patterns for Robot Swarm Foraging and Their Application in Robot Control Algorithms.用于机器人集群觅食的信息交换设计模式及其在机器人控制算法中的应用

Front Robot AI. 2018 Jun 7;5:47. doi: 10.3389/frobt.2018.00047. eCollection 2018.

Blockchain Technology Secures Robot Swarms: A Comparison of Consensus Protocols and Their Resilience to Byzantine Robots.区块链技术保障机器人集群安全：共识协议及其对拜占庭机器人的弹性比较

Front Robot AI. 2020 May 12;7:54. doi: 10.3389/frobt.2020.00054. eCollection 2020.

本文引用的文献

Automatic Off-Line Design of Robot Swarms: A Manifesto.机器人集群的自动离线设计：宣言

Front Robot AI. 2019 Jul 19;6:59. doi: 10.3389/frobt.2019.00059. eCollection 2019.

Embodied Evolution in Collective Robotics: A Review.群体机器人技术中的具身进化：综述

Front Robot AI. 2018 Feb 22;5:12. doi: 10.3389/frobt.2018.00012. eCollection 2018.

A Design Pattern for Decentralised Decision Making.一种去中心化决策的设计模式。

PLoS One. 2015 Oct 23;10(10):e0140950. doi: 10.1371/journal.pone.0140950. eCollection 2015.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

机器人集群性能的离策略评估：用于评估对控制机器人的有限状态机进行潜在修改的重要性采样

Off-Policy Evaluation of the Performance of a Robot Swarm: Importance Sampling to Assess Potential Modifications to the Finite-State Machine That Controls the Robots.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献