反事实奖励促进使用个体控制的群体微型机器人进行集体运输。

Counterfactual rewards promote collective transport using individually controlled swarm microrobots.

作者信息

Heuthe Veit-Lorenz, Panizon Emanuele, Gu Hongri, Bechinger Clemens

机构信息

Department of Physics, University of Konstanz, Universitaetsstrasse 10, Konstanz, 78464, Germany.

Centre for the Advanced Study of Collective Behaviour, Universitaetsstrasse 10, Konstanz, 78464, Germany.

出版信息

Sci Robot. 2024 Dec 18;9(97):eado5888. doi: 10.1126/scirobotics.ado5888.

DOI:10.1126/scirobotics.ado5888

PMID:39693403

Abstract

Swarm robots offer fascinating opportunities to perform complex tasks beyond the capabilities of individual machines. Just as a swarm of ants collectively moves large objects, similar functions can emerge within a group of robots through individual strategies based on local sensing. However, realizing collective functions with individually controlled microrobots is particularly challenging because of their micrometer size, large number of degrees of freedom, strong thermal noise relative to the propulsion speed, and complex physical coupling between neighboring microrobots. Here, we implemented multiagent reinforcement learning (MARL) to generate a control strategy for up to 200 microrobots whose motions are individually controlled by laser spots. During the learning process, we used so-called counterfactual rewards that automatically assign credit to the individual microrobots, which allows fast and unbiased training. With the help of this efficient reward scheme, swarm microrobots learn to collectively transport a large cargo object to an arbitrary position and orientation, similar to ant swarms. We show that this flexible and versatile swarm robotic system is robust to variations in group size, the presence of malfunctioning units, and environmental noise. In addition, we let the robot swarms manipulate multiple objects simultaneously in a demonstration experiment, highlighting the benefits of distributed control and independent microrobot motion. Control strategies such as ours can potentially enable complex and automated assembly of mobile micromachines, programmable drug delivery capsules, and other advanced lab-on-a-chip applications.

摘要

群体机器人为执行单个机器无法完成的复杂任务提供了迷人的机会。正如一群蚂蚁能共同移动大型物体一样，通过基于局部感知的个体策略，一组机器人也能展现出类似功能。然而，对于个体可控的微型机器人而言，要实现集体功能尤其具有挑战性，这是因为它们尺寸微小、自由度多、相对于推进速度的热噪声大，以及相邻微型机器人之间存在复杂的物理耦合。在此，我们实施了多智能体强化学习（MARL），为多达200个微型机器人生成控制策略，这些微型机器人的运动由激光点单独控制。在学习过程中，我们使用了所谓的反事实奖励，它能自动将功劳归于各个微型机器人，从而实现快速且无偏差的训练。借助这种高效的奖励机制，群体微型机器人学会了将一个大型货物集体运送到任意位置和方向，类似于蚁群。我们表明，这种灵活通用的群体机器人系统对于群体规模的变化、故障单元的存在以及环境噪声具有鲁棒性。此外，在一个演示实验中，我们让机器人群体同时操控多个物体，突出了分布式控制和微型机器人独立运动的优势。像我们这样的控制策略有可能实现移动微机器、可编程药物递送胶囊以及其他先进的芯片实验室应用的复杂自动化组装。

相似文献

Counterfactual rewards promote collective transport using individually controlled swarm microrobots.反事实奖励促进使用个体控制的群体微型机器人进行集体运输。

Sci Robot. 2024 Dec 18;9(97):eado5888. doi: 10.1126/scirobotics.ado5888.

Programmable Collective Behavior in Dynamically Self-Assembled Mobile Microrobotic Swarms.动态自组装移动微型机器人集群中的可编程集体行为

Adv Sci (Weinh). 2019 Jan 23;6(6):1801837. doi: 10.1002/advs.201801837. eCollection 2019 Mar 20.

Collective Reconfiguration and Propulsion Behaviors of -Based Biohybrid Magnetic Microrobot Swarm.基于-的生物杂交磁性微型机器人集群的集体重构与推进行为

ACS Appl Mater Interfaces. 2025 Feb 19;17(7):11062-11072. doi: 10.1021/acsami.4c19275. Epub 2025 Feb 5.

Propulsion Mechanisms in Magnetic Microrobotics: From Single Microrobots to Swarms.磁性微型机器人技术中的推进机制：从单个微型机器人到群体

Micromachines (Basel). 2025 Jan 31;16(2):181. doi: 10.3390/mi16020181.

Multimodal-Driven Magnetic Microrobots with Enhanced Bactericidal Activity for Biofilm Eradication and Removal from Titanium Mesh.多模态驱动的磁性微机器人，具有增强的杀菌活性，可用于从钛网中清除和去除生物膜。

Adv Mater. 2023 Jun;35(23):e2300191. doi: 10.1002/adma.202300191. Epub 2023 Apr 23.

Magnetically Actuated Cell-Robot System: Precise Control, Manipulation, and Multimode Conversion.磁驱动细胞机器人系统：精确控制、操作与多模式转换

Small. 2022 Apr;18(15):e2105414. doi: 10.1002/smll.202105414. Epub 2022 Mar 1.

Collective Behaviors of Magnetic Microparticle Swarms: From Dexterous Tentacles to Reconfigurable Carpets.磁性微粒子群的集体行为：从灵巧的触手到可重构的地毯。

ACS Nano. 2022 Sep 27;16(9):13728-13739. doi: 10.1021/acsnano.2c05244. Epub 2022 Aug 4.

Long-Distance Autonomous Navigation of Optical Microrobotic Swarms in Complex Environments.复杂环境中光学微型机器人群的长距离自主导航

Adv Intell Syst. 2024 Dec;6(12). doi: 10.1002/aisy.202400409. Epub 2024 Sep 19.

Solitary and Collective Motion Behaviors of Microrobots under the Coupling of Multiple Light Fields.多光场耦合下微型机器人的单独和集体运动行为

Micromachines (Basel). 2022 Dec 29;14(1):89. doi: 10.3390/mi14010089.

Reconfigurable Particle Swarm Robotics Powered by Acoustic Vibration Tweezer.由声振动镊子驱动的可重构粒子群机器人

Soft Robot. 2021 Dec;8(6):735-743. doi: 10.1089/soro.2020.0050. Epub 2020 Nov 20.

引用本文的文献

A light-fueled self-oscillator that senses force.一种可感知力的光驱动自振荡器。

Commun Mater. 2025;6(1):173. doi: 10.1038/s43246-025-00903-2. Epub 2025 Aug 5.

Integrated decision-control for social robot autonomous navigation considering nonlinear dynamics model.考虑非线性动力学模型的社交机器人自主导航集成决策控制

PLoS One. 2025 Jun 6;20(6):e0324341. doi: 10.1371/journal.pone.0324341. eCollection 2025.

Emergent collective behavior of cohesive, aligning particles.具有凝聚力、排列性粒子的涌现集体行为。

Eur Phys J E Soft Matter. 2025 May 7;48(4-5):22. doi: 10.1140/epje/s10189-025-00482-7.

The 2025 motile active matter roadmap.2025年可移动活性物质路线图。

J Phys Condens Matter. 2025 Feb 19;37(14):143501. doi: 10.1088/1361-648X/adac98.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

反事实奖励促进使用个体控制的群体微型机器人进行集体运输。

Counterfactual rewards promote collective transport using individually controlled swarm microrobots.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献