• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

A Local Information Aggregation-Based Multiagent Reinforcement Learning for Robot Swarm Dynamic Task Allocation.

作者信息

Lv Yang, Lei Jinlong, Yi Peng

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):10437-10449. doi: 10.1109/TNNLS.2025.3558282.

DOI:10.1109/TNNLS.2025.3558282
PMID:40266869
Abstract

In this article, we explore how to optimize task allocation for robot swarms in dynamic environments, emphasizing the necessity of formulating robust, flexible, and scalable strategies for robot cooperation. We introduce a novel framework using a decentralized partially observable Markov decision process (Dec-POMDP), specifically designed for distributed robot swarm networks. At the core of our methodology is the local information aggregation multiagent deep deterministic policy gradient (LIA-MADDPG) algorithm, which merges centralized training with distributed execution. During the centralized training phase, a local information aggregation (LIA) module is meticulously designed to gather critical data from neighboring robots, enhancing decision-making efficiency. In the distributed execution phase, a strategy improvement method is proposed to dynamically adjust task allocation based on changing and partially observable environmental conditions. Our empirical evaluations show that the LIA module can be seamlessly integrated into various centralized training and decentralized execution (CTDE)-based multiagent reinforcement learning (MARL) methods, significantly enhancing their performance. Additionally, by comparing LIA-MADDPG with six conventional reinforcement learning algorithms and a heuristic algorithm, we demonstrate its superior scalability, rapid adaptation to environmental changes, and ability to maintain both stability and convergence speed. These results underscore LIA-MADDPG's outstanding performance and its potential to significantly improve dynamic task allocation in robot swarms through enhanced local collaboration and adaptive strategy execution.

摘要

相似文献

1
A Local Information Aggregation-Based Multiagent Reinforcement Learning for Robot Swarm Dynamic Task Allocation.
IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):10437-10449. doi: 10.1109/TNNLS.2025.3558282.
2
Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making.基于专家系统的多智能体深度确定性策略梯度用于群体机器人决策
IEEE Trans Cybern. 2024 Mar;54(3):1614-1624. doi: 10.1109/TCYB.2022.3228578. Epub 2024 Feb 9.
3
Meta Learning Task Representation in Multiagent Reinforcement Learning: From Global Inference to Local Inference.多智能体强化学习中的元学习任务表示:从全局推理到局部推理
IEEE Trans Neural Netw Learn Syst. 2025 Aug;36(8):14908-14921. doi: 10.1109/TNNLS.2025.3540758.
4
MAARS: Multiagent Actor-Critic Approach for Resource Allocation and Network Slicing in Multiaccess Edge Computing.MAARS:多接入边缘计算中用于资源分配和网络切片的多智能体演员-评论家方法。
Sensors (Basel). 2024 Dec 4;24(23):7760. doi: 10.3390/s24237760.
5
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning.SMIX(λ):增强用于协作多智能体强化学习的集中式价值函数
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):52-63. doi: 10.1109/TNNLS.2021.3089493. Epub 2023 Jan 5.
6
Multi-robot task allocation in e-commerce RMFS based on deep reinforcement learning.基于深度强化学习的电子商务 RMFS 中的多机器人任务分配。
Math Biosci Eng. 2023 Jan;20(2):1903-1918. doi: 10.3934/mbe.2023087. Epub 2022 Nov 8.
7
SATF: A Scalable Attentive Transfer Framework for Efficient Multiagent Reinforcement Learning.SATF:一种用于高效多智能体强化学习的可扩展注意力转移框架
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6627-6641. doi: 10.1109/TNNLS.2024.3387397. Epub 2025 Apr 4.
8
A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation.一种用于多智能体协作导航的局部与全局注意力强化学习算法
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7767-7777. doi: 10.1109/TNNLS.2022.3220798. Epub 2024 Jun 3.
9
Residual Q-Networks for Value Function Factorizing in Multiagent Reinforcement Learning.用于多智能体强化学习中价值函数分解的残差Q网络
IEEE Trans Neural Netw Learn Syst. 2024 Feb;35(2):1534-1544. doi: 10.1109/TNNLS.2022.3183865. Epub 2024 Feb 5.
10
Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks.用于无人机支持的物联网网络中地空通信规划的多智能体深度强化学习
Sensors (Basel). 2024 Oct 10;24(20):6535. doi: 10.3390/s24206535.