• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分散式多智能体强化学习中的注意力关系状态表示

Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning.

作者信息

Liu Xiangyu, Tan Ying

出版信息

IEEE Trans Cybern. 2022 Jan;52(1):252-264. doi: 10.1109/TCYB.2020.2979803. Epub 2022 Jan 11.

DOI:10.1109/TCYB.2020.2979803
PMID:32224477
Abstract

In multiagent reinforcement learning (MARL), it is crucial for each agent to model the relation with its neighbors. Existing approaches usually resort to concatenate the features of multiple neighbors, fixing the size and the identity of the inputs. But these settings are inflexible and unscalable. In this article, we propose an attentive relational encoder (ARE), which is a novel scalable feedforward neural module, to attentionally aggregate an arbitrary-sized neighboring feature set for state representation in the decentralized MARL. The ARE actively selects the relevant information from the neighboring agents and is permutation invariant, computationally efficient, and flexible to interactive multiagent systems. Our method consistently outperforms the latest competing decentralized MARL methods in several multiagent tasks. In particular, it shows strong cooperative performance in challenging StarCraft micromanagement tasks and achieves over a 96% winning rate against the most difficult noncheating built-in artificial intelligence bots.

摘要

在多智能体强化学习(MARL)中,每个智能体对其与邻居的关系进行建模至关重要。现有方法通常采用拼接多个邻居的特征,固定输入的大小和身份。但这些设置缺乏灵活性且不可扩展。在本文中,我们提出了一种注意力关系编码器(ARE),它是一种新颖的可扩展前馈神经模块,用于在分散式MARL中注意力聚合任意大小的相邻特征集以进行状态表示。ARE从相邻智能体中主动选择相关信息,具有排列不变性、计算效率高且对交互式多智能体系统具有灵活性。我们的方法在多个多智能体任务中始终优于最新的竞争分散式MARL方法。特别是,它在具有挑战性的星际争霸微观管理任务中表现出强大的合作性能,对阵最困难的非作弊内置人工智能机器人时胜率超过96%。

相似文献

1
Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning.分散式多智能体强化学习中的注意力关系状态表示
IEEE Trans Cybern. 2022 Jan;52(1):252-264. doi: 10.1109/TCYB.2020.2979803. Epub 2022 Jan 11.
2
SATF: A Scalable Attentive Transfer Framework for Efficient Multiagent Reinforcement Learning.SATF:一种用于高效多智能体强化学习的可扩展注意力转移框架
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6627-6641. doi: 10.1109/TNNLS.2024.3387397. Epub 2025 Apr 4.
3
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios.联合国排雷行动处:非成形合作场景下的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Apr;34(4):2093-2104. doi: 10.1109/TNNLS.2021.3105869. Epub 2023 Apr 4.
4
Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning.
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9044-9056. doi: 10.1109/TNNLS.2024.3420791. Epub 2025 May 2.
5
Multiexperience-Assisted Efficient Multiagent Reinforcement Learning.多体验辅助的高效多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):12678-12692. doi: 10.1109/TNNLS.2023.3264275. Epub 2024 Sep 3.
6
Multiagent Reinforcement Learning With Graphical Mutual Information Maximization.基于图形互信息最大化的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Feb 16;PP. doi: 10.1109/TNNLS.2023.3243557.
7
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning.SMIX(λ):增强用于协作多智能体强化学习的集中式价值函数
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):52-63. doi: 10.1109/TNNLS.2021.3089493. Epub 2023 Jan 5.
8
Strangeness-driven exploration in multi-agent reinforcement learning.多智能体强化学习中的奇异驱动探索。
Neural Netw. 2024 Apr;172:106149. doi: 10.1016/j.neunet.2024.106149. Epub 2024 Jan 26.
9
Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network.基于异构图注意力网络的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6851-6860. doi: 10.1109/TNNLS.2022.3215774. Epub 2023 Oct 5.
10
Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning.基于新型多智能体强化学习的大规模交通信号控制
IEEE Trans Cybern. 2021 Jan;51(1):174-187. doi: 10.1109/TCYB.2020.3015811. Epub 2020 Dec 22.