• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning.

作者信息

Guan Cong, Chen Feng, Yuan Lei, Zhang Zongzhang, Yu Yang

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9044-9056. doi: 10.1109/TNNLS.2024.3420791. Epub 2025 May 2.

DOI:10.1109/TNNLS.2024.3420791
PMID:39283788
Abstract

Utilizing messages from teammates can improve coordination in cooperative multiagent reinforcement learning (MARL). Previous works typically combine raw messages of teammates with local information as inputs for policy. However, neglecting message aggregation poses significant inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in cooperative MARL. In this article, we propose Multiagent communication via Self-supervised Information Aggregation (MASIA), where agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation-invariant message encoder to generate common information-aggregated representation from messages and optimize it via reconstructing and shooting future information in a self-supervised manner. Hence, each agent would utilize the most relevant parts of the aggregated representation for decision-making by a novel message extraction mechanism. Furthermore, considering the potential of offline learning for real-world applications, we build offline benchmarks for multiagent communication, which is the first as we know. Empirical results demonstrate the superiority of our method in both online and offline settings. We also release the built offline benchmarks in this article as a testbed for communication ability validation to facilitate further future research in this direction.

摘要

相似文献

1
Efficient Communication via Self-Supervised Information Aggregation for Online and Offline Multiagent Reinforcement Learning.
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9044-9056. doi: 10.1109/TNNLS.2024.3420791. Epub 2025 May 2.
2
Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning.分散式多智能体强化学习中的注意力关系状态表示
IEEE Trans Cybern. 2022 Jan;52(1):252-264. doi: 10.1109/TCYB.2020.2979803. Epub 2022 Jan 11.
3
Multiagent Reinforcement Learning With Graphical Mutual Information Maximization.基于图形互信息最大化的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Feb 16;PP. doi: 10.1109/TNNLS.2023.3243557.
4
TIMAR: Transition-informed representation for sample-efficient multi-agent reinforcement learning.TIMAR:用于样本高效多智能体强化学习的转换感知表示
Neural Netw. 2025 Apr;184:107081. doi: 10.1016/j.neunet.2024.107081. Epub 2024 Dec 31.
5
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning.NVIF:用于协作式大规模多智能体强化学习的相邻变分信息流
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17829-17841. doi: 10.1109/TNNLS.2023.3309608. Epub 2024 Dec 2.
6
Constraining an Unconstrained Multi-agent Policy with offline data.使用离线数据约束无约束多智能体策略。
Neural Netw. 2025 Jun;186:107253. doi: 10.1016/j.neunet.2025.107253. Epub 2025 Feb 13.
7
Lateral Transfer Learning for Multiagent Reinforcement Learning.多智能体强化学习的横向迁移学习。
IEEE Trans Cybern. 2023 Mar;53(3):1699-1711. doi: 10.1109/TCYB.2021.3108237. Epub 2023 Feb 15.
8
Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network.基于异构图注意力网络的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6851-6860. doi: 10.1109/TNNLS.2022.3215774. Epub 2023 Oct 5.
9
Multiexperience-Assisted Efficient Multiagent Reinforcement Learning.多体验辅助的高效多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):12678-12692. doi: 10.1109/TNNLS.2023.3264275. Epub 2024 Sep 3.
10
Multiagent Continual Coordination via Progressive Task Contextualization.通过渐进式任务情境化实现多智能体持续协调
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6326-6340. doi: 10.1109/TNNLS.2024.3394513. Epub 2025 Apr 4.