• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于异构图注意力网络的多智能体强化学习

Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network.

作者信息

Du Wei, Ding Shifei, Zhang Chenglong, Shi Zhongzhi

出版信息

IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6851-6860. doi: 10.1109/TNNLS.2022.3215774. Epub 2023 Oct 5.

DOI:10.1109/TNNLS.2022.3215774
PMID:36331648
Abstract

Most recent research on multiagent reinforcement learning (MARL) has explored how to deploy cooperative policies for homogeneous agents. However, realistic multiagent environments may contain heterogeneous agents that have different attributes or tasks. The heterogeneity of the agents and the diversity of relationships cause the learning of policy excessively tough. To tackle this difficulty, we present a novel method that employs a heterogeneous graph attention network to model the relationships between heterogeneous agents. The proposed method can generate an integrated feature representation for each agent by hierarchically aggregating latent feature information of neighbor agents, with the importance of the agent level and the relationship level being entirely considered. The method is agnostic to specific MARL methods and can be flexibly integrated with diverse value decomposition methods. We conduct experiments in predator-prey and StarCraft Multiagent Challenge (SMAC) environments, and the empirical results demonstrate that the performance of our method is superior to existing methods in several heterogeneous scenarios.

摘要

最近关于多智能体强化学习(MARL)的研究大多探讨了如何为同质智能体部署合作策略。然而,现实中的多智能体环境可能包含具有不同属性或任务的异质智能体。智能体的异质性和关系的多样性使得策略学习过于困难。为了解决这一难题,我们提出了一种新颖的方法,该方法采用异构图注意力网络对异质智能体之间的关系进行建模。所提出的方法可以通过分层聚合邻居智能体的潜在特征信息,为每个智能体生成一个综合特征表示,同时充分考虑智能体层面和关系层面的重要性。该方法与特定的MARL方法无关,可以灵活地与各种值分解方法集成。我们在捕食者 - 猎物和星际争霸多智能体挑战赛(SMAC)环境中进行了实验,实证结果表明,在几种异质场景下,我们的方法性能优于现有方法。

相似文献

1
Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network.基于异构图注意力网络的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6851-6860. doi: 10.1109/TNNLS.2022.3215774. Epub 2023 Oct 5.
2
Multiagent Reinforcement Learning With Graphical Mutual Information Maximization.基于图形互信息最大化的多智能体强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Feb 16;PP. doi: 10.1109/TNNLS.2023.3243557.
3
VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning.VGN:用于多智能体强化学习的基于图注意力网络的价值分解
IEEE Trans Neural Netw Learn Syst. 2024 Jan;35(1):182-195. doi: 10.1109/TNNLS.2022.3172572. Epub 2024 Jan 4.
4
Lateral Transfer Learning for Multiagent Reinforcement Learning.多智能体强化学习的横向迁移学习。
IEEE Trans Cybern. 2023 Mar;53(3):1699-1711. doi: 10.1109/TCYB.2021.3108237. Epub 2023 Feb 15.
5
A Distributional Perspective on Multiagent Cooperation With Deep Reinforcement Learning.基于深度强化学习的多智能体合作的分布视角
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):4246-4259. doi: 10.1109/TNNLS.2022.3202097. Epub 2024 Feb 29.
6
SATF: A Scalable Attentive Transfer Framework for Efficient Multiagent Reinforcement Learning.SATF:一种用于高效多智能体强化学习的可扩展注意力转移框架
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6627-6641. doi: 10.1109/TNNLS.2024.3387397. Epub 2025 Apr 4.
7
Attentive Relational State Representation in Decentralized Multiagent Reinforcement Learning.分散式多智能体强化学习中的注意力关系状态表示
IEEE Trans Cybern. 2022 Jan;52(1):252-264. doi: 10.1109/TCYB.2020.2979803. Epub 2022 Jan 11.
8
IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control.IHG-MA:用于多交叉口交通信号控制的归纳异质图多智能体强化学习。
Neural Netw. 2021 Jul;139:265-277. doi: 10.1016/j.neunet.2021.03.015. Epub 2021 Mar 22.
9
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning.SMIX(λ):增强用于协作多智能体强化学习的集中式价值函数
IEEE Trans Neural Netw Learn Syst. 2023 Jan;34(1):52-63. doi: 10.1109/TNNLS.2021.3089493. Epub 2023 Jan 5.
10
TVDO: Tchebycheff Value-Decomposition Optimization for Multiagent Reinforcement Learning.TVDO:用于多智能体强化学习的切比雪夫值分解优化
IEEE Trans Neural Netw Learn Syst. 2025 Jul;36(7):12521-12534. doi: 10.1109/TNNLS.2024.3455422.