QTypeMix: Enhancing multi-agent cooperative strategies through heterogeneous and homogeneous value decomposition.

Author information

Fu Songchen, Zhao Shaojing, Li Ta, Yan Yonghong

Affiliations

Laboratory of Speech and Intelligent Information Processing, Institute of Acoustics, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.

Publication information

Neural Netw. 2025 Apr;184:107093. doi: 10.1016/j.neunet.2024.107093. Epub 2024 Dec 29.

DOI: 10.1016/j.neunet.2024.107093
PMID: 39746247
Abstract

In multi-agent cooperative tasks, the presence of heterogeneous agents is familiar. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of multi-agent systems often involves a large amount of complex interaction information, making it more challenging to learn heterogeneous strategies. Related multi-agent reinforcement learning methods sometimes use grouping mechanisms to form smaller cooperative groups or leverage prior domain knowledge to learn strategies for different roles. In contrast, agents should learn deeper role features without relying on additional information. Therefore, we propose QTypeMix, which divides the value decomposition process into homogeneous and heterogeneous stages. QTypeMix learns to extract type features from local historical observations through the TE loss. In addition, we introduce advanced network structures containing attention mechanisms and hypernets to enhance the representation capability and achieve the value decomposition process. The results of testing the proposed method on 14 maps from SMAC and SMACv2 show that QTypeMix achieves state-of-the-art performance in tasks of varying difficulty.
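
The abstract outlines a two-stage mixing architecture: per-agent utilities are first combined within each agent type (the homogeneous stage), and the resulting per-type values are then mixed into a joint value (the heterogeneous stage), with hypernetworks generating the mixing weights from the global state. The sketch below shows one way such a decomposition could be wired up in PyTorch. It is an illustration under assumptions, not the authors' implementation: the class names, the fixed agent-type assignment, the hidden sizes, and the QMIX-style monotonic mixers are choices made here for brevity.

```python
# A minimal sketch in the spirit of QTypeMix's two-stage value decomposition,
# NOT the authors' implementation. The agent-type assignment, network sizes,
# and the omission of the paper's TE loss and attention modules are assumptions.
import torch
import torch.nn as nn


class HyperMixer(nn.Module):
    """QMIX-style monotonic mixer whose weights are produced by hypernetworks
    conditioned on the global state (non-negativity enforced via abs)."""

    def __init__(self, n_inputs: int, state_dim: int, embed_dim: int = 32):
        super().__init__()
        self.n_inputs, self.embed_dim = n_inputs, embed_dim
        self.hyper_w1 = nn.Linear(state_dim, n_inputs * embed_dim)
        self.hyper_b1 = nn.Linear(state_dim, embed_dim)
        self.hyper_w2 = nn.Linear(state_dim, embed_dim)
        self.hyper_b2 = nn.Sequential(
            nn.Linear(state_dim, embed_dim), nn.ReLU(), nn.Linear(embed_dim, 1)
        )

    def forward(self, qs: torch.Tensor, state: torch.Tensor) -> torch.Tensor:
        # qs: (batch, n_inputs), state: (batch, state_dim) -> (batch, 1)
        b = qs.size(0)
        w1 = torch.abs(self.hyper_w1(state)).view(b, self.n_inputs, self.embed_dim)
        b1 = self.hyper_b1(state).view(b, 1, self.embed_dim)
        hidden = torch.relu(torch.bmm(qs.unsqueeze(1), w1) + b1)
        w2 = torch.abs(self.hyper_w2(state)).view(b, self.embed_dim, 1)
        b2 = self.hyper_b2(state).view(b, 1, 1)
        return (torch.bmm(hidden, w2) + b2).view(b, 1)


class TwoStageMixer(nn.Module):
    """Stage 1 (homogeneous): mix per-agent Q-values within each agent type.
    Stage 2 (heterogeneous): mix the per-type values into Q_tot."""

    def __init__(self, agent_types: list, state_dim: int):
        super().__init__()
        self.agent_types = agent_types          # e.g. [0, 0, 1, 1, 1]
        self.type_ids = sorted(set(agent_types))
        self.homo_mixers = nn.ModuleDict(
            {str(t): HyperMixer(agent_types.count(t), state_dim) for t in self.type_ids}
        )
        self.hetero_mixer = HyperMixer(len(self.type_ids), state_dim)

    def forward(self, agent_qs: torch.Tensor, state: torch.Tensor) -> torch.Tensor:
        # agent_qs: (batch, n_agents), state: (batch, state_dim) -> (batch, 1)
        type_qs = []
        for t in self.type_ids:
            idx = [i for i, at in enumerate(self.agent_types) if at == t]
            type_qs.append(self.homo_mixers[str(t)](agent_qs[:, idx], state))
        return self.hetero_mixer(torch.cat(type_qs, dim=1), state)


if __name__ == "__main__":
    mixer = TwoStageMixer(agent_types=[0, 0, 1, 1, 1], state_dim=48)
    q_tot = mixer(torch.randn(8, 5), torch.randn(8, 48))
    print(q_tot.shape)  # torch.Size([8, 1])
```

Per the abstract, the full method additionally learns type features from local observation histories via the TE loss and uses attention mechanisms inside the mixing networks; those components are left out of this sketch.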

Similar articles

1. QTypeMix: Enhancing multi-agent cooperative strategies through heterogeneous and homogeneous value decomposition.
Neural Netw. 2025 Apr;184:107093. doi: 10.1016/j.neunet.2024.107093. Epub 2024 Dec 29.
2. IHG-MA: Inductive heterogeneous graph multi-agent reinforcement learning for multi-intersection traffic signal control.
Neural Netw. 2021 Jul;139:265-277. doi: 10.1016/j.neunet.2021.03.015. Epub 2021 Mar 22.
3. A fully value distributional deep reinforcement learning framework for multi-agent cooperation.
Neural Netw. 2025 Apr;184:107035. doi: 10.1016/j.neunet.2024.107035. Epub 2024 Dec 14.
4. MuDE: Multi-agent decomposed reward-based exploration.
Neural Netw. 2024 Nov;179:106565. doi: 10.1016/j.neunet.2024.106565. Epub 2024 Jul 22.
5. Coordinating Multi-Agent Reinforcement Learning via Dual Collaborative Constraints.
Neural Netw. 2025 Feb;182:106858. doi: 10.1016/j.neunet.2024.106858. Epub 2024 Nov 12.
6. Hierarchical Attention Master-Slave for heterogeneous multi-agent reinforcement learning.
Neural Netw. 2023 May;162:359-368. doi: 10.1016/j.neunet.2023.02.037. Epub 2023 Mar 4.
7. HyperComm: Hypergraph-based communication in multi-agent reinforcement learning.
Neural Netw. 2024 Oct;178:106432. doi: 10.1016/j.neunet.2024.106432. Epub 2024 Jun 10.
8. Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning.
Neural Netw. 2025 Jan;181:106852. doi: 10.1016/j.neunet.2024.106852. Epub 2024 Nov 2.
9. Multiagent Reinforcement Learning With Heterogeneous Graph Attention Network.
IEEE Trans Neural Netw Learn Syst. 2023 Oct;34(10):6851-6860. doi: 10.1109/TNNLS.2022.3215774. Epub 2023 Oct 5.
10. VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning.
IEEE Trans Neural Netw Learn Syst. 2024 Jan;35(1):182-195. doi: 10.1109/TNNLS.2022.3172572. Epub 2024 Jan 4.