Suppr超能文献

合作星际争霸游戏中的半集中式深度确定性策略梯度

Semicentralized Deep Deterministic Policy Gradient in Cooperative StarCraft Games.

作者信息

Xie Dong, Zhong Xiangnan

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1584-1593. doi: 10.1109/TNNLS.2020.3042943. Epub 2022 Apr 4.

Abstract

In this article, we propose a novel semicentralized deep deterministic policy gradient (SCDDPG) algorithm for cooperative multiagent games. Specifically, we design a two-level actor-critic structure to help the agents with interactions and cooperation in the StarCraft combat. The local actor-critic structure is established for each kind of agents with partially observable information received from the environment. Then, the global actor-critic structure is built to provide the local design an overall view of the combat based on the limited centralized information, such as the health value. These two structures work together to generate the optimal control action for each agent and to achieve better cooperation in the games. Comparing with the fully centralized methods, this design can reduce the communication burden by only sending limited information to the global level during the learning process. Furthermore, the reward functions are also designed for both local and global structures based on the agents' attributes to further improve the learning performance in the stochastic environment. The developed method has been demonstrated on several scenarios in a real-time strategy game, i.e., StarCraft. The simulation results show that the agents can effectively cooperate with their teammates and defeat the enemies in various StarCraft scenarios.

摘要

在本文中,我们提出了一种用于合作多智能体游戏的新型半集中式深度确定性策略梯度(SCDDPG)算法。具体而言,我们设计了一种两级演员-评论家结构,以帮助智能体在星际争霸战斗中进行交互与合作。针对从环境中接收到部分可观测信息的每种智能体,建立局部演员-评论家结构。然后,构建全局演员-评论家结构,以便基于有限的集中信息(如生命值)为局部设计提供战斗的整体视图。这两种结构协同工作,为每个智能体生成最优控制动作,并在游戏中实现更好的合作。与完全集中式方法相比,这种设计可以通过在学习过程中仅向全局级别发送有限信息来减轻通信负担。此外,还基于智能体的属性为局部和全局结构设计了奖励函数,以进一步提高在随机环境中的学习性能。所开发的方法已在实时策略游戏(即星际争霸)的多个场景中得到验证。仿真结果表明,智能体能够在各种星际争霸场景中有效地与队友合作并击败敌人。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验