Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems

Authors

Sun Changyin, Liu Wenzhang, Dong Lu

Publication information

IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2054-2065. doi: 10.1109/TNNLS.2020.2996209. Epub 2021 May 3.

DOI:10.1109/TNNLS.2020.2996209

Abstract

In this article, we study cooperative multiagent systems (MASs) with multiple tasks by using reinforcement learning (RL)-based algorithms. The target for a single-agent RL system is represented by its scalar reward signals. However, for an MAS with multiple cooperative tasks, the holistic reward signal consists of multiple parts to represent the tasks, which makes the problem complicated. Existing multiagent RL algorithms search distributed policies with holistic reward signals directly, making it difficult to obtain an optimal policy for each task. This article provides efficient learning-based algorithms such that each agent can learn a joint optimal policy to accomplish these multiple tasks cooperatively with other agents. The main idea of the algorithms is to decompose the holistic reward signal for each agent into multiple parts according to the subtasks, and then the proposed algorithms learn multiple value functions with the decomposed reward signals and update the policy with the sum of distributed value functions. In addition, this article presents a theoretical analysis of the proposed approach. Finally, the simulation results for both discrete decision-making and continuous control problems have demonstrated the effectiveness of the proposed algorithms.
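The core idea in the abstract (split the reward by subtask, learn one value function per part, act on their sum) can be illustrated with a minimal tabular sketch. This is not the authors' algorithm: it uses a single agent, a hypothetical five-cell corridor, and made-up reward parts (`goal` and `step cost`), and all names and constants below are assumptions chosen for illustration. Each component is updated toward the action that is greedy with respect to the summed values, so the sum behaves like ordinary Q-learning on the total reward.

```python
import random

# Hypothetical toy setup: corridor cells, actions {0: left, 1: right}, two reward parts.
N_STATES, N_ACTIONS, N_SUBTASKS = 5, 2, 2
GAMMA, ALPHA, EPSILON = 0.9, 0.5, 0.2


def step(state, action):
    """Assumed toy dynamics: move one cell left or right, clipped to the corridor."""
    nxt = max(0, min(N_STATES - 1, state + (1 if action == 1 else -1)))
    done = nxt == N_STATES - 1
    # Decomposed reward: part 0 rewards reaching the goal, part 1 is a per-step cost.
    rewards = [1.0 if done else 0.0, -0.05]
    return nxt, rewards, done


def summed_q(q_parts, state, action):
    """Sum of the per-subtask value estimates for one state-action pair."""
    return sum(q[state][action] for q in q_parts)


def greedy_action(q_parts, state):
    """Action that maximizes the summed value (ties go to the lower index)."""
    return max(range(N_ACTIONS), key=lambda a: summed_q(q_parts, state, a))


def train(episodes=300, seed=0, max_steps=100):
    rng = random.Random(seed)
    # One Q-table per reward part, all initialized to zero.
    q_parts = [[[0.0] * N_ACTIONS for _ in range(N_STATES)]
               for _ in range(N_SUBTASKS)]
    for _ in range(episodes):
        state, done, steps = 0, False, 0
        while not done and steps < max_steps:
            # Epsilon-greedy behavior policy driven by the summed values.
            if rng.random() < EPSILON:
                action = rng.randrange(N_ACTIONS)
            else:
                action = greedy_action(q_parts, state)
            nxt, rewards, done = step(state, action)
            # Every component bootstraps from the same next action:
            # the one that is greedy for the sum, not for the component alone.
            next_action = greedy_action(q_parts, nxt)
            for q, r in zip(q_parts, rewards):
                target = r if done else r + GAMMA * q[nxt][next_action]
                q[state][action] += ALPHA * (target - q[state][action])
            state, steps = nxt, steps + 1
    return q_parts


if __name__ == "__main__":
    q_parts = train()
    for s in range(N_STATES - 1):
        parts = [round(q[s][1], 3) for q in q_parts]
        print(f"state {s}: greedy={greedy_action(q_parts, s)} parts(right)={parts}")
```

Keeping the tables separate makes it possible to inspect how much each subtask contributes to a decision, while the policy itself only ever looks at the sum. Extending this to several cooperating agents or to function approximation would need the machinery described in the article itself.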


Similar articles

Reinforcement Learning With Task Decomposition for Cooperative Multiagent Systems.

IEEE Trans Neural Netw Learn Syst. 2021 May;32(5):2054-2065. doi: 10.1109/TNNLS.2020.2996209. Epub 2021 May 3.

VGN: Value Decomposition With Graph Attention Networks for Multiagent Reinforcement Learning.

IEEE Trans Neural Netw Learn Syst. 2024 Jan;35(1):182-195. doi: 10.1109/TNNLS.2022.3172572. Epub 2024 Jan 4.

Consensus, cooperative learning, and flocking for multiagent predator avoidance.

Int J Adv Robot Syst. 2020 Sep 1;17(5). doi: 10.1177/1729881420960342. Epub 2020 Sep 24.

Off-Policy Reinforcement Learning for Synchronization in Multiagent Graphical Games.

IEEE Trans Neural Netw Learn Syst. 2017 Oct;28(10):2434-2445. doi: 10.1109/TNNLS.2016.2609500. Epub 2017 Apr 17.

Learning Automata-Based Multiagent Reinforcement Learning for Optimization of Cooperative Tasks.

IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4639-4652. doi: 10.1109/TNNLS.2020.3025711. Epub 2021 Oct 5.

A Collaborative Multiagent Reinforcement Learning Method Based on Policy Gradient Potential.

IEEE Trans Cybern. 2021 Feb;51(2):1015-1027. doi: 10.1109/TCYB.2019.2932203. Epub 2021 Jan 15.

Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model.

IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3570-3578. doi: 10.1109/TNNLS.2019.2945019. Epub 2019 Nov 1.

Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning.

IEEE Trans Cybern. 2021 Jan;51(1):174-187. doi: 10.1109/TCYB.2020.3015811. Epub 2020 Dec 22.

LJIR: Learning Joint-Action Intrinsic Reward in cooperative multi-agent reinforcement learning.

Neural Netw. 2023 Oct;167:450-459. doi: 10.1016/j.neunet.2023.08.016. Epub 2023 Aug 22.

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications.

IEEE Trans Cybern. 2020 Sep;50(9):3826-3839. doi: 10.1109/TCYB.2020.2977374. Epub 2020 Mar 20.
