
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks.

Publication information

IEEE Trans Cybern. 2017 Jun;47(6):1367-1379. doi: 10.1109/TCYB.2016.2544866. Epub 2016 Apr 14.

Abstract

In this paper, we propose a multiagent reinforcement learning algorithm for fully cooperative tasks, called frequency of the maximum reward Q-learning (FMRQ). FMRQ aims to reach one of the optimal Nash equilibria so as to optimize the performance index of the multiagent system. The frequency of obtaining the highest global immediate reward, rather than the immediate reward itself, is used as the reinforcement signal. With FMRQ, each agent does not need to observe the other agents' actions and only shares its state and reward at each step. We validate FMRQ through case studies of repeated games: four two-player two-action cases and one three-player two-action case. It is demonstrated that FMRQ converges to one of the optimal Nash equilibria in these cases. Moreover, comparison experiments are conducted on tasks with multiple states and finite steps: one is a box-pushing task and the other is a distributed sensor network problem. Experimental results show that the proposed algorithm outperforms the compared algorithms.
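The core idea above can be illustrated with a toy sketch. The snippet below is not the paper's FMRQ update; it is a simplified reading of its reinforcement signal, assumed here for illustration: two independent learners play a repeated two-player two-action coordination game, and each agent scores its own actions by the frequency with which they produced the maximum global reward instead of by the raw reward. The payoff matrix, the `FreqAgent` class, and the exploration scheme are all hypothetical choices, not from the paper.

```python
import random

# Shared (global) payoff for a two-player coordination game.
# Joint action (0, 0) is the optimal Nash equilibrium.
PAYOFF = [[11, -30],
          [-30, 7]]
R_MAX = 11  # highest global immediate reward

class FreqAgent:
    """Independent learner that ranks actions by the frequency with
    which they yielded the maximum global reward (simplified signal)."""
    def __init__(self, n_actions=2, eps=0.1):
        self.plays = [0] * n_actions  # times each action was taken
        self.hits = [0] * n_actions   # times it yielded the max reward
        self.eps = eps

    def greedy(self):
        freqs = [h / p if p else 0.0
                 for h, p in zip(self.hits, self.plays)]
        return max(range(len(freqs)), key=freqs.__getitem__)

    def act(self):
        if random.random() < self.eps:
            return random.randrange(len(self.plays))
        return self.greedy()

    def update(self, action, reward):
        self.plays[action] += 1
        if reward == R_MAX:
            self.hits[action] += 1

random.seed(0)
a, b = FreqAgent(), FreqAgent()
for _ in range(2000):
    ia, ib = a.act(), b.act()
    r = PAYOFF[ia][ib]  # both agents receive the same global reward
    a.update(ia, r)
    b.update(ib, r)

print(a.greedy(), b.greedy())
```

Note why the frequency signal matters in this game: averaging raw rewards would penalize action 0 for the occasional -30 caused by the other agent's exploration, whereas action 1 never produces the maximum reward at all, so its frequency stays at zero and both agents settle on the optimal joint action (0, 0).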

