Kaya Mehmet, Alhajj Reda
Department of Computer Engineering, Firat University, 23119 Elaziğ, Turkey.
IEEE Trans Syst Man Cybern B Cybern. 2004 Apr;34(2):1210-23. doi: 10.1109/tsmcb.2003.821869.
To date, many researchers have proposed various methods to improve the learning ability in multiagent systems. However, most of these studies are not appropriate to more complex multiagent learning problems because the state space of each learning agent grows exponentially in terms of the number of partners present in the environment. Modeling other learning agents present in the domain as part of the state of the environment is not a realistic approach. In this paper, we combine advantages of the modular approach, fuzzy logic and the internal model in a single novel multiagent system architecture. The architecture is based on a fuzzy modular approach whose rule base is partitioned into several different modules. Each module deals with a particular agent in the environment and maps the input fuzzy sets to the action Q-values; these represent the state space of each learning module and the action space, respectively. Each module also uses an internal model table to estimate actions of the other agents. Finally, we investigate the integration of a parallel update method with the proposed architecture. Experimental results obtained on two different environments of a well-known pursuit domain show the effectiveness and robustness of the proposed multiagent architecture and learning approach.
到目前为止,许多研究人员已经提出了各种方法来提高多智能体系统中的学习能力。然而,这些研究大多不适用于更复杂的多智能体学习问题,因为每个学习智能体的状态空间会随着环境中存在的伙伴数量呈指数级增长。将领域中存在的其他学习智能体建模为环境状态的一部分并不是一种现实的方法。在本文中,我们在一个新颖的多智能体系统架构中结合了模块化方法、模糊逻辑和内部模型的优点。该架构基于一种模糊模块化方法,其规则库被划分为几个不同的模块。每个模块处理环境中的一个特定智能体,并将输入模糊集映射到动作Q值;这些分别代表每个学习模块的状态空间和动作空间。每个模块还使用一个内部模型表来估计其他智能体的动作。最后,我们研究了并行更新方法与所提出架构的集成。在一个著名的追踪领域的两种不同环境上获得的实验结果表明了所提出的多智能体架构和学习方法的有效性和鲁棒性。