Oommen B J, Agache M
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont.
IEEE Trans Syst Man Cybern B Cybern. 2001;31(3):277-87. doi: 10.1109/3477.931507.
A learning automaton (LA) is an automaton that interacts with a random environment, with the goal of learning the optimal action from its acquired experience. Many learning automata (LAs) have been proposed, and the class of estimator algorithms is among the fastest. Thathachar and Sastry, through the pursuit algorithm, introduced the concept of learning algorithms that pursue the current optimal action, following a reward-penalty learning philosophy. Later, Oommen and Lanctot extended the pursuit algorithm into the discretized world by presenting the discretized pursuit algorithm, based on a reward-inaction learning philosophy. In this paper, we argue that the reward-penalty and reward-inaction learning paradigms, in conjunction with the continuous and discrete models of computation, lead to four versions of pursuit learning automata. We contend that a scheme merging the pursuit concept with the most recent response of the environment permits the algorithm to utilize the LA's long-term and short-term perspectives of the environment. We present all four resultant pursuit algorithms, prove the ε-optimality of the newly introduced algorithms, and present a quantitative comparison between them.
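To make the pursuit idea concrete, the following is a minimal Python sketch of one of the four variants the abstract refers to: a continuous pursuit automaton with a reward-inaction update. The environment, the learning rate lam, the reward probabilities, and the function name pursuit_ri are illustrative assumptions, not details taken from the paper; in particular, the paper's algorithms also specify an initialization phase for the reward estimates that is omitted here.

```python
# Sketch of a continuous pursuit LA with a reward-inaction update (assumed
# parameters; not the authors' exact formulation).
import random

def pursuit_ri(reward_probs, lam=0.01, steps=20000, seed=0):
    rng = random.Random(seed)
    r = len(reward_probs)
    p = [1.0 / r] * r        # action probability vector (short-term behaviour)
    counts = [0] * r         # times each action was chosen
    rewards = [0] * r        # rewards received per action
    d_hat = [0.0] * r        # running reward estimates (long-term perspective)

    for _ in range(steps):
        # sample an action according to the current probability vector
        a = rng.choices(range(r), weights=p)[0]
        beta = 1 if rng.random() < reward_probs[a] else 0  # 1 = reward, 0 = penalty

        # update the reward estimate of the chosen action
        counts[a] += 1
        rewards[a] += beta
        d_hat[a] = rewards[a] / counts[a]

        # reward-inaction: adjust p only on rewarded steps,
        # pursuing the action with the highest current estimate
        if beta == 1:
            m = max(range(r), key=lambda i: d_hat[i])
            p = [(1 - lam) * pi + (lam if i == m else 0.0)
                 for i, pi in enumerate(p)]
    return p

if __name__ == "__main__":
    # hypothetical two-action environment with reward probabilities 0.8 and 0.6
    print(pursuit_ri([0.8, 0.6]))
```

The reward-penalty counterpart would also move p toward the currently estimated best action on penalized steps, and the discretized versions replace the continuous update by steps that are integer multiples of a fixed resolution.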