广义追踪学习方案：连续和离散学习自动机的新类别

Generalized pursuit learning schemes: new families of continuous and discretized learning automata.

作者信息

Agache M, Oommen B J

机构信息

Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):738-49. doi: 10.1109/TSMCB.2002.1049608.

DOI:10.1109/TSMCB.2002.1049608

PMID:18244880

Abstract

The fastest learning automata (LA) algorithms currently available fall in the family of estimator algorithms introduced by Thathachar and Sastry (1986). The pioneering work of these authors was the pursuit algorithm, which pursues only the current estimated optimal action. If this action is not the one with the minimum penalty probability, this algorithm pursues a wrong action. In this paper, we argue that a pursuit scheme that generalizes the traditional pursuit algorithm by pursuing all the actions with higher reward estimates than the chosen action, minimizes the probability of pursuing a wrong action, and is a faster converging scheme. To attest this, we present two new generalized pursuit algorithms (GPAs) and also present a quantitative comparison of their performance against the existing pursuit algorithms. Empirically, the algorithms proposed here are among the fastest reported LA to date.

摘要

目前可用的最快学习自动机（LA）算法属于Thathachar和Sastry（1986）引入的估计器算法家族。这些作者的开创性工作是追踪算法，该算法仅追踪当前估计的最优动作。如果此动作不是具有最小惩罚概率的动作，则该算法追踪的是错误动作。在本文中，我们认为一种追踪方案通过追踪所有奖励估计高于所选动作的动作来推广传统追踪算法，可将追踪错误动作的概率降至最低，并且是一种收敛更快的方案。为了证明这一点，我们提出了两种新的广义追踪算法（GPA），并对它们与现有追踪算法的性能进行了定量比较。从经验上看，这里提出的算法是迄今为止报道的最快的LA算法之一。

相似文献

Generalized pursuit learning schemes: new families of continuous and discretized learning automata.

IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):738-49. doi: 10.1109/TSMCB.2002.1049608.

Continuous and discretized pursuit learning schemes: various algorithms and their comparison.

IEEE Trans Syst Man Cybern B Cybern. 2001;31(3):277-87. doi: 10.1109/3477.931507.

Last-position elimination-based learning automata.

IEEE Trans Cybern. 2014 Dec;44(12):2484-92. doi: 10.1109/TCYB.2014.2309478. Epub 2014 Apr 2.

Random early detection for congestion avoidance in wired networks: a discretized pursuit learning-automata-like solution.

IEEE Trans Syst Man Cybern B Cybern. 2010 Feb;40(1):66-76. doi: 10.1109/TSMCB.2009.2032363.

Fast and Epsilon-Optimal Discretized Pursuit Learning Automata.

IEEE Trans Cybern. 2015 Oct;45(10):2089-99. doi: 10.1109/TCYB.2014.2365463. Epub 2014 Nov 13.

Finite time analysis of the pursuit algorithm for learning automata.

IEEE Trans Syst Man Cybern B Cybern. 1996;26(4):590-8. doi: 10.1109/3477.517033.

Varieties of learning automata: an overview.

IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):711-22. doi: 10.1109/TSMCB.2002.1049606.

Discretized learning automata solutions to the capacity assignment problem for prioritized networks.

IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):821-31. doi: 10.1109/TSMCB.2002.1049616.

The Hierarchical Continuous Pursuit Learning Automation: A Novel Scheme for Environments With Large Numbers of Actions.

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):512-526. doi: 10.1109/TNNLS.2019.2905162. Epub 2019 Apr 11.

A team of continuous-action learning automata for noise-tolerant learning of half-spaces.

IEEE Trans Syst Man Cybern B Cybern. 2010 Feb;40(1):19-28. doi: 10.1109/TSMCB.2009.2032155.

引用本文的文献

A parameter-free learning automaton scheme.

Front Neurorobot. 2022 Sep 23;16:999658. doi: 10.3389/fnbot.2022.999658. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

广义追踪学习方案：连续和离散学习自动机的新类别

Generalized pursuit learning schemes: new families of continuous and discretized learning automata.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献