一类新型的ε-最优学习自动机。

A new class of epsilon-optimal learning automata.

作者信息

Papadimitriou Georgios I, Sklira Maria, Pomportsis Andreas S

机构信息

Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):246-54. doi: 10.1109/tsmcb.2003.811117.

DOI:10.1109/tsmcb.2003.811117

PMID:15369067

Abstract

A new class of P-model absorbing learning automata is introduced. The proposed automata are based on the use of a stochastic estimator in order to achieve a rapid and accurate convergence when operating in stationary random environments. According to the proposed stochastic estimator scheme, the estimates of the reward probabilities of actions are not strictly dependent on the environmental responses. The dependence between the stochastic estimates and the deterministic ones is more relaxed for actions that have been selected only a few times. In this way, actions that have been selected only a few times, have the opportunity to be estimated as "optimal," to increase their choice probability and consequently, to be selected. In this way, the estimates become more reliable and consequently, the automaton rapidly and accurately converges to the optimal action. The asymptotic behavior of the proposed scheme is analyzed and it is proved to be epsilon-optimal in every stationary random environment. Furthermore, extensive simulation results are presented that indicate that the proposed stochastic estimator scheme converges faster than the deterministic-estimator-based DP(RI) and DGPA schemes when operating in stationary P-model random environments.

摘要

引入了一类新的P模型吸收学习自动机。所提出的自动机基于使用随机估计器，以便在平稳随机环境中运行时实现快速且准确的收敛。根据所提出的随机估计器方案，动作奖励概率的估计并不严格依赖于环境响应。对于仅被选择过几次的动作，随机估计与确定性估计之间的依赖性更为宽松。通过这种方式，仅被选择过几次的动作有机会被估计为“最优”，以增加其选择概率，从而被选中。这样，估计变得更加可靠，因此自动机能够快速且准确地收敛到最优动作。分析了所提出方案的渐近行为，并证明其在每个平稳随机环境中都是ε最优的。此外，给出了广泛的仿真结果，表明所提出的随机估计器方案在平稳P模型随机环境中运行时比基于确定性估计器的DP(RI)和DGPA方案收敛得更快。

相似文献

A new class of epsilon-optimal learning automata.

IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):246-54. doi: 10.1109/tsmcb.2003.811117.

Last-position elimination-based learning automata.

IEEE Trans Cybern. 2014 Dec;44(12):2484-92. doi: 10.1109/TCYB.2014.2309478. Epub 2014 Apr 2.

Generalized pursuit learning schemes: new families of continuous and discretized learning automata.

IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):738-49. doi: 10.1109/TSMCB.2002.1049608.

Continuous and discretized pursuit learning schemes: various algorithms and their comparison.

IEEE Trans Syst Man Cybern B Cybern. 2001;31(3):277-87. doi: 10.1109/3477.931507.

Fast and Epsilon-Optimal Discretized Pursuit Learning Automata.

IEEE Trans Cybern. 2015 Oct;45(10):2089-99. doi: 10.1109/TCYB.2014.2365463. Epub 2014 Nov 13.

Local linear estimation for spatial random processes with stochastic trend and stationary noise.

Sankhya Ser B. 2018 Nov;80(2):369-394. doi: 10.1007/s13571-018-0155-4. Epub 2018 Mar 9.

The Hierarchical Continuous Pursuit Learning Automation: A Novel Scheme for Environments With Large Numbers of Actions.

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):512-526. doi: 10.1109/TNNLS.2019.2905162. Epub 2019 Apr 11.

An Efficient Parameter-Free Learning Automaton Scheme.

IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):4849-4863. doi: 10.1109/TNNLS.2020.3025937. Epub 2021 Oct 27.

The Hierarchical Discrete Pursuit Learning Automaton: A Novel Scheme With Fast Convergence and Epsilon-Optimality.

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8278-8292. doi: 10.1109/TNNLS.2022.3226538. Epub 2024 Jun 3.

Multimodal searching technique based on learning automata with continuous input and changing number of actions.

IEEE Trans Syst Man Cybern B Cybern. 1996;26(4):666-73. doi: 10.1109/3477.517043.

引用本文的文献

A parameter-free learning automaton scheme.

Front Neurorobot. 2022 Sep 23;16:999658. doi: 10.3389/fnbot.2022.999658. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一类新型的ε-最优学习自动机。

A new class of epsilon-optimal learning automata.

作者信息

Papadimitriou Georgios I, Sklira Maria, Pomportsis Andreas S

机构信息

Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):246-54. doi: 10.1109/tsmcb.2003.811117.

DOI:10.1109/tsmcb.2003.811117

PMID:15369067

Abstract

摘要

一类新型的ε-最优学习自动机。

A new class of epsilon-optimal learning automata.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一类新型的ε-最优学习自动机。

A new class of epsilon-optimal learning automata.

作者信息

机构信息

出版信息

相似文献

引用本文的文献