The Hierarchical Discrete Pursuit Learning Automaton: A Novel Scheme With Fast Convergence and Epsilon-Optimality.

Author information

Omslandseter Rebekka Olsson, Jiao Lei, Zhang Xuan, Yazidi Anis, Oommen B John

Publication information

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8278-8292. doi: 10.1109/TNNLS.2022.3226538. Epub 2024 Jun 3.

DOI:10.1109/TNNLS.2022.3226538
PMID:37015672
Abstract

Since the early 1960s, the paradigm of learning automata (LA) has experienced abundant interest. Arguably, it has also served as the foundation for the phenomenon and field of reinforcement learning (RL). Over the decades, new concepts and fundamental principles have been introduced to increase the LA's speed and accuracy. These include using probability updating functions, discretizing the probability space, and using the "Pursuit" concept. Very recently, the concept of incorporating "structure" into the ordering of the LA's actions has improved both the speed and accuracy of the corresponding hierarchical machines, when the number of actions is large. This has led to the ϵ-optimal hierarchical continuous pursuit LA (HCPA). This article pioneers the inclusion of all the above-mentioned phenomena into a new single LA, leading to the novel hierarchical discretized pursuit LA (HDPA). Indeed, although the previously proposed HCPA is powerful, its speed has an impediment when any action probability is close to unity, because the updates of the components of the probability vector are correspondingly smaller when any action probability becomes closer to unity. We propose here, the novel HDPA, where we infuse the phenomenon of discretization into the action probability vector's updating functionality, and which is invoked recursively at every stage of the machine's hierarchical structure. This discretized functionality does not possess the same impediment, because discretization prohibits it. We demonstrate the HDPA's robustness and validity by formally proving the ϵ-optimality by utilizing the moderation property. We also invoke the submartingale characteristic at every level, to prove that the action probability of the optimal action converges to unity as time goes to infinity. Apart from the new machine being ϵ-optimal, the numerical results demonstrate that the number of iterations required for convergence is significantly reduced for the HDPA, when compared to the state-of-the-art HCPA scheme.
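
The contrast the abstract draws between continuous and discretized updates can be illustrated with a minimal sketch. It is not the HDPA itself: it shows only a flat, single-level pursuit-style update, and it assumes the index of the currently best-estimated action (`best`) is already known. The function names, the learning rate `lam`, and the step size `delta` are hypothetical choices for illustration; the abstract does not give the paper's update rules or parameters.

```python
from fractions import Fraction


def continuous_pursuit_step(p, best, lam):
    """Move p a fraction lam of the way toward the unit vector for `best`.

    The gain of p[best] is lam * (1 - p[best]), so it shrinks near unity.
    """
    return [(1 - lam) * pj + (lam if j == best else 0) for j, pj in enumerate(p)]


def discretized_pursuit_step(p, best, delta):
    """Lower every other probability by a fixed delta (floored at 0).

    The best action receives whatever mass remains, so the step size
    does not depend on how close p[best] already is to unity.
    """
    q = [0 if j == best else max(pj - delta, 0) for j, pj in enumerate(p)]
    q[best] = 1 - sum(q)
    return q


def steps_to_reach(step, p, best, threshold, **params):
    """Count consecutive updates needed until p[best] >= threshold."""
    count = 0
    while p[best] < threshold:
        p = step(p, best, **params)
        count += 1
    return count


if __name__ == "__main__":
    start = [Fraction(1, 2), Fraction(1, 2)]  # two actions, exact arithmetic
    target = Fraction(99, 100)
    print(steps_to_reach(continuous_pursuit_step, start, 0, target,
                         lam=Fraction(1, 10)))
    print(steps_to_reach(discretized_pursuit_step, start, 0, target,
                         delta=Fraction(1, 20)))
```

With these illustrative settings both rules gain 1/20 on the first step, but the continuous rule's gain falls to 1/200 once the best action's probability reaches 19/20, while the discretized rule still gains 1/20. This sketch assumes the same action stays the best estimate throughout, which a real automaton interacting with a stochastic environment does not guarantee.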

Similar articles

1. The Hierarchical Discrete Pursuit Learning Automaton: A Novel Scheme With Fast Convergence and Epsilon-Optimality.
   IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8278-8292. doi: 10.1109/TNNLS.2022.3226538. Epub 2024 Jun 3.
2. The Hierarchical Continuous Pursuit Learning Automation: A Novel Scheme for Environments With Large Numbers of Actions.
   IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):512-526. doi: 10.1109/TNNLS.2019.2905162. Epub 2019 Apr 11.
3. Fast and Epsilon-Optimal Discretized Pursuit Learning Automata.
   IEEE Trans Cybern. 2015 Oct;45(10):2089-99. doi: 10.1109/TCYB.2014.2365463. Epub 2014 Nov 13.
4. Last-position elimination-based learning automata.
   IEEE Trans Cybern. 2014 Dec;44(12):2484-92. doi: 10.1109/TCYB.2014.2309478. Epub 2014 Apr 2.
5. Continuous and discretized pursuit learning schemes: various algorithms and their comparison.
   IEEE Trans Syst Man Cybern B Cybern. 2001;31(3):277-87. doi: 10.1109/3477.931507.
6. A Conclusive Analysis of the Finite-Time Behavior of the Discretized Pursuit Learning Automaton.
   IEEE Trans Neural Netw Learn Syst. 2020 Jan;31(1):284-294. doi: 10.1109/TNNLS.2019.2900639. Epub 2019 Mar 19.
7. Generalized pursuit learning schemes: new families of continuous and discretized learning automata.
   IEEE Trans Syst Man Cybern B Cybern. 2002;32(6):738-49. doi: 10.1109/TSMCB.2002.1049608.
8. A new class of epsilon-optimal learning automata.
   IEEE Trans Syst Man Cybern B Cybern. 2004 Feb;34(1):246-54. doi: 10.1109/tsmcb.2003.811117.
9. Discretizing Continuous Action Space With Unimodal Probability Distributions for On-Policy Reinforcement Learning.
   IEEE Trans Neural Netw Learn Syst. 2025 Jun;36(6):11285-11297. doi: 10.1109/TNNLS.2024.3446371.
10. An Efficient Parameter-Free Learning Automaton Scheme.
    IEEE Trans Neural Netw Learn Syst. 2021 Nov;32(11):4849-4863. doi: 10.1109/TNNLS.2020.3025937. Epub 2021 Oct 27.

Cited by

1. A semi-supervised ensemble clustering algorithm for discovering relationships between different diseases by extracting cell-to-cell biological communications.
   J Cancer Res Clin Oncol. 2024 Jan 2;150(1):3. doi: 10.1007/s00432-023-05559-4.
2. Toward improving the performance of learning by joining feature selection and ensemble classification techniques: an application for cancer diagnosis.
   J Cancer Res Clin Oncol. 2023 Dec;149(19):16993-17006. doi: 10.1007/s00432-023-05422-6. Epub 2023 Sep 23.
3. Cancer detection in breast cells using a hybrid method based on deep complex neural network and data mining.
   J Cancer Res Clin Oncol. 2023 Nov;149(14):13331-13344. doi: 10.1007/s00432-023-05191-2. Epub 2023 Jul 24.
4. Data mining techniques in breast cancer diagnosis at the cellular-molecular level.
   J Cancer Res Clin Oncol. 2023 Nov;149(14):12605-12620. doi: 10.1007/s00432-023-05090-6. Epub 2023 Jul 14.
5. scFED: Clustering Identifying Cell Types of scRNA-Seq Data Based on Feature Engineering Denoising.
   Interdiscip Sci. 2023 Dec;15(4):590-601. doi: 10.1007/s12539-023-00574-y. Epub 2023 Jul 4.
6. Combining ensemble classification and integrated filter-evolutionary search for breast cancer diagnosis.
   J Cancer Res Clin Oncol. 2023 Sep;149(12):10753-10769. doi: 10.1007/s00432-023-04968-9. Epub 2023 Jun 13.
7. Automatic breast cancer diagnosis based on hybrid dimensionality reduction technique and ensemble classification.
   J Cancer Res Clin Oncol. 2023 Aug;149(10):7609-7627. doi: 10.1007/s00432-023-04699-x. Epub 2023 Mar 30.