• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自组织映射在强化学习中的应用。

Applications of the self-organising map to reinforcement learning.

作者信息

Smith Andrew James

机构信息

The Division of Informatics, Institute for Adaptive and Neural Computation, University of Edinburgh, UK.

出版信息

Neural Netw. 2002 Oct-Nov;15(8-9):1107-24. doi: 10.1016/s0893-6080(02)00083-7.

DOI:10.1016/s0893-6080(02)00083-7
PMID:12416698
Abstract

This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 1987] which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed.

摘要

本文关注强化学习(RL)问题中连续动作空间的表示与泛化。基于科霍宁的自组织映射(SOM)[《自组织与联想记忆》,1987年]提出了一个模型,该模型能够捕捉期望状态-动作映射的一对一、多对一或一对多结构。尽管这里是针对涉及即时奖励的任务提出的,但该方法可轻松扩展到延迟奖励。我们得出结论,在状态和动作空间的潜在维度较小的RL问题中,SOM是提供实时在线泛化的有用工具。还讨论了可扩展性问题。

相似文献

1
Applications of the self-organising map to reinforcement learning.自组织映射在强化学习中的应用。
Neural Netw. 2002 Oct-Nov;15(8-9):1107-24. doi: 10.1016/s0893-6080(02)00083-7.
2
Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting.在多智能体环境中采用不同强化学习(RL)方案的脉冲神经网络。
Chin J Physiol. 2010 Dec 31;53(6):447-53.
3
Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback.将时间差分方法与自组织神经网络相结合用于具有延迟评估反馈的强化学习。
IEEE Trans Neural Netw. 2008 Feb;19(2):230-44. doi: 10.1109/TNN.2007.905839.
4
On the equivalence between kernel self-organising maps and self-organising mixture density networks.关于核自组织映射与自组织混合密度网络之间的等价性
Neural Netw. 2006 Jul-Aug;19(6-7):780-4. doi: 10.1016/j.neunet.2006.05.007. Epub 2006 Jun 6.
5
Snap-drift neural network for self-organisation and sequence learning.快速漂移神经网络的自组织和序列学习。
Neural Netw. 2011 Oct;24(8):897-905. doi: 10.1016/j.neunet.2011.05.007. Epub 2011 Jun 6.
6
Self-control with spiking and non-spiking neural networks playing games.通过脉冲神经网络和非脉冲神经网络进行游戏时的自我控制。
J Physiol Paris. 2010 May-Sep;104(3-4):108-17. doi: 10.1016/j.jphysparis.2009.11.013. Epub 2009 Nov 26.
7
Multisource Transfer Double DQN Based on Actor Learning.基于演员学习的多源转移双 DQN。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2227-2238. doi: 10.1109/TNNLS.2018.2806087.
8
Self-organizing neural networks integrating domain knowledge and reinforcement learning.自组织神经网络集成领域知识和强化学习。
IEEE Trans Neural Netw Learn Syst. 2015 May;26(5):889-902. doi: 10.1109/TNNLS.2014.2327636.
9
A neural model of hierarchical reinforcement learning.一种分层强化学习的神经模型。
PLoS One. 2017 Jul 6;12(7):e0180234. doi: 10.1371/journal.pone.0180234. eCollection 2017.
10
Ensemble algorithms in reinforcement learning.强化学习中的集成算法。
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):930-6. doi: 10.1109/TSMCB.2008.920231.

引用本文的文献

1
Unsupervised learning of perceptual feature combinations.无监督学习的感知特征组合。
PLoS Comput Biol. 2024 Mar 5;20(3):e1011926. doi: 10.1371/journal.pcbi.1011926. eCollection 2024 Mar.
2
Safe deep reinforcement learning in diesel engine emission control.柴油发动机排放控制中的安全深度强化学习
Proc Inst Mech Eng Part I J Syst Control Eng. 2023 Sep;237(8):1440-1453. doi: 10.1177/09596518231153445. Epub 2023 Feb 17.
3
The dilemma of the symbols: analogies between philosophy, biology and artificial life.符号的困境:哲学、生物学与人工生命之间的类比
Springerplus. 2013 Oct 1;2:495. doi: 10.1186/2193-1801-2-495. eCollection 2013.