Suppr超能文献

自组织映射在强化学习中的应用。

Applications of the self-organising map to reinforcement learning.

作者信息

Smith Andrew James

机构信息

The Division of Informatics, Institute for Adaptive and Neural Computation, University of Edinburgh, UK.

出版信息

Neural Netw. 2002 Oct-Nov;15(8-9):1107-24. doi: 10.1016/s0893-6080(02)00083-7.

Abstract

This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 1987] which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed.

摘要

本文关注强化学习(RL)问题中连续动作空间的表示与泛化。基于科霍宁的自组织映射(SOM)[《自组织与联想记忆》,1987年]提出了一个模型,该模型能够捕捉期望状态-动作映射的一对一、多对一或一对多结构。尽管这里是针对涉及即时奖励的任务提出的,但该方法可轻松扩展到延迟奖励。我们得出结论,在状态和动作空间的潜在维度较小的RL问题中,SOM是提供实时在线泛化的有用工具。还讨论了可扩展性问题。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验