基于非均匀伽柏空间采样、无监督生长网络和强化学习的认知导航。

Cognitive navigation based on nonuniform Gabor space sampling, unsupervised growing networks, and reinforcement learning.

作者信息

Arleo Angelo, Smeraldi Fabrizio, Gerstner Wulfram

机构信息

Neuroscience Group, SONY Computer Science Laboratory, 75005 Paris, France.

出版信息

IEEE Trans Neural Netw. 2004 May;15(3):639-52. doi: 10.1109/TNN.2004.826221.

DOI:10.1109/TNN.2004.826221

PMID:15384552

Abstract

We study spatial learning and navigation for autonomous agents. A state space representation is constructed by unsupervised Hebbian learning during exploration. As a result of learning, a representation of the continuous two-dimensional (2-D) manifold in the high-dimensional input space is found. The representation consists of a population of localized overlapping place fields covering the 2-D space densely and uniformly. This space coding is comparable to the representation provided by hippocampal place cells in rats. Place fields are learned by extracting spatio-temporal properties of the environment from sensory inputs. The visual scene is modeled using the responses of modified Gabor filters placed at the nodes of a sparse Log-polar graph. Visual sensory aliasing is eliminated by taking into account self-motion signals via path integration. This solves the hidden state problem and provides a suitable representation for applying reinforcement learning in continuous space for action selection. A temporal-difference prediction scheme is used to learn sensorimotor mappings to perform goal-oriented navigation. Population vector coding is employed to interpret ensemble neural activity. The model is validated on a mobile Khepera miniature robot.

摘要

我们研究自主智能体的空间学习与导航。在探索过程中，通过无监督赫布学习构建状态空间表示。学习的结果是在高维输入空间中找到连续二维（2-D）流形的一种表示。该表示由一群局部重叠的位置场组成，这些位置场密集且均匀地覆盖二维空间。这种空间编码类似于大鼠海马体位置细胞所提供的表示。通过从感官输入中提取环境的时空特性来学习位置场。使用放置在稀疏对数极坐标图节点处的改进型伽柏滤波器的响应来对视觉场景进行建模。通过路径积分考虑自身运动信号来消除视觉感官混叠。这解决了隐藏状态问题，并为在连续空间中应用强化学习进行动作选择提供了合适的表示。使用时间差分预测方案来学习感觉运动映射以执行目标导向导航。采用群体向量编码来解释群体神经活动。该模型在移动的Khepera微型机器人上得到验证。

相似文献

Cognitive navigation based on nonuniform Gabor space sampling, unsupervised growing networks, and reinforcement learning.

IEEE Trans Neural Netw. 2004 May;15(3):639-52. doi: 10.1109/TNN.2004.826221.

Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity.

Biol Cybern. 2000 Sep;83(3):287-99. doi: 10.1007/s004220000171.

Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback.

IEEE Trans Neural Netw. 2008 Feb;19(2):230-44. doi: 10.1109/TNN.2007.905839.

Robust self-localisation and navigation based on hippocampal place cells.

Neural Netw. 2005 Nov;18(9):1125-40. doi: 10.1016/j.neunet.2005.08.012. Epub 2005 Nov 2.

Goal-oriented robot navigation learning using a multi-scale space representation.

Neural Netw. 2015 Dec;72:62-74. doi: 10.1016/j.neunet.2015.09.006. Epub 2015 Oct 19.

Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning.

Sensors (Basel). 2019 Apr 1;19(7):1576. doi: 10.3390/s19071576.

Goal-directed learning of features and forward models.

Neural Netw. 2009 Jul-Aug;22(5-6):586-92. doi: 10.1016/j.neunet.2009.06.049. Epub 2009 Jul 8.

SOVEREIGN: An autonomous neural system for incrementally learning planned action sequences to navigate towards a rewarded goal.

Neural Netw. 2008 Jun;21(5):699-758. doi: 10.1016/j.neunet.2007.09.016. Epub 2007 Oct 7.

Self-correction mechanism for path integration in a modular navigation system on the basis of an egocentric spatial map.

Neural Netw. 2003 Nov;16(9):1373-88. doi: 10.1016/j.neunet.2003.08.004.

A model of hippocampally dependent navigation, using the temporal difference learning rule.

Hippocampus. 2000;10(1):1-16. doi: 10.1002/(SICI)1098-1063(2000)10:1<1::AID-HIPO1>3.0.CO;2-1.

引用本文的文献

Action of the Euclidean versus projective group on an agent's internal space in curiosity driven exploration.

Biol Cybern. 2025 Jan 17;119(1):4. doi: 10.1007/s00422-024-01001-1.

Neurorobotics-A Thriving Community and a Promising Pathway Toward Intelligent Cognitive Robots.

Front Neurorobot. 2018 Jul 16;12:42. doi: 10.3389/fnbot.2018.00042. eCollection 2018.

An Energy Model of Place Cell Network in Three Dimensional Space.

Front Neurosci. 2018 Apr 25;12:264. doi: 10.3389/fnins.2018.00264. eCollection 2018.

Locating and navigation mechanism based on place-cell and grid-cell models.

Cogn Neurodyn. 2016 Aug;10(4):353-60. doi: 10.1007/s11571-016-9384-2. Epub 2016 Mar 26.

Neuromodulated Spike-Timing-Dependent Plasticity, and Theory of Three-Factor Learning Rules.

Front Neural Circuits. 2016 Jan 19;9:85. doi: 10.3389/fncir.2015.00085. eCollection 2015.

A neurorobotic platform to test the influence of neuromodulatory signaling on anxious and curious behavior.

Front Neurorobot. 2013 Feb 5;7:1. doi: 10.3389/fnbot.2013.00001. eCollection 2013.

Contribution of cerebellar sensorimotor adaptation to hippocampal spatial memory.

PLoS One. 2012;7(4):e32560. doi: 10.1371/journal.pone.0032560. Epub 2012 Apr 2.

Spatial learning and action planning in a prefrontal cortical network model.

PLoS Comput Biol. 2011 May;7(5):e1002045. doi: 10.1371/journal.pcbi.1002045. Epub 2011 May 19.

Unsupervised learning of reflexive and action-based affordances to model adaptive navigational behavior.

Front Neurorobot. 2010 May 12;4:2. doi: 10.3389/fnbot.2010.00002. eCollection 2010.

Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning.

J Comput Neurosci. 2008 Dec;25(3):562-82. doi: 10.1007/s10827-008-0094-6. Epub 2008 Apr 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于非均匀伽柏空间采样、无监督生长网络和强化学习的认知导航。

Cognitive navigation based on nonuniform Gabor space sampling, unsupervised growing networks, and reinforcement learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献