Department of Neurobiology, Weizmann Institute of Science, Rehovot, 76100, Israel.
Neural Netw. 2012 Aug;32:119-29. doi: 10.1016/j.neunet.2012.02.024. Epub 2012 Feb 14.
A curious agent acts so as to optimize its learning about itself and its environment, without external supervision. We present a model of hierarchical curiosity loops for such an autonomous active learning agent, whereby each loop selects the optimal action that maximizes the agent's learning of sensory-motor correlations. The model is based on rewarding the learner's prediction errors in an actor-critic reinforcement learning (RL) paradigm. Hierarchy is achieved by utilizing a previously learned motor-sensory mapping, which enables the learning of other mappings, thus increasing the extent and diversity of knowledge and skills. We demonstrate the relevance of this architecture to active sensing using the well-studied vibrissae (whiskers) system, where rodents acquire sensory information through repeated whisker movements. We show that hierarchical curiosity loops, starting with optimal learning of the internal models of whisker motion and then extending to object localization, result in free-air whisking and object palpation, respectively.
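The core mechanism described above, using the learner's prediction error as the reward signal inside an actor-critic loop, can be illustrated with a minimal sketch. The discrete motor-command space, the hidden motor-to-sensory mapping, the noise level, the learning rates, and the tabular single-state actor-critic below are illustrative assumptions for a single curiosity loop, not the paper's implementation.

```python
# Minimal sketch of one curiosity loop, assuming a discrete motor command space
# and a noisy scalar sensory outcome. All quantities here are illustrative.
import numpy as np

rng = np.random.default_rng(0)

n_actions = 5                                          # discrete motor commands
true_map = np.sin(np.linspace(0, np.pi, n_actions))    # hidden motor->sensory mapping
forward_model = np.zeros(n_actions)                    # learner: predicted outcome per action
preferences = np.zeros(n_actions)                      # actor: softmax action preferences
value = 0.0                                            # critic: expected intrinsic reward

alpha_model, alpha_actor, alpha_critic = 0.2, 0.5, 0.1

for step in range(2000):
    # Actor: sample a motor command from a softmax over preferences.
    policy = np.exp(preferences - preferences.max())
    policy /= policy.sum()
    a = rng.choice(n_actions, p=policy)

    # Environment: noisy sensory consequence of the motor command.
    sensation = true_map[a] + 0.05 * rng.standard_normal()

    # Learner: intrinsic reward is the forward model's prediction error.
    error = sensation - forward_model[a]
    reward = abs(error)
    forward_model[a] += alpha_model * error

    # Critic and actor: TD-style updates driven by the intrinsic reward.
    td_error = reward - value
    value += alpha_critic * td_error
    preferences[a] += alpha_actor * td_error

print("learned mapping:", np.round(forward_model, 3))
print("final policy   :", np.round(policy, 3))
```

In this sketch, actions whose sensory consequences are still poorly predicted yield higher intrinsic reward, so the policy concentrates sampling on them until the forward model converges and the reward fades. The hierarchy described in the abstract stacks such loops, with each learned mapping enabling the learning of the next.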