基于多个远距离奖励的突触稳定强化的尖峰神经网络模型。

A spiking neural model for stable reinforcement of synapses based on multiple distal rewards.

机构信息

Department of Mathematics, University of California at Los Angeles, Los Angeles, CA 90095, USA.

出版信息

Neural Comput. 2013 Jan;25(1):123-56. doi: 10.1162/NECO_a_00387. Epub 2012 Sep 28.

PMID:23020112

Abstract

In this letter, a novel critic-like algorithm was developed to extend the synaptic plasticity rule described in Florian (2007) and Izhikevich (2007) in order to solve the problem of learning multiple distal rewards simultaneously. The system is augmented with short-term plasticity (STP) to stabilize the learning dynamics, thereby increasing the system's learning capacity. A theoretical threshold is estimated for the number of distal rewards that this system can learn. The validity of the novel algorithm was verified by computer simulations.

摘要

在这封信中，开发了一种新颖的评论家样算法，以扩展 Florian（2007）和 Izhikevich（2007）中描述的突触可塑性规则，从而解决同时学习多个远距离奖励的问题。该系统增加了短期可塑性（STP）以稳定学习动态，从而提高了系统的学习能力。估计了该系统可以学习的远距离奖励数量的理论阈值。通过计算机模拟验证了新算法的有效性。

相似文献

A spiking neural model for stable reinforcement of synapses based on multiple distal rewards.

Neural Comput. 2013 Jan;25(1):123-56. doi: 10.1162/NECO_a_00387. Epub 2012 Sep 28.

Solving the distal reward problem with rare correlations.

Neural Comput. 2013 Apr;25(4):940-78. doi: 10.1162/NECO_a_00419. Epub 2013 Jan 22.

Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity.

Neural Comput. 2007 Jun;19(6):1468-502. doi: 10.1162/neco.2007.19.6.1468.

Reinforcement learning, spike-time-dependent plasticity, and the BCM rule.

Neural Comput. 2007 Aug;19(8):2245-79. doi: 10.1162/neco.2007.19.8.2245.

Neuron as a reward-modulated combinatorial switch and a model of learning behavior.

Neural Netw. 2013 Oct;46:62-74. doi: 10.1016/j.neunet.2013.04.010. Epub 2013 May 6.

Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses.

Neural Comput. 2019 Dec;31(12):2368-2389. doi: 10.1162/neco_a_01238. Epub 2019 Oct 15.

A spiking neural network model of an actor-critic learning agent.

Neural Comput. 2009 Feb;21(2):301-39. doi: 10.1162/neco.2008.08-07-593.

Learning in neural networks by reinforcement of irregular spiking.

Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Apr;69(4 Pt 1):041909. doi: 10.1103/PhysRevE.69.041909. Epub 2004 Apr 30.

Learning by the dendritic prediction of somatic spiking.

Neuron. 2014 Feb 5;81(3):521-8. doi: 10.1016/j.neuron.2013.11.030.

Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity.

Proc Natl Acad Sci U S A. 2006 Oct 10;103(41):15224-9. doi: 10.1073/pnas.0505220103. Epub 2006 Sep 28.

引用本文的文献

An Efficient Supervised Training Algorithm for Multilayer Spiking Neural Networks.

PLoS One. 2016 Apr 4;11(4):e0150329. doi: 10.1371/journal.pone.0150329. eCollection 2016.

A novel analytical characterization for short-term plasticity parameters in spiking neural networks.

Front Comput Neurosci. 2014 Nov 19;8:148. doi: 10.3389/fncom.2014.00148. eCollection 2014.

Using a hybrid neuron in physiologically inspired models of the basal ganglia.

Front Comput Neurosci. 2013 Jul 5;7:88. doi: 10.3389/fncom.2013.00088. eCollection 2013.

Learning and prospective recall of noisy spike pattern episodes.

Front Comput Neurosci. 2013 Jun 21;7:80. doi: 10.3389/fncom.2013.00080. eCollection 2013.

Reward-based learning for virtual neurorobotics through emotional speech processing.

Front Neurorobot. 2013 Apr 29;7:8. doi: 10.3389/fnbot.2013.00008. eCollection 2013.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于多个远距离奖励的突触稳定强化的尖峰神经网络模型。

A spiking neural model for stable reinforcement of synapses based on multiple distal rewards.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献