Laboratory of Cognitive Neuroscience, School of Life Sciences, Brain Mind Institute, Ecole Polytechnique Federale de Lausanne Lausanne, Vaud, Switzerland ; Laboratory of Computational Neuroscience, School of Computer and Communication Sciences, Ecole Polytechnique Federale de Lausanne Lausanne, Vaud, Switzerland.
Front Comput Neurosci. 2014 Apr 4;8:38. doi: 10.3389/fncom.2014.00038. eCollection 2014.
The ability to learn and perform statistical inference with biologically plausible recurrent networks of spiking neurons is an important step toward understanding perception and reasoning. Here we derive and investigate a new learning rule for recurrent spiking networks with hidden neurons, combining principles from variational learning and reinforcement learning. Our network defines a generative model over spike train histories and the derived learning rule has the form of a local Spike Timing Dependent Plasticity rule modulated by global factors (neuromodulators) conveying information about "novelty" on a statistically rigorous ground. Simulations show that our model is able to learn both stationary and non-stationary patterns of spike trains. We also propose one experiment that could potentially be performed with animals in order to test the dynamics of the predicted novelty signal.
能够使用具有生物合理性的尖峰神经元递归网络学习和执行统计推断,是理解感知和推理的重要步骤。在这里,我们为具有隐藏神经元的递归尖峰网络推导出并研究了一种新的学习规则,该规则结合了变分学习和强化学习的原理。我们的网络定义了尖峰序列历史的生成模型,而推导的学习规则具有局部尖峰时间相关可塑性规则的形式,该规则由全局因素(神经调质)调制,这些全局因素在统计上严格地传递有关“新颖性”的信息。模拟表明,我们的模型能够学习尖峰序列的静态和非静态模式。我们还提出了一个可能在动物身上进行的实验,以测试预测新奇信号的动力学。