Legenstein Robert, Chase Steven M, Schwartz Andrew B, Maass Wolfgang
Institute for Theoretical Computer Science, Graz University of Technology, Austria.
Department of Neurobiology, University of Pittsburgh ; Center for the Neural Basis of Cognition, Carnegie Mellon University ; Department of Statistics, Carnegie Mellon University.
Adv Neural Inf Process Syst. 2009;2009:1105-1113.
The control of neuroprosthetic devices from the activity of motor cortex neurons benefits from learning effects where the function of these neurons is adapted to the control task. It was recently shown that tuning properties of neurons in monkey motor cortex are adapted selectively in order to compensate for an erroneous interpretation of their activity. In particular, it was shown that the tuning curves of those neurons whose preferred directions had been misinterpreted changed more than those of other neurons. In this article, we show that the experimentally observed self-tuning properties of the system can be explained on the basis of a simple learning rule. This learning rule utilizes neuronal noise for exploration and performs Hebbian weight updates that are modulated by a global reward signal. In contrast to most previously proposed reward-modulated Hebbian learning rules, this rule does not require extraneous knowledge about what is noise and what is signal. The learning rule is able to optimize the performance of the model system within biologically realistic periods of time and under high noise levels. When the neuronal noise is fitted to experimental data, the model produces learning effects similar to those found in monkey experiments.
通过运动皮层神经元的活动来控制神经假体设备受益于学习效应,即这些神经元的功能会根据控制任务进行调整。最近的研究表明,猴子运动皮层中神经元的调谐特性会被选择性地调整,以补偿对其活动的错误解读。具体而言,研究发现那些偏好方向被误判的神经元的调谐曲线比其他神经元的变化更大。在本文中,我们表明,基于一个简单的学习规则可以解释该系统实验观察到的自调谐特性。这个学习规则利用神经元噪声进行探索,并执行由全局奖励信号调制的赫布权重更新。与大多数先前提出的奖励调制赫布学习规则不同,该规则不需要关于什么是噪声和什么是信号的外部知识。该学习规则能够在生物学上现实的时间周期内和高噪声水平下优化模型系统的性能。当将神经元噪声拟合到实验数据时,该模型产生的学习效应与猴子实验中发现的效应相似。