Functional network reorganization in motor cortex can be explained by reward-modulated Hebbian learning.

Author information

Legenstein Robert, Chase Steven M, Schwartz Andrew B, Maass Wolfgang

Affiliations

Institute for Theoretical Computer Science, Graz University of Technology, Austria.

Department of Neurobiology, University of Pittsburgh; Center for the Neural Basis of Cognition, Carnegie Mellon University; Department of Statistics, Carnegie Mellon University.

Publication information

Adv Neural Inf Process Syst. 2009;2009:1105-1113.

PMID:25284966

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4180441/

Abstract

The control of neuroprosthetic devices from the activity of motor cortex neurons benefits from learning effects where the function of these neurons is adapted to the control task. It was recently shown that tuning properties of neurons in monkey motor cortex are adapted selectively in order to compensate for an erroneous interpretation of their activity. In particular, it was shown that the tuning curves of those neurons whose preferred directions had been misinterpreted changed more than those of other neurons. In this article, we show that the experimentally observed self-tuning properties of the system can be explained on the basis of a simple learning rule. This learning rule utilizes neuronal noise for exploration and performs Hebbian weight updates that are modulated by a global reward signal. In contrast to most previously proposed reward-modulated Hebbian learning rules, this rule does not require extraneous knowledge about what is noise and what is signal. The learning rule is able to optimize the performance of the model system within biologically realistic periods of time and under high noise levels. When the neuronal noise is fitted to experimental data, the model produces learning effects similar to those found in monkey experiments.
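
The abstract names the ingredients of the rule (noise-driven exploration, a Hebbian update, a global reward signal, no separate access to the noise) but does not give its equations. The following is a minimal sketch of one rule with those ingredients, not the authors' exact formulation. It assumes that the weight change is the product of the presynaptic input, the deviation of the postsynaptic activity from its own running average, and the deviation of the reward from its running average. The linear network, the slowly rotating input, the squared-error reward, and all names and parameter values (`train_eh_sketch`, `eta`, `noise_std`, `decay`) are hypothetical choices made for illustration.

```python
import numpy as np


def train_eh_sketch(n_steps=20000, n_out=4, eta=0.05,
                    noise_std=0.1, decay=0.8, seed=0):
    """Toy reward-modulated Hebbian learning with noise-driven exploration.

    Returns the Frobenius distance between the learned and the target
    weights before and after training. Everything here is an illustrative
    assumption, not the model from the paper.
    """
    rng = np.random.default_rng(seed)
    w_target = rng.normal(size=(n_out, 2))   # hypothetical "correct" mapping
    w = np.zeros((n_out, 2))                 # weights to be learned
    err_before = np.linalg.norm(w - w_target)

    a_bar = np.zeros(n_out)                  # running average of activity
    r_bar = None                             # running average of reward

    for t in range(n_steps):
        # Slowly rotating 2-D input (assumed), so running averages track it.
        theta = 0.002 * t
        x = np.array([np.cos(theta), np.sin(theta)])

        # Noisy activity: the noise is the only source of exploration.
        a = w @ x + rng.normal(scale=noise_std, size=n_out)

        # Global scalar reward (assumed): negative squared output error.
        reward = -np.sum((a - w_target @ x) ** 2)
        if r_bar is None:
            r_bar = reward

        # Hebbian update gated by reward. The rule only sees the noisy
        # activity and its running mean, never the noise term itself.
        w += eta * (reward - r_bar) * np.outer(a - a_bar, x)

        # Update the running averages after they have been used.
        a_bar = decay * a_bar + (1.0 - decay) * a
        r_bar = decay * r_bar + (1.0 - decay) * reward

    err_after = np.linalg.norm(w - w_target)
    return err_before, err_after


if __name__ == "__main__":
    before, after = train_eh_sketch()
    print(f"weight error before: {before:.3f}, after: {after:.3f}")
```

In this sketch the difference between the current activity and its running average stands in for the exploratory noise, which is one way to avoid requiring explicit knowledge of what is noise and what is signal.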
