重新审视眼球运动学习：基底神经节中的强化学习模型，其中包含对运动动作的传出副本。

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions.

机构信息

Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge MA, USA.

出版信息

Front Neural Circuits. 2012 Jun 27;6:38. doi: 10.3389/fncir.2012.00038. eCollection 2012.

DOI:10.3389/fncir.2012.00038

PMID:22754501

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3385561/

Abstract

In its simplest formulation, reinforcement learning is based on the idea that if an action taken in a particular context is followed by a favorable outcome, then, in the same context, the tendency to produce that action should be strengthened, or reinforced. While reinforcement learning forms the basis of many current theories of basal ganglia (BG) function, these models do not incorporate distinct computational roles for signals that convey context, and those that convey what action an animal takes. Recent experiments in the songbird suggest that vocal-related BG circuitry receives two functionally distinct excitatory inputs. One input is from a cortical region that carries context information about the current "time" in the motor sequence. The other is an efference copy of motor commands from a separate cortical brain region that generates vocal variability during learning. Based on these findings, I propose here a general model of vertebrate BG function that combines context information with a distinct motor efference copy signal. The signals are integrated by a learning rule in which efference copy inputs gate the potentiation of context inputs (but not efference copy inputs) onto medium spiny neurons in response to a rewarded action. The hypothesis is described in terms of a circuit that implements the learning of visually guided saccades. The model makes testable predictions about the anatomical and functional properties of hypothesized context and efference copy inputs to the striatum from both thalamic and cortical sources.

摘要

在其最简单的表述中，强化学习基于这样的理念：如果在特定环境下采取的行动伴随着有利的结果，那么在相同的环境下，产生该行动的倾向应该得到加强或增强。虽然强化学习是许多当前基底神经节（BG）功能理论的基础，但这些模型没有将传达环境信息的信号与传达动物采取什么行动的信号纳入其中。最近在鸣禽中的实验表明，与发声相关的 BG 回路接收两种功能上不同的兴奋性输入。一种输入来自皮质区域，携带关于运动序列当前“时间”的上下文信息。另一种是来自另一个皮质脑区的运动指令的传出副本，该副本在学习过程中产生发声变化。基于这些发现，我在这里提出了一个结合上下文信息和独特的运动传出副本信号的脊椎动物 BG 功能的一般模型。该信号通过学习规则进行整合，其中传出副本输入会在奖励性动作后将门控上下文输入（但不是传出副本输入）增强到中等棘突神经元上。该假设是根据一个电路来描述的，该电路实现了对视觉引导的扫视学习。该模型对来自丘脑和皮质源的假定上下文和传出副本输入到纹状体的解剖学和功能特性做出了可测试的预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b43f/3385561/6914d0e09985/fncir-06-00038-g0001.jpg

相似文献

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions.重新审视眼球运动学习：基底神经节中的强化学习模型，其中包含对运动动作的传出副本。

Front Neural Circuits. 2012 Jun 27;6:38. doi: 10.3389/fncir.2012.00038. eCollection 2012.

What Is the Role of Thalamostriatal Circuits in Learning Vocal Sequences?丘脑纹状体回路在学习声乐序列中的作用是什么？

Front Neural Circuits. 2021 Sep 22;15:724858. doi: 10.3389/fncir.2021.724858. eCollection 2021.

The role of efference copy in striatal learning.纹状体学习中的传出副本的作用。

Curr Opin Neurobiol. 2014 Apr;25:194-200. doi: 10.1016/j.conb.2014.01.012. Epub 2014 Feb 21.

The Avian Basal Ganglia Are a Source of Rapid Behavioral Variation That Enables Vocal Motor Exploration.鸟类基底神经节是快速行为变化的来源，使发声运动探索成为可能。

J Neurosci. 2018 Nov 7;38(45):9635-9647. doi: 10.1523/JNEUROSCI.2915-17.2018. Epub 2018 Sep 24.

An associational model of birdsong sensorimotor learning I. Efference copy and the learning of song syllables.鸟鸣声感觉运动学习的关联模型I. 传出副本与歌曲音节的学习

J Neurophysiol. 2000 Sep;84(3):1204-23. doi: 10.1152/jn.2000.84.3.1204.

Variability in action: Contributions of a songbird cortical-basal ganglia circuit to vocal motor learning and control.行为中的变异性：鸣禽皮质-基底神经节回路对发声运动学习与控制的贡献。

Neuroscience. 2015 Jun 18;296:39-47. doi: 10.1016/j.neuroscience.2014.10.010. Epub 2014 Oct 18.

Cortical processing of saccade-related efference copy signals in patients with cerebellar lesion.小脑病变患者扫视相关传出副本信号的皮质处理。

Eur J Neurosci. 2013 Mar;37(5):804-15. doi: 10.1111/ejn.12081. Epub 2012 Dec 3.

A hypothesis for basal ganglia-dependent reinforcement learning in the songbird.鸣禽基底神经节依赖的强化学习假说。

Neuroscience. 2011 Dec 15;198:152-70. doi: 10.1016/j.neuroscience.2011.09.069. Epub 2011 Oct 13.

An associational model of birdsong sensorimotor learning II. Temporal hierarchies and the learning of song sequence.鸟鸣声感觉运动学习的关联模型II. 时间层次与鸣声序列学习

J Neurophysiol. 2000 Sep;84(3):1224-39. doi: 10.1152/jn.2000.84.3.1224.

Neural systems for control of voluntary action--a hypothesis.用于控制随意动作的神经系统——一种假说。

Adv Biophys. 1998;35:81-102.

引用本文的文献

Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化

Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.

From avoidance to new action: the multifaceted role of the striatal indirect pathway.从回避到新行动：纹状体间接通路的多方面作用。

Nat Rev Neurosci. 2025 May 7. doi: 10.1038/s41583-025-00925-2.

Temporally and Functionally Distinct Contributions to Value Based Choice Along the Anterior-Posterior Dorsomedial Striatal Axis.沿前后背内侧纹状体轴对基于价值的选择在时间和功能上的不同贡献。

bioRxiv. 2025 Mar 15:2025.03.14.643367. doi: 10.1101/2025.03.14.643367.

Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化

bioRxiv. 2024 Dec 24:2024.02.14.580408. doi: 10.1101/2024.02.14.580408.

Developmentally regulated pathways for motor skill learning in songbirds.鸣禽运动技能学习的发育调节途径。

J Comp Neurol. 2022 Jun;530(8):1288-1301. doi: 10.1002/cne.25276. Epub 2021 Dec 14.

Frequency-Specific Effects of Galvanic Vestibular Stimulation on Response-Time Performance in Parkinson's Disease.直流电前庭刺激对帕金森病反应时间表现的频率特异性影响。

Front Neurol. 2021 Nov 2;12:758122. doi: 10.3389/fneur.2021.758122. eCollection 2021.

What Is the Role of Thalamostriatal Circuits in Learning Vocal Sequences?丘脑纹状体回路在学习声乐序列中的作用是什么？

Front Neural Circuits. 2021 Sep 22;15:724858. doi: 10.3389/fncir.2021.724858. eCollection 2021.

Investigation of motor self-monitoring deficits in schizophrenia with passivity experiences using a novel modified joint position matching paradigm.使用一种新颖的改良关节位置匹配范式对伴有被动体验的精神分裂症患者的运动自我监测缺陷进行研究。

Eur Arch Psychiatry Clin Neurosci. 2022 Apr;272(3):509-518. doi: 10.1007/s00406-021-01261-z. Epub 2021 Apr 10.

BOLD differences normally attributed to inhibitory control predict symptoms, not task-directed inhibitory control in ADHD.通常归因于抑制控制的 BOLD 差异可预测 ADHD 症状，而不是任务导向的抑制控制。

J Neurodev Disord. 2020 Feb 21;12(1):8. doi: 10.1186/s11689-020-09311-8.

Thalamostriatal and cerebellothalamic pathways in a songbird, the Bengalese finch.孟加拉雀这种鸣禽中的丘脑纹状体通路和小脑丘脑通路。

J Comp Neurol. 2018 Jun 15;526(9):1550-1570. doi: 10.1002/cne.24428. Epub 2018 Apr 6.

本文引用的文献

Evolution of the basal ganglia: dual-output pathways conserved throughout vertebrate phylogeny.基底神经节的进化：双输出通路在整个脊椎动物系统发育中保守。

J Comp Neurol. 2012 Sep 1;520(13):2957-73. doi: 10.1002/cne.23087.

Neuron-type-specific signals for reward and punishment in the ventral tegmental area.腹侧被盖区中与奖赏和惩罚相关的神经元类型特异性信号。

Nature. 2012 Jan 18;482(7383):85-8. doi: 10.1038/nature10754.

Two distinct modes of forebrain circuit dynamics underlie temporal patterning in the vocalizations of young songbirds.两种不同的大脑前脑回路动态模式是幼鸟鸣叫声时间模式形成的基础。

J Neurosci. 2011 Nov 9;31(45):16353-68. doi: 10.1523/JNEUROSCI.3009-11.2011.

Thalamic contributions to Basal Ganglia-related behavioral switching and reinforcement.丘脑对基底神经节相关行为切换和强化的贡献。

J Neurosci. 2011 Nov 9;31(45):16102-6. doi: 10.1523/JNEUROSCI.4634-11.2011.

A hypothesis for basal ganglia-dependent reinforcement learning in the songbird.鸣禽基底神经节依赖的强化学习假说。

Neuroscience. 2011 Dec 15;198:152-70. doi: 10.1016/j.neuroscience.2011.09.069. Epub 2011 Oct 13.

Dendritic spines and distributed circuits.树突棘和分布式电路。

Neuron. 2011 Sep 8;71(5):772-81. doi: 10.1016/j.neuron.2011.07.024.

Subdivisions of the adult zebrafish subpallium by molecular marker analysis.成年斑马鱼 subpallium 的分子标记分析细分。

J Comp Neurol. 2012 Feb 15;520(3):633-55. doi: 10.1002/cne.22757.

Neural coding of syntactic structure in learned vocalizations in the songbird.鸣禽学习叫声中句法结构的神经编码。

J Neurosci. 2011 Jul 6;31(27):10023-33. doi: 10.1523/JNEUROSCI.1606-11.2011.

Mechanisms and time course of vocal learning and consolidation in the adult songbird.成年鸣禽的发声学习和巩固的机制和时程。

J Neurophysiol. 2011 Oct;106(4):1806-21. doi: 10.1152/jn.00311.2011. Epub 2011 Jul 6.

Synaptically driven state transitions in distal dendrites of striatal spiny neurons.纹状体棘状神经元远端树突中的突触驱动状态转变。

Nat Neurosci. 2011 Jun 12;14(7):881-8. doi: 10.1038/nn.2848.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

重新审视眼球运动学习：基底神经节中的强化学习模型，其中包含对运动动作的传出副本。

Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献