一种用于皮质-纹状体可塑性的新框架：行为理论在强化-行动界面与体外数据相遇。

A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.

作者信息

Gurney Kevin N, Humphries Mark D, Redgrave Peter

机构信息

Department of Psychology, Adaptive Behaviour Research Group, University of Sheffield, United Kingdom; INSIGNEO Institute for In Silico Medicine, University of Sheffield, United Kingdom.

Faculty of Life Sciences, University of Manchester, United Kingdom.

出版信息

PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.

DOI:10.1371/journal.pbio.1002034

PMID:25562526

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4285402/

Abstract

Operant learning requires that reinforcement signals interact with action representations at a suitable neural interface. Much evidence suggests that this occurs when phasic dopamine, acting as a reinforcement prediction error, gates plasticity at cortico-striatal synapses, and thereby changes the future likelihood of selecting the action(s) coded by striatal neurons. But this hypothesis faces serious challenges. First, cortico-striatal plasticity is inexplicably complex, depending on spike timing, dopamine level, and dopamine receptor type. Second, there is a credit assignment problem-action selection signals occur long before the consequent dopamine reinforcement signal. Third, the two types of striatal output neuron have apparently opposite effects on action selection. Whether these factors rule out the interface hypothesis and how they interact to produce reinforcement learning is unknown. We present a computational framework that addresses these challenges. We first predict the expected activity changes over an operant task for both types of action-coding striatal neuron, and show they co-operate to promote action selection in learning and compete to promote action suppression in extinction. Separately, we derive a complete model of dopamine and spike-timing dependent cortico-striatal plasticity from in vitro data. We then show this model produces the predicted activity changes necessary for learning and extinction in an operant task, a remarkable convergence of a bottom-up data-driven plasticity model with the top-down behavioural requirements of learning theory. Moreover, we show the complex dependencies of cortico-striatal plasticity are not only sufficient but necessary for learning and extinction. Validating the model, we show it can account for behavioural data describing extinction, renewal, and reacquisition, and replicate in vitro experimental data on cortico-striatal plasticity. By bridging the levels between the single synapse and behaviour, our model shows how striatum acts as the action-reinforcement interface.

摘要

操作性学习要求强化信号在合适的神经接口处与动作表征相互作用。大量证据表明，当作为强化预测误差的相位性多巴胺调节皮质-纹状体突触的可塑性，从而改变选择由纹状体神经元编码的动作的未来可能性时，这种情况就会发生。但这一假设面临着严峻挑战。首先，皮质-纹状体可塑性复杂得令人费解，它取决于尖峰时间、多巴胺水平和多巴胺受体类型。其次，存在一个信用分配问题——动作选择信号在随后的多巴胺强化信号出现之前很久就已出现。第三，两种类型的纹状体输出神经元对动作选择的影响显然相反。这些因素是否排除了接口假设，以及它们如何相互作用以产生强化学习尚不清楚。我们提出了一个解决这些挑战的计算框架。我们首先预测了在操作性任务中两种类型的动作编码纹状体神经元的预期活动变化，并表明它们在学习中协同促进动作选择，而在消退中相互竞争以促进动作抑制。另外，我们从体外数据推导出了一个完整的多巴胺和尖峰时间依赖性皮质-纹状体可塑性模型。然后我们表明，该模型在操作性任务中产生了学习和消退所需的预测活动变化，这是一个自下而上的数据驱动可塑性模型与学习理论的自上而下行为要求的显著融合。此外，我们表明皮质-纹状体可塑性的复杂依赖性不仅对学习和消退是充分的，而且是必要的。通过验证该模型，我们表明它可以解释描述消退、恢复和重新习得的行为数据，并复制关于皮质-纹状体可塑性的体外实验数据。通过在单个突触和行为之间架起桥梁，我们的模型展示了纹状体如何作为动作-强化接口发挥作用。

相似文献

A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.

PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.

Striatal action-learning based on dopamine concentration.

Exp Brain Res. 2010 Jan;200(3-4):307-17. doi: 10.1007/s00221-009-2060-6. Epub 2009 Nov 11.

Opposing patterns of abnormal D1 and D2 receptor dependent cortico-striatal plasticity explain increased risk taking in patients with DYT1 dystonia.

PLoS One. 2020 May 4;15(5):e0226790. doi: 10.1371/journal.pone.0226790. eCollection 2020.

Reinforcement determines the timing dependence of corticostriatal synaptic plasticity in vivo.

Nat Commun. 2017 Aug 24;8(1):334. doi: 10.1038/s41467-017-00394-x.

Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: evidence from fMRI.

Cereb Cortex. 2012 Mar;22(3):527-36. doi: 10.1093/cercor/bhr117. Epub 2011 Jun 21.

Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.

Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.

Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks.

PLoS Comput Biol. 2023 Aug 18;19(8):e1011385. doi: 10.1371/journal.pcbi.1011385. eCollection 2023 Aug.

Dynamics of striatal action selection and reinforcement learning.

Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.

A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine.

Front Neural Circuits. 2019 Jan 7;12:111. doi: 10.3389/fncir.2018.00111. eCollection 2018.

Sensory Reinforced Corticostriatal Plasticity.

Curr Neuropharmacol. 2024;22(9):1513-1527. doi: 10.2174/1570159X21666230801110359.

引用本文的文献

Potential impacts of acupuncture on motor function recovery after ischemic stroke: insights from basic and clinical studies.

Front Cell Neurosci. 2025 Aug 13;19:1623535. doi: 10.3389/fncel.2025.1623535. eCollection 2025.

Mechanisms and interventions promoting healthy frontostriatal dynamics in obsessive-compulsive disorder.

Nat Commun. 2025 Aug 11;16(1):7400. doi: 10.1038/s41467-025-62190-2.

Dynamics of striatal action selection and reinforcement learning.

Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.

The Computational Bottleneck of Basal Ganglia Output (and What to Do About it).

eNeuro. 2025 Apr 24;12(4). doi: 10.1523/ENEURO.0431-23.2024. Print 2025 Apr.

Reward expectation and receipt differentially modulate the spiking of accumbens D1+ and D2+ neurons.

Curr Biol. 2025 Mar 24;35(6):1285-1297.e3. doi: 10.1016/j.cub.2025.02.007. Epub 2025 Feb 27.

An opponent striatal circuit for distributional reinforcement learning.

Nature. 2025 Mar;639(8055):717-726. doi: 10.1038/s41586-024-08488-5. Epub 2025 Feb 19.

CBGTPy: An extensible cortico-basal ganglia-thalamic framework for modeling biological decision making.

PLoS One. 2025 Jan 14;20(1):e0310367. doi: 10.1371/journal.pone.0310367. eCollection 2025.

An ultra low frequency spike timing dependent plasticity based approach for reducing alcohol drinking.

Sci Rep. 2024 Dec 28;14(1):30907. doi: 10.1038/s41598-024-81390-2.

D2 dopamine receptor expression, reactivity to rewards, and reinforcement learning in a complex value-based decision-making task.

Soc Cogn Affect Neurosci. 2024 Jul 26;19(1). doi: 10.1093/scan/nsae050.

Distinct dopaminergic spike-timing-dependent plasticity rules are suited to different functional roles.

bioRxiv. 2024 Oct 4:2024.06.24.600372. doi: 10.1101/2024.06.24.600372.

本文引用的文献

The acquisition of goal-directed actions generates opposing plasticity in direct and indirect pathways in dorsomedial striatum.

J Neurosci. 2014 Jul 9;34(28):9196-201. doi: 10.1523/JNEUROSCI.0313-14.2014.

Balanced activity in basal ganglia projection pathways is critical for contraversive movements.

Nat Commun. 2014 Jul 8;5:4315. doi: 10.1038/ncomms5315.

Differential entrainment and learning-related dynamics of spike and local field potential activity in the sensorimotor and associative striatum.

J Neurosci. 2014 Feb 19;34(8):2845-59. doi: 10.1523/JNEUROSCI.1782-13.2014.

Dopamine prediction error responses integrate subjective value from different reward dimensions.

Proc Natl Acad Sci U S A. 2014 Feb 11;111(6):2343-8. doi: 10.1073/pnas.1321596111. Epub 2014 Jan 22.

J Neurosci. 2014 Jan 15;34(3):817-22. doi: 10.1523/JNEUROSCI.1703-13.2014.

Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term.

J Neurosci. 2014 Jan 15;34(3):698-704. doi: 10.1523/JNEUROSCI.2489-13.2014.

Control of basal ganglia output by direct and indirect pathway projection neurons.

J Neurosci. 2013 Nov 20;33(47):18531-9. doi: 10.1523/JNEUROSCI.1278-13.2013.

Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions.

Nat Commun. 2013;4:2264. doi: 10.1038/ncomms3264.

Dynamic modulation of spike timing-dependent calcium influx during corticostriatal upstates.

J Neurophysiol. 2013 Oct;110(7):1631-45. doi: 10.1152/jn.00232.2013. Epub 2013 Jul 10.

GABAergic circuits control spike-timing-dependent plasticity.

J Neurosci. 2013 May 29;33(22):9353-63. doi: 10.1523/JNEUROSCI.5796-12.2013.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于皮质-纹状体可塑性的新框架：行为理论在强化-行动界面与体外数据相遇。

A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.

作者信息

Gurney Kevin N, Humphries Mark D, Redgrave Peter

机构信息

Department of Psychology, Adaptive Behaviour Research Group, University of Sheffield, United Kingdom; INSIGNEO Institute for In Silico Medicine, University of Sheffield, United Kingdom.

Faculty of Life Sciences, University of Manchester, United Kingdom.

出版信息

PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.

DOI:10.1371/journal.pbio.1002034

PMID:25562526

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4285402/

Abstract

摘要

一种用于皮质-纹状体可塑性的新框架：行为理论在强化-行动界面与体外数据相遇。

A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

一种用于皮质-纹状体可塑性的新框架：行为理论在强化-行动界面与体外数据相遇。

A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献