• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

纹状体动作选择与强化学习的动态变化

Dynamics of striatal action selection and reinforcement learning.

作者信息

Lindsey Jack W, Markowitz Jeffrey, Gillis Winthrop F, Datta Sandeep R, Litwin-Kumar Ashok

机构信息

Kavli Institute for Brain Science, Columbia University, New York, United States.

Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, United States.

出版信息

Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.

DOI:10.7554/eLife.101747
PMID:40338017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12061475/
Abstract

Spiny projection neurons (SPNs) in dorsal striatum are often proposed as a locus of reinforcement learning in the basal ganglia. Here, we identify and resolve a fundamental inconsistency between striatal reinforcement learning models and known SPN synaptic plasticity rules. Direct-pathway (dSPN) and indirect-pathway (iSPN) neurons, which promote and suppress actions, respectively, exhibit synaptic plasticity that reinforces activity associated with elevated or suppressed dopamine release. We show that iSPN plasticity prevents successful learning, as it reinforces activity patterns associated with negative outcomes. However, this pathological behavior is reversed if functionally opponent dSPNs and iSPNs, which promote and suppress the current behavior, are simultaneously activated by efferent input following action selection. This prediction is supported by striatal recordings and contrasts with prior models of SPN representations. In our model, learning and action selection signals can be multiplexed without interference, enabling learning algorithms beyond those of standard temporal difference models.

摘要

背侧纹状体中的棘状投射神经元(SPN)常被认为是基底神经节中强化学习的位点。在此,我们识别并解决了纹状体强化学习模型与已知的SPN突触可塑性规则之间的一个基本矛盾。分别促进和抑制动作的直接通路(dSPN)和间接通路(iSPN)神经元,表现出强化与多巴胺释放升高或抑制相关活动的突触可塑性。我们表明,iSPN可塑性会阻止成功学习,因为它强化了与负面结果相关的活动模式。然而,如果在动作选择后,促进和抑制当前行为的功能上相对的dSPN和iSPN被传出输入同时激活,这种病理行为就会被逆转。这一预测得到了纹状体记录的支持,并且与先前的SPN表征模型形成对比。在我们的模型中,学习信号和动作选择信号可以无干扰地复用,从而实现超越标准时间差分模型的学习算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/383356f30b0e/elife-101747-fig7-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/8c0676691e86/elife-101747-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/a4293b54b499/elife-101747-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/290d48c83e8b/elife-101747-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/08d80023d5bd/elife-101747-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/b3426a4d140e/elife-101747-fig3-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/b334233281ab/elife-101747-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/05ab0e1c4774/elife-101747-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/f33b66ef622c/elife-101747-fig5-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/c5e3bdb9ba96/elife-101747-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/a97df2268f1e/elife-101747-fig6-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/e78f122c6f72/elife-101747-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/383356f30b0e/elife-101747-fig7-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/8c0676691e86/elife-101747-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/a4293b54b499/elife-101747-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/290d48c83e8b/elife-101747-fig2-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/08d80023d5bd/elife-101747-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/b3426a4d140e/elife-101747-fig3-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/b334233281ab/elife-101747-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/05ab0e1c4774/elife-101747-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/f33b66ef622c/elife-101747-fig5-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/c5e3bdb9ba96/elife-101747-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/a97df2268f1e/elife-101747-fig6-figsupp1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/e78f122c6f72/elife-101747-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6302/12061475/383356f30b0e/elife-101747-fig7-figsupp1.jpg

相似文献

1
Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化
Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.
2
Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化
bioRxiv. 2024 Dec 24:2024.02.14.580408. doi: 10.1101/2024.02.14.580408.
3
Dichotomous regulation of striatal plasticity by dynorphin.强啡肽对纹状体可塑性的二态性调节。
Mol Psychiatry. 2023 Jan;28(1):434-447. doi: 10.1038/s41380-022-01885-0. Epub 2022 Dec 2.
4
Long-term plasticity of corticostriatal synapses is modulated by pathway-specific co-release of opioids through κ-opioid receptors.通过 κ 型阿片受体,皮质纹状体突触的长时程可塑性受到特定通路中外源阿片共同释放的调节。
J Physiol. 2017 Aug 15;595(16):5637-5652. doi: 10.1113/JP274190. Epub 2017 May 26.
5
A Population of Indirect Pathway Striatal Projection Neurons Is Selectively Entrained to Parkinsonian Beta Oscillations.一群间接通路纹状体投射神经元被选择性地锁定到帕金森病的β振荡。
J Neurosci. 2017 Oct 11;37(41):9977-9998. doi: 10.1523/JNEUROSCI.0658-17.2017. Epub 2017 Aug 28.
6
Striatal direct and indirect pathway neurons differentially control the encoding and updating of goal-directed learning.纹状体直接和间接通路神经元对目标导向学习的编码和更新有不同的控制作用。
Elife. 2020 Nov 20;9:e58544. doi: 10.7554/eLife.58544.
7
A new framework for cortico-striatal plasticity: behavioural theory meets in vitro data at the reinforcement-action interface.一种用于皮质-纹状体可塑性的新框架:行为理论在强化-行动界面与体外数据相遇。
PLoS Biol. 2015 Jan 6;13(1):e1002034. doi: 10.1371/journal.pbio.1002034. eCollection 2015 Jan.
8
Differential Development of Dendritic Spines in Striatal Projection Neurons of Direct and Indirect Pathways in the Caudoputamen and Nucleus Accumbens.尾壳核和伏隔核中直接和间接通路纹状体投射神经元树突棘的差异发育。
eNeuro. 2023 Jun 12;10(6). doi: 10.1523/ENEURO.0366-22.2023. Print 2023 Jun.
9
Comparison of unitary synaptic currents generated by indirect and direct pathway neurons of the mouse striatum.比较小鼠纹状体间接和直接通路神经元产生的单元突触电流。
J Neurophysiol. 2024 May 1;131(5):914-936. doi: 10.1152/jn.00066.2024. Epub 2024 Apr 10.
10
Striatal action-learning based on dopamine concentration.基于多巴胺浓度的纹状体动作学习。
Exp Brain Res. 2010 Jan;200(3-4):307-17. doi: 10.1007/s00221-009-2060-6. Epub 2009 Nov 11.

引用本文的文献

1
The role of motor cortex in motor sequence execution depends on demands for flexibility.运动皮层在运动序列执行中的作用取决于对灵活性的需求。
Nat Neurosci. 2024 Dec;27(12):2466-2475. doi: 10.1038/s41593-024-01792-3. Epub 2024 Nov 4.
2
Refinement of efficient encodings of movement in the dorsolateral striatum throughout learning.在整个学习过程中,背外侧纹状体中运动有效编码的优化。
bioRxiv. 2024 Jun 6:2024.06.06.596654. doi: 10.1101/2024.06.06.596654.

本文引用的文献

1
Dissociating the contributions of sensorimotor striatum to automatic and visually guided motor sequences.分离感觉运动纹状体对自动和视觉引导运动序列的贡献。
Nat Neurosci. 2023 Oct;26(10):1791-1804. doi: 10.1038/s41593-023-01431-3. Epub 2023 Sep 4.
2
The respective activation and silencing of striatal direct and indirect pathway neurons support behavior encoding.纹状体直接和间接通路神经元的分别激活和沉默支持行为编码。
Nat Commun. 2023 Aug 17;14(1):4982. doi: 10.1038/s41467-023-40677-0.
3
Feasibility of dopamine as a vector-valued feedback signal in the basal ganglia.
多巴胺作为基底神经节中向量值反馈信号的可行性。
Proc Natl Acad Sci U S A. 2023 Aug 8;120(32):e2221994120. doi: 10.1073/pnas.2221994120. Epub 2023 Aug 1.
4
On the normative advantages of dopamine and striatal opponency for learning and choice.多巴胺和纹状体拮抗作用在学习和选择中的规范优势。
Elife. 2023 Mar 22;12:e85107. doi: 10.7554/eLife.85107.
5
Spontaneous behaviour is structured by reinforcement without explicit reward.自发行为是由强化而不是明确的奖励来结构化的。
Nature. 2023 Feb;614(7946):108-117. doi: 10.1038/s41586-022-05611-2. Epub 2023 Jan 18.
6
Action suppression reveals opponent parallel control via striatal circuits.动作抑制通过纹状体回路揭示对手的并行控制。
Nature. 2022 Jul;607(7919):521-526. doi: 10.1038/s41586-022-04894-9. Epub 2022 Jul 6.
7
A calcium-based plasticity model for predicting long-term potentiation and depression in the neocortex.一种基于钙的可塑性模型,用于预测新皮层中的长时程增强和长时程压抑。
Nat Commun. 2022 Jun 1;13(1):3038. doi: 10.1038/s41467-022-30214-w.
8
Striatal indirect pathway mediates exploration via collicular competition.纹状体间接通路通过视丘下核竞争介导探索。
Nature. 2021 Nov;599(7886):645-649. doi: 10.1038/s41586-021-04055-4. Epub 2021 Nov 3.
9
The basal ganglia control the detailed kinematics of learned motor skills.基底神经节控制着习得运动技能的详细运动学。
Nat Neurosci. 2021 Sep;24(9):1256-1269. doi: 10.1038/s41593-021-00889-3. Epub 2021 Jul 15.
10
Striatal activity topographically reflects cortical activity.纹状体活动在地形上反映了皮质活动。
Nature. 2021 Mar;591(7850):420-425. doi: 10.1038/s41586-020-03166-8. Epub 2021 Jan 20.