• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基底神经节的 Actor-评论家模型:新的解剖学和计算视角。

Actor-critic models of the basal ganglia: new anatomical and computational perspectives.

作者信息

Joel Daphna, Niv Yael, Ruppin Eytan

机构信息

Department of Psychology, Tel-Aviv University, Ramat-Aviv, Israel.

出版信息

Neural Netw. 2002 Jun-Jul;15(4-6):535-47. doi: 10.1016/s0893-6080(02)00047-3.

DOI:10.1016/s0893-6080(02)00047-3
PMID:12371510
Abstract

A large number of computational models of information processing in the basal ganglia have been developed in recent years. Prominent in these are actor-critic models of basal ganglia functioning, which build on the strong resemblance between dopamine neuron activity and the temporal difference prediction error signal in the critic, and between dopamine-dependent long-term synaptic plasticity in the striatum and learning guided by a prediction error signal in the actor. We selectively review several actor-critic models of the basal ganglia with an emphasis on two important aspects: the way in which models of the critic reproduce the temporal dynamics of dopamine firing, and the extent to which models of the actor take into account known basal ganglia anatomy and physiology. To complement the efforts to relate basal ganglia mechanisms to reinforcement learning (RL), we introduce an alternative approach to modeling a critic network, which uses Evolutionary Computation techniques to 'evolve' an optimal RL mechanism, and relate the evolved mechanism to the basic model of the critic. We conclude our discussion of models of the critic by a critical discussion of the anatomical plausibility of implementations of a critic in basal ganglia circuitry, and conclude that such implementations build on assumptions that are inconsistent with the known anatomy of the basal ganglia. We return to the actor component of the actor-critic model, which is usually modeled at the striatal level with very little detail. We describe an alternative model of the basal ganglia which takes into account several important, and previously neglected, anatomical and physiological characteristics of basal ganglia-thalamocortical connectivity and suggests that the basal ganglia performs reinforcement-biased dimensionality reduction of cortical inputs. We further suggest that since such selective encoding may bias the representation at the level of the frontal cortex towards the selection of rewarded plans and actions, the reinforcement-driven dimensionality reduction framework may serve as a basis for basal ganglia actor models. We conclude with a short discussion of the dual role of the dopamine signal in RL and in behavioral switching.

摘要

近年来,已经开发了大量关于基底神经节信息处理的计算模型。其中突出的是基底神经节功能的行动者-评判者模型,该模型基于多巴胺神经元活动与评判者中的时间差预测误差信号之间的强烈相似性,以及纹状体中多巴胺依赖的长期突触可塑性与行动者中由预测误差信号引导的学习之间的相似性。我们选择性地回顾了几个基底神经节的行动者-评判者模型,重点关注两个重要方面:评判者模型再现多巴胺发放时间动态的方式,以及行动者模型考虑已知基底神经节解剖学和生理学的程度。为了补充将基底神经节机制与强化学习(RL)联系起来的努力,我们引入了一种对评判网络进行建模的替代方法,该方法使用进化计算技术来“进化”出一种最优的RL机制,并将进化出的机制与评判者的基本模型联系起来。我们通过对基底神经节电路中评判者实现的解剖学合理性进行批判性讨论来结束对评判者模型的讨论,并得出结论,这种实现基于与基底神经节已知解剖学不一致的假设。我们回到行动者-评判者模型的行动者部分,该部分通常在纹状体水平上建模,细节很少。我们描述了一种基底神经节的替代模型,该模型考虑了基底神经节-丘脑皮质连接的几个重要且先前被忽视的解剖学和生理学特征,并表明基底神经节对皮质输入进行强化偏向的降维。我们进一步表明,由于这种选择性编码可能会使额叶皮质水平的表征偏向于选择有奖励的计划和行动,强化驱动的降维框架可能作为基底神经节行动者模型的基础。我们最后简要讨论了多巴胺信号在RL和行为切换中的双重作用。

相似文献

1
Actor-critic models of the basal ganglia: new anatomical and computational perspectives.基底神经节的 Actor-评论家模型:新的解剖学和计算视角。
Neural Netw. 2002 Jun-Jul;15(4-6):535-47. doi: 10.1016/s0893-6080(02)00047-3.
2
[Morphological Re-evaluation of the Basal Ganglia Network].[基底神经节网络的形态学重新评估]
Brain Nerve. 2016 Jul;68(7):861-4. doi: 10.11477/mf.1416200519.
3
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
4
The many worlds hypothesis of dopamine prediction error: implications of a parallel circuit architecture in the basal ganglia.多巴胺预测误差的多世界假说:基底神经节中平行电路结构的意义。
Curr Opin Neurobiol. 2017 Oct;46:241-247. doi: 10.1016/j.conb.2017.08.015. Epub 2017 Oct 3.
5
Integration of reinforcement learning and optimal decision-making theories of the basal ganglia.整合强化学习与基底神经节的最优决策理论。
Neural Comput. 2011 Apr;23(4):817-51. doi: 10.1162/NECO_a_00103. Epub 2011 Jan 11.
6
Information processing, dimensionality reduction and reinforcement learning in the basal ganglia.基底神经节中的信息处理、降维和强化学习
Prog Neurobiol. 2003 Dec;71(6):439-73. doi: 10.1016/j.pneurobio.2003.12.001.
7
Dissociable roles of ventral pallidum neurons in the basal ganglia reinforcement learning network.腹侧苍白球神经元在基底神经节强化学习网络中的分离作用。
Nat Neurosci. 2020 Apr;23(4):556-564. doi: 10.1038/s41593-020-0605-y. Epub 2020 Mar 30.
8
Morphological elucidation of basal ganglia circuits contributing reward prediction.对参与奖励预测的基底神经节回路的形态学阐释。
Front Neurosci. 2015 Feb 5;9:6. doi: 10.3389/fnins.2015.00006. eCollection 2015.
9
Reinforcement learning using a continuous time actor-critic framework with spiking neurons.使用具有尖峰神经元的连续时间动作 - 评论框架进行强化学习。
PLoS Comput Biol. 2013 Apr;9(4):e1003024. doi: 10.1371/journal.pcbi.1003024. Epub 2013 Apr 11.
10
A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine.皮质-基底神经节通路的双重作用假说:多巴胺和腺苷介导的对立和时间差分。
Front Neural Circuits. 2019 Jan 7;12:111. doi: 10.3389/fncir.2018.00111. eCollection 2018.

引用本文的文献

1
Mechanisms and interventions promoting healthy frontostriatal dynamics in obsessive-compulsive disorder.促进强迫症中健康额纹状体动力学的机制与干预措施。
Nat Commun. 2025 Aug 11;16(1):7400. doi: 10.1038/s41467-025-62190-2.
2
Striatal Gradient in Value-Decay Explains Regional Differences in Dopamine Patterns and Reinforcement Learning Computations.纹状体价值衰减梯度解释了多巴胺模式和强化学习计算中的区域差异。
J Neurosci. 2025 Jul 18. doi: 10.1523/JNEUROSCI.0170-25.2025.
3
Dynamics of striatal action selection and reinforcement learning.
纹状体动作选择与强化学习的动态变化
Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.
4
The Computational Bottleneck of Basal Ganglia Output (and What to Do About it).基底神经节输出的计算瓶颈(以及应对方法)。
eNeuro. 2025 Apr 24;12(4). doi: 10.1523/ENEURO.0431-23.2024. Print 2025 Apr.
5
Effect of Extrinsic Reward on Motor Plasticity during Skill Learning.外在奖励对技能学习过程中运动可塑性的影响。
eNeuro. 2025 Apr 10;12(4). doi: 10.1523/ENEURO.0410-24.2025. Print 2025 Apr.
6
Natural behaviour is learned through dopamine-mediated reinforcement.自然行为是通过多巴胺介导的强化作用习得的。
Nature. 2025 May;641(8063):699-706. doi: 10.1038/s41586-025-08729-1. Epub 2025 Mar 12.
7
The striatal matrix compartment is expanded in autism spectrum disorder.在自闭症谱系障碍中,纹状体基质区室扩大。
J Neurodev Disord. 2025 Feb 15;17(1):8. doi: 10.1186/s11689-025-09596-7.
8
Dopaminergic responses to identity prediction errors depend differently on the orbitofrontal cortex and hippocampus.多巴胺能对身份预测误差的反应在眶额皮质和海马体上的依赖方式有所不同。
bioRxiv. 2024 Dec 17:2024.12.11.628003. doi: 10.1101/2024.12.11.628003.
9
The role of motor cortex in motor sequence execution depends on demands for flexibility.运动皮层在运动序列执行中的作用取决于对灵活性的需求。
Nat Neurosci. 2024 Dec;27(12):2466-2475. doi: 10.1038/s41593-024-01792-3. Epub 2024 Nov 4.
10
Dopamine transients encode reward prediction errors independent of learning rates.多巴胺瞬变独立于学习率编码奖励预测误差。
Cell Rep. 2024 Oct 22;43(10):114840. doi: 10.1016/j.celrep.2024.114840. Epub 2024 Oct 11.