• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

皮质基底神经节回路中的强化学习的多种表示和算法。

Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit.

机构信息

Neural Computation Unit, Okinawa Institute of Science and Technology, Okinawa 904-0412, Japan.

出版信息

Curr Opin Neurobiol. 2011 Jun;21(3):368-73. doi: 10.1016/j.conb.2011.04.001. Epub 2011 Apr 29.

DOI:10.1016/j.conb.2011.04.001
PMID:21531544
Abstract

Accumulating evidence shows that the neural network of the cerebral cortex and the basal ganglia is critically involved in reinforcement learning. Recent studies found functional heterogeneity within the cortico-basal ganglia circuit, especially in its ventromedial to dorsolateral axis. Here we review computational issues in reinforcement learning and propose a working hypothesis on how multiple reinforcement learning algorithms are implemented in the cortico-basal ganglia circuit using different representations of states, values, and actions.

摘要

越来越多的证据表明,大脑皮层和基底神经节的神经网络在强化学习中起着至关重要的作用。最近的研究发现,皮质-基底神经节回路内存在功能异质性,特别是在其腹侧到背侧轴上。在这里,我们回顾了强化学习中的计算问题,并提出了一个工作假设,即使用状态、值和动作的不同表示形式,多个强化学习算法如何在皮质-基底神经节回路中实现。

相似文献

1
Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit.皮质基底神经节回路中的强化学习的多种表示和算法。
Curr Opin Neurobiol. 2011 Jun;21(3):368-73. doi: 10.1016/j.conb.2011.04.001. Epub 2011 Apr 29.
2
Self-organization in the basal ganglia with modulation of reinforcement signals.基底神经节中的自组织与强化信号的调制
Neural Comput. 2002 Apr;14(4):819-44. doi: 10.1162/089976602317318974.
3
Integration of reinforcement learning and optimal decision-making theories of the basal ganglia.整合强化学习与基底神经节的最优决策理论。
Neural Comput. 2011 Apr;23(4):817-51. doi: 10.1162/NECO_a_00103. Epub 2011 Jan 11.
4
Cortico-basal ganglia circuitry: a review of key research and implications for functional connectivity studies of mood and anxiety disorders.皮质-基底神经节回路:关键研究综述及其对情绪和焦虑障碍功能连接研究的影响。
Brain Struct Funct. 2010 Aug;215(2):73-96. doi: 10.1007/s00429-010-0280-y. Epub 2010 Oct 12.
5
Working memory and response selection: a computational account of interactions among cortico-basalganglio-thalamic loops.工作记忆和反应选择:皮质基底神经节丘脑回路相互作用的计算描述。
Neural Netw. 2012 Feb;26:59-74. doi: 10.1016/j.neunet.2011.10.008. Epub 2011 Oct 25.
6
[Cortico-basal ganglia circuits--parallel closed loops and convergent/divergent connections].[皮质-基底神经节环路——平行闭环与汇聚/发散连接]
Brain Nerve. 2009 Apr;61(4):351-9.
7
[Functional analysis of the roles of direct and indirect pathways by using immunotoxin-mediated cell targeting approach].[利用免疫毒素介导的细胞靶向方法对直接和间接通路作用的功能分析]
Brain Nerve. 2009 Apr;61(4):412-8.
8
Variability in action: Contributions of a songbird cortical-basal ganglia circuit to vocal motor learning and control.行为中的变异性:鸣禽皮质-基底神经节回路对发声运动学习与控制的贡献。
Neuroscience. 2015 Jun 18;296:39-47. doi: 10.1016/j.neuroscience.2014.10.010. Epub 2014 Oct 18.
9
Seven problems on the basal ganglia.基底神经节的七个问题。
Curr Opin Neurobiol. 2008 Dec;18(6):595-604. doi: 10.1016/j.conb.2008.11.001. Epub 2008 Dec 8.
10
The basal ganglia and cortex implement optimal decision making between alternative actions.基底神经节和皮层在不同动作之间实现最优决策。
Neural Comput. 2007 Feb;19(2):442-77. doi: 10.1162/neco.2007.19.2.442.

引用本文的文献

1
Success-efficient/failure-safe strategy for hierarchical reinforcement motor learning.分层强化运动学习的成功高效/失败安全策略。
PLoS Comput Biol. 2025 May 9;21(5):e1013089. doi: 10.1371/journal.pcbi.1013089. eCollection 2025 May.
2
Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化
Elife. 2025 May 8;13:RP101747. doi: 10.7554/eLife.101747.
3
The Computational Bottleneck of Basal Ganglia Output (and What to Do About it).基底神经节输出的计算瓶颈(以及应对方法)。
eNeuro. 2025 Apr 24;12(4). doi: 10.1523/ENEURO.0431-23.2024. Print 2025 Apr.
4
Dynamics of striatal action selection and reinforcement learning.纹状体动作选择与强化学习的动态变化
bioRxiv. 2024 Dec 24:2024.02.14.580408. doi: 10.1101/2024.02.14.580408.
5
Subthalamic nucleus deep brain stimulation alleviates oxidative stress via mitophagy in Parkinson's disease.丘脑底核深部脑刺激通过线粒体自噬减轻帕金森病中的氧化应激。
NPJ Parkinsons Dis. 2024 Mar 6;10(1):52. doi: 10.1038/s41531-024-00668-4.
6
Dopamine transients follow a striatal gradient of reward time horizons.多巴胺瞬变遵循纹状体奖赏时程的梯度。
Nat Neurosci. 2024 Apr;27(4):737-746. doi: 10.1038/s41593-023-01566-3. Epub 2024 Feb 6.
7
Interaction between decision-making and motor learning when selecting reach targets in the presence of bias and noise.在存在偏差和噪声的情况下选择目标时,决策与运动学习之间的相互作用。
PLoS Comput Biol. 2023 Nov 2;19(11):e1011596. doi: 10.1371/journal.pcbi.1011596. eCollection 2023 Nov.
8
Virtual reality to improve low-back pain and pelvic pain during pregnancy: a pilot RCT for a multicenter randomized controlled trial.虚拟现实改善孕期下背痛和骨盆疼痛:一项多中心随机对照试验的初步随机对照试验
Front Med (Lausanne). 2023 Sep 4;10:1206799. doi: 10.3389/fmed.2023.1206799. eCollection 2023.
9
Contributions of the Basal Ganglia to Visual Perceptual Decisions.基底神经节对视觉感知决策的贡献。
Annu Rev Vis Sci. 2023 Sep 15;9:385-407. doi: 10.1146/annurev-vision-111022-123804.
10
Selective encoding of reward predictions and prediction errors by globus pallidus subpopulations.苍白球亚群对奖励预测和预测误差的选择性编码。
Curr Biol. 2023 Oct 9;33(19):4124-4135.e5. doi: 10.1016/j.cub.2023.08.042. Epub 2023 Sep 12.