Fermin Alan S R, Yoshida Takehiko, Yoshimoto Junichiro, Ito Makoto, Tanaka Saori C, Doya Kenji
Graduate School of Information Science, Nara Institute of Science and Technology, Nara 630-0192, Japan.
Neural Computation Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0495, Japan.
Sci Rep. 2016 Aug 19;6:31378. doi: 10.1038/srep31378.
Humans can select actions by learning, planning, or retrieving motor memories. Reinforcement learning (RL) associates these processes with three major classes of strategies for action selection: exploratory RL learns state-action values by exploration, model-based RL uses internal models to simulate future states reached by hypothetical actions, and motor-memory RL selects past successful state-action mappings. In order to investigate the neural substrates that implement these strategies, we conducted a functional magnetic resonance imaging (fMRI) experiment while humans performed a sequential action selection task under conditions that promoted the use of a specific RL strategy. The ventromedial prefrontal cortex and ventral striatum increased activity in the exploratory condition; the dorsolateral prefrontal cortex, dorsomedial striatum, and lateral cerebellum in the model-based condition; and the supplementary motor area, putamen, and anterior cerebellum in the motor-memory condition. These findings suggest that a distinct prefrontal-basal ganglia and cerebellar network implements the model-based RL action selection strategy.
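The three action-selection strategies contrasted in the abstract can be sketched in code. This is a minimal illustration under assumed toy settings (a two-state task with two actions); the function names, the task, and the data structures are hypothetical and are not the paper's actual experimental task or computational models.

```python
import random

# Toy setting (assumption): two states {0, 1}, two actions {0, 1};
# from state 0, action 1 leads to the rewarded state 1.
ACTIONS = [0, 1]


def exploratory_select(q_values, state, epsilon=0.1):
    """Exploratory RL: epsilon-greedy over learned state-action values."""
    if random.random() < epsilon:
        return random.choice(ACTIONS)  # explore a random action
    return max(ACTIONS, key=lambda a: q_values[(state, a)])  # exploit


def model_based_select(model, reward, state):
    """Model-based RL: use an internal model to simulate the future
    state reached by each hypothetical action, then pick the action
    whose simulated outcome is most rewarding."""
    return max(ACTIONS, key=lambda a: reward[model[(state, a)]])


def motor_memory_select(memory, state):
    """Motor-memory RL: retrieve the past successful state-action
    mapping for this state."""
    return memory[state]


# Illustrative task data (assumptions, not from the paper):
model = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 1}  # transition model
reward = {0: 0.0, 1: 1.0}                             # state rewards
q_values = {(0, 0): 0.0, (0, 1): 0.9,                 # learned values
            (1, 0): 0.0, (1, 1): 0.0}
memory = {0: 1, 1: 0}                                 # stored mappings

print(model_based_select(model, reward, 0))  # → 1 (simulated lookahead)
print(motor_memory_select(memory, 0))        # → 1 (retrieved mapping)
```

All three functions return the same action here by construction, which mirrors the experimental logic: the strategies can produce identical behavior while relying on different computations, so distinguishing them requires conditions that promote one strategy over the others.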