• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于模型的海马-腹侧纹状体回路中的空间导航:计算分析。

Model-based spatial navigation in the hippocampus-ventral striatum circuit: A computational analysis.

机构信息

Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.

University of Amsterdam, Swammerdam Institute for Life Sciences-Center for Neuroscience Amsterdam, The Netherlands.

出版信息

PLoS Comput Biol. 2018 Sep 17;14(9):e1006316. doi: 10.1371/journal.pcbi.1006316. eCollection 2018 Sep.

DOI:10.1371/journal.pcbi.1006316
PMID:30222746
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6160242/
Abstract

While the neurobiology of simple and habitual choices is relatively well known, our current understanding of goal-directed choices and planning in the brain is still limited. Theoretical work suggests that goal-directed computations can be productively associated to model-based (reinforcement learning) computations, yet a detailed mapping between computational processes and neuronal circuits remains to be fully established. Here we report a computational analysis that aligns Bayesian nonparametrics and model-based reinforcement learning (MB-RL) to the functioning of the hippocampus (HC) and the ventral striatum (vStr)-a neuronal circuit that increasingly recognized to be an appropriate model system to understand goal-directed (spatial) decisions and planning mechanisms in the brain. We test the MB-RL agent in a contextual conditioning task that depends on intact hippocampus and ventral striatal (shell) function and show that it solves the task while showing key behavioral and neuronal signatures of the HC-vStr circuit. Our simulations also explore the benefits of biological forms of look-ahead prediction (forward sweeps) during both learning and control. This article thus contributes to fill the gap between our current understanding of computational algorithms and biological realizations of (model-based) reinforcement learning.

摘要

虽然简单和习惯性选择的神经生物学相对为人所知,但我们目前对大脑中目标导向选择和规划的理解仍然有限。理论工作表明,目标导向的计算可以与基于模型的(强化学习)计算进行有益的关联,但计算过程和神经元回路之间的详细映射仍有待完全建立。在这里,我们报告了一项计算分析,该分析将贝叶斯非参数和基于模型的强化学习(MB-RL)与海马体(HC)和腹侧纹状体(vStr)的功能联系起来 - 越来越多的神经元回路被认为是一个合适的模型系统,用于理解大脑中的目标导向(空间)决策和规划机制。我们在一个依赖于完整的海马体和腹侧纹状体(壳)功能的情境条件作用任务中测试了 MB-RL 代理,并表明它解决了任务,同时显示了 HC-vStr 回路的关键行为和神经元特征。我们的模拟还探索了在学习和控制过程中进行生物形式的前瞻性预测(前扫)的好处。因此,本文有助于填补我们对计算算法的理解和(基于模型的)强化学习的生物实现之间的差距。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/5c7a6bb6ba03/pcbi.1006316.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/78d183708bcc/pcbi.1006316.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/bd78a1ae09b5/pcbi.1006316.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/9f3106ae22f6/pcbi.1006316.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/38d65feecb37/pcbi.1006316.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/31e9ac09471f/pcbi.1006316.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/a2245ebc5b95/pcbi.1006316.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/b84cf0f0c3e7/pcbi.1006316.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/b418d3e9cdd0/pcbi.1006316.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/ef9d122dd40f/pcbi.1006316.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/830a83593d4c/pcbi.1006316.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/5c7a6bb6ba03/pcbi.1006316.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/78d183708bcc/pcbi.1006316.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/bd78a1ae09b5/pcbi.1006316.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/9f3106ae22f6/pcbi.1006316.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/38d65feecb37/pcbi.1006316.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/31e9ac09471f/pcbi.1006316.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/a2245ebc5b95/pcbi.1006316.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/b84cf0f0c3e7/pcbi.1006316.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/b418d3e9cdd0/pcbi.1006316.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/ef9d122dd40f/pcbi.1006316.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/830a83593d4c/pcbi.1006316.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a28/6160242/5c7a6bb6ba03/pcbi.1006316.g011.jpg

相似文献

1
Model-based spatial navigation in the hippocampus-ventral striatum circuit: A computational analysis.基于模型的海马-腹侧纹状体回路中的空间导航:计算分析。
PLoS Comput Biol. 2018 Sep 17;14(9):e1006316. doi: 10.1371/journal.pcbi.1006316. eCollection 2018 Sep.
2
Using hippocampal-striatal loops for spatial navigation and goal-directed decision-making.利用海马体-纹状体环路进行空间导航和目标导向决策。
Cogn Process. 2012 Aug;13 Suppl 1:S125-9. doi: 10.1007/s10339-012-0475-7.
3
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
4
Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning.杏仁核与腹侧纹状体对强化学习有不同贡献。
Neuron. 2016 Oct 19;92(2):505-517. doi: 10.1016/j.neuron.2016.09.025. Epub 2016 Oct 6.
5
Real-time sensory-motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization.模拟和物理大鼠机器人中海马位置细胞重放和前额叶序列学习的实时感觉运动整合,用于新路径优化。
Biol Cybern. 2020 Apr;114(2):249-268. doi: 10.1007/s00422-020-00820-2. Epub 2020 Feb 24.
6
A general model of hippocampal and dorsal striatal learning and decision making.海马体和背侧纹状体学习和决策的通用模型。
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31427-31437. doi: 10.1073/pnas.2007981117. Epub 2020 Nov 23.
7
Neural correlates of forward planning in a spatial decision task in humans.人类在空间决策任务中进行前瞻性规划的神经关联。
J Neurosci. 2011 Apr 6;31(14):5526-39. doi: 10.1523/JNEUROSCI.4647-10.2011.
8
A computational model for spatial cognition combining dorsal and ventral hippocampal place field maps: multiscale navigation.结合背侧和腹侧海马体位置场图的空间认知计算模型:多尺度导航。
Biol Cybern. 2020 Apr;114(2):187-207. doi: 10.1007/s00422-019-00812-x. Epub 2020 Jan 9.
9
Internally generated sequences in learning and executing goal-directed behavior.内在产生的序列在学习和执行目标导向行为中的作用。
Trends Cogn Sci. 2014 Dec;18(12):647-57. doi: 10.1016/j.tics.2014.06.011. Epub 2014 Aug 23.
10
Offline ventral subiculum-ventral striatum serial communication is required for spatial memory consolidation.离线腹侧海马体-腹侧纹状体序列通讯是空间记忆巩固所必需的。
Nat Commun. 2019 Dec 16;10(1):5721. doi: 10.1038/s41467-019-13703-3.

引用本文的文献

1
The coming decade of digital brain research: A vision for neuroscience at the intersection of technology and computing.数字脑研究的未来十年:科技与计算交叉领域的神经科学愿景。
Imaging Neurosci (Camb). 2024 Apr 18;2. doi: 10.1162/imag_a_00137. eCollection 2024.
2
Deep Hybrid Models: Infer and Plan in a Dynamic World.深度混合模型:在动态世界中进行推理与规划。
Entropy (Basel). 2025 May 27;27(6):570. doi: 10.3390/e27060570.
3
A hierarchical active inference model of spatial alternation tasks and the hippocampal-prefrontal circuit.

本文引用的文献

1
The successor representation in human reinforcement learning.人类强化学习中的后继表示
Nat Hum Behav. 2017 Sep;1(9):680-692. doi: 10.1038/s41562-017-0180-8. Epub 2017 Aug 28.
2
Prioritized memory access explains planning and hippocampal replay.优先化的记忆访问解释了规划和海马体重放。
Nat Neurosci. 2018 Nov;21(11):1609-1617. doi: 10.1038/s41593-018-0232-z. Epub 2018 Oct 22.
3
Hierarchical Active Inference: A Theory of Motivated Control.分层主动推理:动机控制理论。
空间交替任务和海马-前额叶回路的分层主动推理模型。
Nat Commun. 2024 Nov 15;15(1):9892. doi: 10.1038/s41467-024-54257-3.
4
Dopamine-mediated formation of a memory module in the nucleus accumbens for goal-directed navigation.伏隔核中多巴胺介导的记忆模块形成,用于目标导向导航。
Nat Neurosci. 2024 Nov;27(11):2178-2192. doi: 10.1038/s41593-024-01770-9. Epub 2024 Sep 27.
5
An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning.一种受大鼠大脑前向预测机制启发的改进型Dyna-Q算法用于移动机器人路径规划
Biomimetics (Basel). 2024 May 23;9(6):315. doi: 10.3390/biomimetics9060315.
6
Grid cells: the missing link in understanding Parkinson's disease?网格细胞:理解帕金森病的缺失环节?
Front Neurosci. 2024 Feb 8;18:1276714. doi: 10.3389/fnins.2024.1276714. eCollection 2024.
7
Flexible intentions: An Active Inference theory.灵活意图:一种主动推理理论。
Front Comput Neurosci. 2023 Mar 20;17:1128694. doi: 10.3389/fncom.2023.1128694. eCollection 2023.
8
Habit formation viewed as structural change in the behavioral network.习惯形成被视为行为网络中的结构性变化。
Commun Biol. 2023 Apr 4;6(1):303. doi: 10.1038/s42003-023-04500-2.
9
A quadruple dissociation of reward-related behaviour in mice across excitatory inputs to the nucleus accumbens shell.在小鼠的伏隔核壳兴奋性传入中,对奖赏相关行为进行四重分离。
Commun Biol. 2023 Jan 30;6(1):119. doi: 10.1038/s42003-023-04429-6.
10
Absolute Winding Number Differentiates Mouse Spatial Navigation Strategies With Genetic Risk for Alzheimer's Disease.绝对缠绕数可区分具有阿尔茨海默病遗传风险的小鼠空间导航策略。
Front Neurosci. 2022 Jun 17;16:848654. doi: 10.3389/fnins.2022.848654. eCollection 2022.
Trends Cogn Sci. 2018 Apr;22(4):294-306. doi: 10.1016/j.tics.2018.01.009. Epub 2018 Feb 20.
4
The hippocampus as a predictive map.海马体作为一个预测图。
Nat Neurosci. 2017 Nov;20(11):1643-1653. doi: 10.1038/nn.4650. Epub 2017 Oct 2.
5
Predictive representations can link model-based reinforcement learning to model-free mechanisms.预测性表征可以将基于模型的强化学习与无模型机制联系起来。
PLoS Comput Biol. 2017 Sep 25;13(9):e1005768. doi: 10.1371/journal.pcbi.1005768. eCollection 2017 Sep.
6
Lesions of the Head Direction Cell System Increase Hippocampal Place Field Repetition.头部方向细胞系统损伤增加海马位置场重复。
Curr Biol. 2017 Sep 11;27(17):2706-2712.e2. doi: 10.1016/j.cub.2017.07.071. Epub 2017 Aug 31.
7
Active Inference, Curiosity and Insight.主动推理、好奇心与洞察力。
Neural Comput. 2017 Oct;29(10):2633-2683. doi: 10.1162/neco_a_00999. Epub 2017 Aug 4.
8
Dorsal hippocampus contributes to model-based planning.背侧海马体有助于基于模型的规划。
Nat Neurosci. 2017 Sep;20(9):1269-1276. doi: 10.1038/nn.4613. Epub 2017 Jul 31.
9
Temporal and Rate Coding for Discrete Event Sequences in the Hippocampus.海马体中离散事件序列的时间和速率编码。
Neuron. 2017 Jun 21;94(6):1248-1262.e4. doi: 10.1016/j.neuron.2017.05.024. Epub 2017 Jun 9.
10
Oscillations, neural computations and learning during wake and sleep.清醒和睡眠期间的振荡、神经计算和学习。
Curr Opin Neurobiol. 2017 Jun;44:193-201. doi: 10.1016/j.conb.2017.05.009.