• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

海马体和背侧纹状体学习和决策的通用模型。

A general model of hippocampal and dorsal striatal learning and decision making.

机构信息

Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London W1T 4JG, United Kingdom.

Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, United Kingdom.

出版信息

Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31427-31437. doi: 10.1073/pnas.2007981117. Epub 2020 Nov 23.

DOI:10.1073/pnas.2007981117
PMID:33229541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7733794/
Abstract

Humans and other animals use multiple strategies for making decisions. Reinforcement-learning theory distinguishes between stimulus-response (model-free; MF) learning and deliberative (model-based; MB) planning. The spatial-navigation literature presents a parallel dichotomy between navigation strategies. In "response learning," associated with the dorsolateral striatum (DLS), decisions are anchored to an egocentric reference frame. In "place learning," associated with the hippocampus, decisions are anchored to an allocentric reference frame. Emerging evidence suggests that the contribution of hippocampus to place learning may also underlie its contribution to MB learning by representing relational structure in a cognitive map. Here, we introduce a computational model in which hippocampus subserves place and MB learning by learning a "successor representation" of relational structure between states; DLS implements model-free response learning by learning associations between actions and egocentric representations of landmarks; and action values from either system are weighted by the reliability of its predictions. We show that this model reproduces a range of seemingly disparate behavioral findings in spatial and nonspatial decision tasks and explains the effects of lesions to DLS and hippocampus on these tasks. Furthermore, modeling place cells as driven by boundaries explains the observation that, unlike navigation guided by landmarks, navigation guided by boundaries is robust to "blocking" by prior state-reward associations due to learned associations between place cells. Our model, originally shaped by detailed constraints in the spatial literature, successfully characterizes the hippocampal-striatal system as a general system for decision making via adaptive combination of stimulus-response learning and the use of a cognitive map.

摘要

人类和其他动物使用多种策略来做出决策。强化学习理论区分了刺激-反应(无模型;MF)学习和深思熟虑(基于模型;MB)规划。空间导航文献提出了一种平行的二分法,用于区分导航策略。在“反应学习”中,与背外侧纹状体(DLS)相关联,决策以自我为中心的参考系为锚定点。在“位置学习”中,与海马体相关联,决策以客观参考系为锚定点。新出现的证据表明,海马体对位置学习的贡献也可能是通过在认知地图中表示关系结构来为 MB 学习做出贡献。在这里,我们引入了一个计算模型,其中海马体通过学习状态之间关系结构的“后继表示”来实现位置和 MB 学习;DLS 通过学习动作和地标自我中心表示之间的关联来实现无模型反应学习;来自任一系统的动作值都由其预测的可靠性加权。我们表明,该模型再现了一系列看似不同的空间和非空间决策任务中的行为发现,并解释了 DLS 和海马体损伤对这些任务的影响。此外,将位置细胞建模为受边界驱动,可以解释这样一个观察结果,即与由地标引导的导航不同,由边界引导的导航由于位置细胞之间的学习关联而对先前状态-奖励关联的“阻断”具有鲁棒性。我们的模型最初是根据空间文献中的详细约束形成的,成功地将海马-纹状体系统描述为通过刺激-反应学习的自适应组合和使用认知地图来进行决策的通用系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/6cb213c637ef/pnas.2007981117fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/54af56b31317/pnas.2007981117fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/f5a92e48cd11/pnas.2007981117fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/2a66beefb4d0/pnas.2007981117fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/baa5ab7365c2/pnas.2007981117fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/09b05cc01344/pnas.2007981117fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/6cb213c637ef/pnas.2007981117fig06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/54af56b31317/pnas.2007981117fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/f5a92e48cd11/pnas.2007981117fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/2a66beefb4d0/pnas.2007981117fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/baa5ab7365c2/pnas.2007981117fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/09b05cc01344/pnas.2007981117fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29af/7733794/6cb213c637ef/pnas.2007981117fig06.jpg

相似文献

1
A general model of hippocampal and dorsal striatal learning and decision making.海马体和背侧纹状体学习和决策的通用模型。
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31427-31437. doi: 10.1073/pnas.2007981117. Epub 2020 Nov 23.
2
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
3
Using hippocampal-striatal loops for spatial navigation and goal-directed decision-making.利用海马体-纹状体环路进行空间导航和目标导向决策。
Cogn Process. 2012 Aug;13 Suppl 1:S125-9. doi: 10.1007/s10339-012-0475-7.
4
Contributions of Hippocampus and Striatum to Memory-Guided Behavior Depend on Past Experience.海马体和纹状体对记忆引导行为的贡献取决于过去的经验。
J Neurosci. 2016 Jun 15;36(24):6459-70. doi: 10.1523/JNEUROSCI.0840-16.2016.
5
Reinforcement learning approaches to hippocampus-dependent flexible spatial navigation.用于依赖海马体的灵活空间导航的强化学习方法。
Brain Neurosci Adv. 2021 Apr 9;5:2398212820975634. doi: 10.1177/2398212820975634. eCollection 2021 Jan-Dec.
6
Striatal versus hippocampal representations during win-stay maze performance.在赢则停留迷宫任务表现期间纹状体与海马体的表征。
J Neurophysiol. 2009 Mar;101(3):1575-87. doi: 10.1152/jn.91106.2008. Epub 2009 Jan 14.
7
Interindividual differences in memory system local field potential activity predict behavioral strategy on a dual-solution T-maze.个体间记忆系统局部场电位活动的差异预测了双解 T 迷宫上的行为策略。
Hippocampus. 2020 Dec;30(12):1313-1326. doi: 10.1002/hipo.23258. Epub 2020 Sep 7.
8
Threat-induced modulation of hippocampal and striatal memory systems during navigation of a virtual environment.在虚拟环境中导航时,威胁诱导的海马体和纹状体记忆系统的调制。
Neurobiol Learn Mem. 2020 Feb;168:107160. doi: 10.1016/j.nlm.2020.107160. Epub 2020 Jan 7.
9
Both the dorsal hippocampus and the dorsolateral striatum are needed for rat navigation in the Morris water maze.背侧海马体和背外侧纹状体都是大鼠在 Morris 水迷宫中导航所必需的。
Behav Brain Res. 2012 Jan 1;226(1):171-8. doi: 10.1016/j.bbr.2011.09.011. Epub 2011 Sep 12.
10
Spatial coordinate transforms linking the allocentric hippocampal and egocentric parietal primate brain systems for memory, action in space, and navigation.将无定位参考的海马体和自我中心的顶叶灵长类大脑系统的空间坐标转换联系起来,用于记忆、空间中的动作和导航。
Hippocampus. 2020 Apr;30(4):332-353. doi: 10.1002/hipo.23171. Epub 2019 Nov 7.

引用本文的文献

1
The neural computational and dynamical mechanisms of reward-modulated spatial coding in hippocampal place cells.海马体位置细胞中奖励调制空间编码的神经计算与动力学机制。
Cogn Neurodyn. 2025 Dec;19(1):99. doi: 10.1007/s11571-025-10282-6. Epub 2025 Jun 23.
2
Impact of symmetry in local learning rules on predictive neural representations and generalization in spatial navigation.局部学习规则中的对称性对空间导航中预测性神经表征及泛化的影响。
PLoS Comput Biol. 2025 Jun 23;21(6):e1013056. doi: 10.1371/journal.pcbi.1013056. eCollection 2025 Jun.
3
Neural evidence that humans reuse strategies to solve new tasks.

本文引用的文献

1
Fast reinforcement learning with generalized policy updates.快速强化学习与广义策略更新。
Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30079-30087. doi: 10.1073/pnas.1907370117. Epub 2020 Aug 17.
2
Goal-directed actions transiently depend on dorsal hippocampus.目标导向的动作暂时依赖于背侧海马体。
Nat Neurosci. 2020 Oct;23(10):1194-1197. doi: 10.1038/s41593-020-0693-8. Epub 2020 Aug 10.
3
Neuronal vector coding in spatial cognition.空间认知中的神经元向量编码。
人类重复使用策略来解决新任务的神经学证据。
PLoS Biol. 2025 Jun 5;23(6):e3003174. doi: 10.1371/journal.pbio.3003174. eCollection 2025 Jun.
4
Noise Resilience of Successor and Predecessor Feature Algorithms in One- and Two-Dimensional Environments.一维和二维环境中后继与前驱特征算法的抗噪能力
Sensors (Basel). 2025 Feb 6;25(3):979. doi: 10.3390/s25030979.
5
A recurrent network model of planning explains hippocampal replay and human behavior.一种规划的循环网络模型解释了海马体重放和人类行为。
Nat Neurosci. 2024 Jul;27(7):1340-1348. doi: 10.1038/s41593-024-01675-7. Epub 2024 Jun 7.
6
An Integrated Platform for Electrophysiology in Spatial Cognition Experiments.空间认知实验中的电生理整合平台。
eNeuro. 2023 Nov 21;10(11). doi: 10.1523/ENEURO.0274-23.2023. Print 2023 Nov.
7
Accounting for multiscale processing in adaptive real-world decision-making via the hippocampus.通过海马体在适应性现实世界决策中考虑多尺度处理。
Front Neurosci. 2023 Sep 5;17:1200842. doi: 10.3389/fnins.2023.1200842. eCollection 2023.
8
An Integrated Platform for in vivo Electrophysiology in Spatial Cognition Experiments.用于空间认知实验的体内电生理学综合平台。
bioRxiv. 2023 Aug 21:2023.08.02.551691. doi: 10.1101/2023.08.02.551691.
9
Strategy dependent recruitment of distributed cortical circuits during spatial navigation.空间导航过程中依赖策略的分布式皮质回路募集
Res Sq. 2023 Jun 16:rs.3.rs-2997927. doi: 10.21203/rs.3.rs-2997927/v1.
10
Chronic alcohol consumption shifts learning strategies and synaptic plasticity from hippocampus to striatum-dependent pathways.长期饮酒会使学习策略和突触可塑性从海马体依赖途径转变为纹状体依赖途径。
Front Psychiatry. 2023 May 26;14:1129030. doi: 10.3389/fpsyt.2023.1129030. eCollection 2023.
Nat Rev Neurosci. 2020 Sep;21(9):453-470. doi: 10.1038/s41583-020-0336-9. Epub 2020 Aug 6.
4
Neurobiological successor features for spatial navigation.空间导航的神经生物学后继特征。
Hippocampus. 2020 Dec;30(12):1347-1355. doi: 10.1002/hipo.23246. Epub 2020 Jun 25.
5
Deforming the metric of cognitive maps distorts memory.改变认知地图的度量会扭曲记忆。
Nat Hum Behav. 2020 Feb;4(2):177-188. doi: 10.1038/s41562-019-0767-3. Epub 2019 Nov 18.
6
Neuronal representation of environmental boundaries in egocentric coordinates.自我中心坐标中环境边界的神经元表示。
Nat Commun. 2019 Jun 24;10(1):2772. doi: 10.1038/s41467-019-10722-y.
7
The successor representation in human reinforcement learning.人类强化学习中的后继表示
Nat Hum Behav. 2017 Sep;1(9):680-692. doi: 10.1038/s41562-017-0180-8. Epub 2017 Aug 28.
8
Hippocampal Contributions to Model-Based Planning and Spatial Memory.海马体对基于模型的规划和空间记忆的贡献。
Neuron. 2019 May 8;102(3):683-693.e4. doi: 10.1016/j.neuron.2019.02.014. Epub 2019 Mar 11.
9
Rethinking dopamine as generalized prediction error.重新思考多巴胺作为一般性预测误差。
Proc Biol Sci. 2018 Nov 21;285(1891):20181645. doi: 10.1098/rspb.2018.1645.
10
A neural-level model of spatial memory and imagery.空间记忆和意象的神经水平模型。
Elife. 2018 Sep 4;7:e33752. doi: 10.7554/eLife.33752.