• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在复杂空间环境中,双重学分分配过程是多巴胺信号的基础。

Dual credit assignment processes underlie dopamine signals in a complex spatial environment.

作者信息

Krausz Timothy A, Comrie Alison E, Frank Loren M, Daw Nathaniel D, Berke Joshua D

机构信息

Neuroscience Graduate Program, University of California, San Francisco.

Kavli Institute for Fundamental Neuroscience, and Weill Institute for Neurosciences, UCSF.

出版信息

bioRxiv. 2023 Mar 19:2023.02.15.528738. doi: 10.1101/2023.02.15.528738.

DOI:10.1101/2023.02.15.528738
PMID:36993482
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10054934/
Abstract

Dopamine in the nucleus accumbens helps motivate behavior based on expectations of future reward ("values"). These values need to be updated by experience: after receiving reward, the choices that led to reward should be assigned greater value. There are multiple theoretical proposals for how this credit assignment could be achieved, but the specific algorithms that generate updated dopamine signals remain uncertain. We monitored accumbens dopamine as freely behaving rats foraged for rewards in a complex, changing environment. We observed brief pulses of dopamine both when rats received reward (scaling with prediction error), and when they encountered novel path opportunities. Furthermore, dopamine ramped up as rats ran towards reward ports, in proportion to the value at each location. By examining the evolution of these dopamine place-value signals, we found evidence for two distinct update processes: progressive propagation along taken paths, as in temporal-difference learning, and inference of value throughout the maze, using internal models. Our results demonstrate that within rich, naturalistic environments dopamine conveys place values that are updated via multiple, complementary learning algorithms.

摘要

伏隔核中的多巴胺有助于根据对未来奖励(“价值”)的期望来激发行为。这些价值需要通过经验进行更新:在获得奖励后,导致奖励的选择应被赋予更高的价值。关于如何实现这种信用分配有多种理论提议,但产生更新的多巴胺信号的具体算法仍不确定。我们在自由活动的大鼠于复杂多变的环境中觅食奖励时监测了伏隔核多巴胺。我们观察到,当大鼠获得奖励时(与预测误差成比例)以及当它们遇到新的路径机会时,多巴胺都会出现短暂脉冲。此外,当大鼠跑向奖励端口时,多巴胺会随着每个位置的价值成比例增加。通过研究这些多巴胺位置价值信号的演变,我们发现了两种不同更新过程的证据:如在时间差分学习中那样沿着所走路径进行渐进传播,以及使用内部模型在整个迷宫中推断价值。我们的结果表明,在丰富的自然环境中,多巴胺传达的位置价值是通过多种互补学习算法进行更新的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/9589cc1f5b3a/nihpp-2023.02.15.528738v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/4f87a6ea25bc/nihpp-2023.02.15.528738v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/8fde4012d4c0/nihpp-2023.02.15.528738v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/061df34e882b/nihpp-2023.02.15.528738v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/f681328f7368/nihpp-2023.02.15.528738v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/ea62b4a02251/nihpp-2023.02.15.528738v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/ad1e1c8a0733/nihpp-2023.02.15.528738v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/9589cc1f5b3a/nihpp-2023.02.15.528738v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/4f87a6ea25bc/nihpp-2023.02.15.528738v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/8fde4012d4c0/nihpp-2023.02.15.528738v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/061df34e882b/nihpp-2023.02.15.528738v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/f681328f7368/nihpp-2023.02.15.528738v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/ea62b4a02251/nihpp-2023.02.15.528738v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/ad1e1c8a0733/nihpp-2023.02.15.528738v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2e9/10054934/9589cc1f5b3a/nihpp-2023.02.15.528738v2-f0007.jpg

相似文献

1
Dual credit assignment processes underlie dopamine signals in a complex spatial environment.在复杂空间环境中,双重学分分配过程是多巴胺信号的基础。
bioRxiv. 2023 Mar 19:2023.02.15.528738. doi: 10.1101/2023.02.15.528738.
2
Dual credit assignment processes underlie dopamine signals in a complex spatial environment.双信用分配过程是复杂空间环境中多巴胺信号的基础。
Neuron. 2023 Nov 1;111(21):3465-3478.e7. doi: 10.1016/j.neuron.2023.07.017. Epub 2023 Aug 22.
3
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
4
Differential Dopamine Release Dynamics in the Nucleus Accumbens Core and Shell Reveal Complementary Signals for Error Prediction and Incentive Motivation.伏隔核核心区与壳区多巴胺释放动力学差异揭示了用于错误预测和激励动机的互补信号。
J Neurosci. 2015 Aug 19;35(33):11572-82. doi: 10.1523/JNEUROSCI.2344-15.2015.
5
Anticipatory reward signals in ventral striatal neurons of behaving rats.行为大鼠腹侧纹状体神经元中的预期奖励信号。
Eur J Neurosci. 2008 Nov;28(9):1849-66. doi: 10.1111/j.1460-9568.2008.06480.x.
6
Dopamine enhances model-free credit assignment through boosting of retrospective model-based inference.多巴胺通过增强回溯模型基推断来增强无模型信用分配。
Elife. 2021 Dec 9;10:e67778. doi: 10.7554/eLife.67778.
7
Dopamine errors drive excitatory and inhibitory components of backward conditioning in an outcome-specific manner.多巴胺错误以特定于结果的方式驱动反向条件作用的兴奋性和抑制性成分。
Curr Biol. 2022 Jul 25;32(14):3210-3218.e3. doi: 10.1016/j.cub.2022.06.035. Epub 2022 Jun 24.
8
Dopamine prediction error responses integrate subjective value from different reward dimensions.多巴胺预测误差反应整合了来自不同奖励维度的主观价值。
Proc Natl Acad Sci U S A. 2014 Feb 11;111(6):2343-8. doi: 10.1073/pnas.1321596111. Epub 2014 Jan 22.
9
The debate over dopamine's role in reward: the case for incentive salience.关于多巴胺在奖赏中作用的争论:动机显著性的实例
Psychopharmacology (Berl). 2007 Apr;191(3):391-431. doi: 10.1007/s00213-006-0578-x. Epub 2006 Oct 27.
10
Dynamic shaping of dopamine signals during probabilistic Pavlovian conditioning.概率性巴甫洛夫条件反射过程中多巴胺信号的动态塑造
Neurobiol Learn Mem. 2015 Jan;117:84-92. doi: 10.1016/j.nlm.2014.07.010. Epub 2014 Aug 27.

引用本文的文献

1
Rodent maze studies: from following simple rules to complex map learning.啮齿类动物迷宫研究:从遵循简单规则到复杂地图学习。
Brain Struct Funct. 2024 May;229(4):823-841. doi: 10.1007/s00429-024-02771-x. Epub 2024 Mar 15.

本文引用的文献

1
Mesolimbic dopamine release conveys causal associations.中脑边缘多巴胺释放传递因果关系。
Science. 2022 Dec 23;378(6626):eabq6740. doi: 10.1126/science.abq6740.
2
Imagination as a fundamental function of the hippocampus.想象作为海马体的基本功能。
Philos Trans R Soc Lond B Biol Sci. 2022 Dec 19;377(1866):20210336. doi: 10.1098/rstb.2021.0336. Epub 2022 Oct 31.
3
A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.多巴胺反应的逐渐时间转移反映了机器学习中时间差分误差的进展。
Nat Neurosci. 2022 Aug;25(8):1082-1092. doi: 10.1038/s41593-022-01109-2. Epub 2022 Jul 7.
4
How do real animals account for the passage of time during associative learning?真实动物在联想学习过程中如何感知时间的流逝?
Behav Neurosci. 2022 Oct;136(5):383-391. doi: 10.1037/bne0000516. Epub 2022 Apr 28.
5
The role of state uncertainty in the dynamics of dopamine.国家不确定性在多巴胺动态中的作用。
Curr Biol. 2022 Mar 14;32(5):1077-1087.e9. doi: 10.1016/j.cub.2022.01.025. Epub 2022 Feb 2.
6
The temporal dynamics of opportunity costs: A normative account of cognitive fatigue and boredom.机会成本的时间动态:认知疲劳和无聊现象的规范性解释。
Psychol Rev. 2022 Apr;129(3):564-585. doi: 10.1037/rev0000309. Epub 2021 Aug 12.
7
Experience replay is associated with efficient nonlocal learning.经验回放与高效的非局部学习有关。
Science. 2021 May 21;372(6544). doi: 10.1126/science.abf1357.
8
A Unified Framework for Dopamine Signals across Timescales.多巴胺信号的跨时间尺度统一框架。
Cell. 2020 Dec 10;183(6):1600-1616.e25. doi: 10.1016/j.cell.2020.11.013. Epub 2020 Nov 27.
9
Neuronal Computation Underlying Inferential Reasoning in Humans and Mice.人类和小鼠推理思维的神经计算。
Cell. 2020 Oct 1;183(1):228-243.e21. doi: 10.1016/j.cell.2020.08.035. Epub 2020 Sep 17.
10
Constant Sub-second Cycling between Representations of Possible Futures in the Hippocampus.海马体中可能未来的表示之间的恒次秒循环。
Cell. 2020 Feb 6;180(3):552-567.e25. doi: 10.1016/j.cell.2020.01.014. Epub 2020 Jan 30.