前瞻性偶然性解释了联想学习过程中的行为和多巴胺信号。

Prospective contingency explains behavior and dopamine signals during associative learning.

作者信息

Qian Lechen, Burrell Mark, Hennig Jay A, Matias Sara, Murthy Venkatesh N, Gershman Samuel J, Uchida Naoshige

机构信息

Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA.

Center for Brain Science, Harvard University, Cambridge, MA, USA.

出版信息

Nat Neurosci. 2025 Mar 18. doi: 10.1038/s41593-025-01915-4.

DOI:10.1038/s41593-025-01915-4

PMID:40102680

Abstract

Associative learning depends on contingency, the degree to which a stimulus predicts an outcome. Despite its importance, the neural mechanisms linking contingency to behavior remain elusive. In the present study, we examined the dopamine activity in the ventral striatum-a signal implicated in associative learning-in a Pavlovian contingency degradation task in mice. We show that both anticipatory licking and dopamine responses to a conditioned stimulus decreased when additional rewards were delivered uncued, but remained unchanged if additional rewards were cued. These results conflict with contingency-based accounts using a traditional definition of contingency or a new causal learning model (ANCCR), but can be explained by temporal difference (TD) learning models equipped with an appropriate intertrial interval state representation. Recurrent neural networks trained within a TD framework develop state representations akin to our best 'handcrafted' model. Our findings suggest that the TD error can be a measure that describes both contingency and dopaminergic activity.

摘要

关联性学习依赖于偶然性，即一个刺激预测一个结果的程度。尽管其很重要，但将偶然性与行为联系起来的神经机制仍不清楚。在本研究中，我们在小鼠的经典条件性偶然性退化任务中，检测了腹侧纹状体中的多巴胺活性——一种与关联性学习有关的信号。我们发现，当额外奖励在无提示的情况下发放时，对条件刺激的预期舔舐和多巴胺反应均降低，但如果额外奖励有提示，则保持不变。这些结果与使用传统偶然性定义的基于偶然性的解释或新的因果学习模型（ANCCR）相冲突，但可以由配备适当试验间隔状态表征的时间差（TD）学习模型来解释。在TD框架内训练的循环神经网络会形成类似于我们最佳“手工制作”模型的状态表征。我们的研究结果表明，TD误差可以作为一种描述偶然性和多巴胺能活性的指标。

相似文献

Prospective contingency explains behavior and dopamine signals during associative learning.

Nat Neurosci. 2025 Mar 18. doi: 10.1038/s41593-025-01915-4.

The role of prospective contingency in the control of behavior and dopamine signals during associative learning.

bioRxiv. 2024 Feb 6:2024.02.05.578961. doi: 10.1101/2024.02.05.578961.

Cue and Reward Evoked Dopamine Activity Is Necessary for Maintaining Learned Pavlovian Associations.

J Neurosci. 2021 Jun 9;41(23):5004-5014. doi: 10.1523/JNEUROSCI.2744-20.2021. Epub 2021 Apr 22.

Mesostriatal dopamine is sensitive to changes in specific cue-reward contingencies.

Sci Adv. 2024 May 31;10(22):eadn4203. doi: 10.1126/sciadv.adn4203. Epub 2024 May 29.

Acute Stress Enhances Associative Learning via Dopamine Signaling in the Ventral Lateral Striatum.

J Neurosci. 2020 May 27;40(22):4391-4400. doi: 10.1523/JNEUROSCI.3003-19.2020. Epub 2020 Apr 22.

Value Modulation of Self-Defeating Impulsivity.

Biol Psychiatry. 2025 Jun 15;97(12):1186-1194. doi: 10.1016/j.biopsych.2024.09.017. Epub 2024 Sep 28.

Contributions of associative and non-associative learning to the dynamics of defensive ethograms.

Elife. 2024 Dec 16;12:RP90414. doi: 10.7554/eLife.90414.

Cue-Evoked Dopamine Promotes Conditioned Responding during Learning.

Neuron. 2020 Apr 8;106(1):142-153.e7. doi: 10.1016/j.neuron.2020.01.012. Epub 2020 Feb 5.

Impaired Pavlovian predictive learning between temporally phasic but not static events in autism-model strain mice.

Neurobiol Learn Mem. 2016 Oct;134 Pt B:304-16. doi: 10.1016/j.nlm.2016.08.001. Epub 2016 Aug 10.

Dopamine neurons drive spatiotemporally heterogeneous striatal dopamine signals during learning.

Curr Biol. 2024 Jul 22;34(14):3086-3101.e4. doi: 10.1016/j.cub.2024.05.069. Epub 2024 Jun 25.

引用本文的文献

Striatal Gradient in Value-Decay Explains Regional Differences in Dopamine Patterns and Reinforcement Learning Computations.

J Neurosci. 2025 Jul 18. doi: 10.1523/JNEUROSCI.0170-25.2025.

The devilish details affecting TDRL models in dopamine research.

Trends Cogn Sci. 2025 May;29(5):434-447. doi: 10.1016/j.tics.2025.02.001. Epub 2025 Feb 26.

本文引用的文献

Multi-timescale reinforcement learning in the brain.

Nature. 2025 Jun 4. doi: 10.1038/s41586-025-08929-9.

A statistical framework for analysis of trial-level temporal dynamics in fiber photometry experiments.

Elife. 2025 Mar 12;13:RP95802. doi: 10.7554/eLife.95802.

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types.

PLoS Comput Biol. 2024 Nov 19;20(11):e1012580. doi: 10.1371/journal.pcbi.1012580. eCollection 2024 Nov.

Mesostriatal dopamine is sensitive to changes in specific cue-reward contingencies.

Sci Adv. 2024 May 31;10(22):eadn4203. doi: 10.1126/sciadv.adn4203. Epub 2024 May 29.

A hippocampo-cortical pathway detects changes in the validity of an action as a predictor of reward.

Curr Biol. 2024 Jan 8;34(1):24-35.e4. doi: 10.1016/j.cub.2023.11.036. Epub 2023 Dec 14.

Emergence of belief-like representations through reinforcement learning.

PLoS Comput Biol. 2023 Sep 11;19(9):e1011067. doi: 10.1371/journal.pcbi.1011067. eCollection 2023 Sep.

Overlapping representations of food and social stimuli in mouse VTA dopamine neurons.

Neuron. 2023 Nov 15;111(22):3541-3553.e8. doi: 10.1016/j.neuron.2023.08.003. Epub 2023 Aug 31.

Causal implicatures from correlational statements.

PLoS One. 2023 May 18;18(5):e0286067. doi: 10.1371/journal.pone.0286067. eCollection 2023.

Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model.

Nat Neurosci. 2023 May;26(5):830-839. doi: 10.1038/s41593-023-01310-x. Epub 2023 Apr 20.

Learning about reward identities and time.

Behav Processes. 2023 Apr;207:104859. doi: 10.1016/j.beproc.2023.104859. Epub 2023 Mar 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

前瞻性偶然性解释了联想学习过程中的行为和多巴胺信号。

Prospective contingency explains behavior and dopamine signals during associative learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献