Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model.

Affiliations

Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD, USA.

Psychology Department, Princeton University, Princeton, NJ, USA.

Publication information

Nat Neurosci. 2023 May;26(5):830-839. doi: 10.1038/s41593-023-01310-x. Epub 2023 Apr 20.

DOI:10.1038/s41593-023-01310-x

PMID:37081296

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10646487/

Abstract

Dopamine neuron activity is tied to the prediction error in temporal difference reinforcement learning models. These models make significant simplifying assumptions, particularly with regard to the structure of the predictions fed into the dopamine neurons, which consist of a single chain of timepoint states. Although this predictive structure can explain error signals observed in many studies, it cannot cope with settings where subjects might infer multiple independent events and outcomes. In the present study, we recorded dopamine neurons in the ventral tegmental area in such a setting to test the validity of the single-stream assumption. Rats were trained in an odor-based choice task, in which the timing and identity of one of several rewards delivered in each trial changed across trial blocks. This design revealed an error signaling pattern that requires the dopamine neurons to access and update multiple independent predictive streams reflecting the subject's belief about timing and potentially unique identities of expected rewards.
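The single-chain assumption the abstract refers to can be illustrated with a minimal tabular temporal difference sketch. This is not the authors' model or analysis code: the function names, trial length, learning rate, reward times, and the convention that reward is received on leaving a timepoint state are all assumptions made for illustration. It shows how one chain of timepoint states yields a negative error at the old reward time and a positive error at the new one when reward timing shifts.

```python
# Illustrative sketch only: a single chain of timepoint states with tabular TD learning.
# All names and parameter values here are hypothetical, not taken from the paper.

def td_errors(values, rewards, gamma=1.0):
    """Return the TD error at each timepoint of one trial.

    values[t]  : learned value of timepoint state t
    rewards[t] : reward received on leaving timepoint state t (assumed convention)
    """
    errors = []
    for t in range(len(values)):
        # The value after the final timepoint is taken to be zero.
        next_value = values[t + 1] if t + 1 < len(values) else 0.0
        errors.append(rewards[t] + gamma * next_value - values[t])
    return errors


def train(rewards, alpha=0.1, n_trials=500, gamma=1.0):
    """Learn timepoint values by repeating identical trials.

    Errors are computed once per trial and then applied to every state.
    """
    values = [0.0] * len(rewards)
    for _ in range(n_trials):
        for t, delta in enumerate(td_errors(values, rewards, gamma)):
            values[t] += alpha * delta
    return values


T = 10                                   # number of timepoint states per trial
early = [0.0] * T
early[3] = 1.0                           # reward at timepoint 3 during training
late = [0.0] * T
late[7] = 1.0                            # reward moved to timepoint 7 after the block switch

values = train(early)                    # values learned under the early-reward schedule
errors_after_shift = td_errors(values, late)

for t, delta in enumerate(errors_after_shift):
    print(t, round(delta, 3))
```

With these assumed settings the chain carries one scalar prediction per timepoint, so it has no way to represent separate expectations for several independently timed or distinct rewards, which is the limitation the study sets out to test.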

Similar articles

Dopaminergic prediction errors in the ventral tegmental area reflect a multithreaded predictive model.

Nat Neurosci. 2023 May;26(5):830-839. doi: 10.1038/s41593-023-01310-x. Epub 2023 Apr 20.

Disentangling prediction error and value in a formal test of dopamine's role in reinforcement learning.

Curr Biol. 2025 Aug 18;35(16):4019-4027.e7. doi: 10.1016/j.cub.2025.06.076. Epub 2025 Jul 29.

Prescription of Controlled Substances: Benefits and Risks

Dopamine neurons drive spatiotemporally heterogeneous striatal dopamine signals during learning.

Curr Biol. 2024 Jul 22;34(14):3086-3101.e4. doi: 10.1016/j.cub.2024.05.069. Epub 2024 Jun 25.

A multidimensional distributional map of future reward in dopamine neurons.

Nature. 2025 Jun;642(8068):691-699. doi: 10.1038/s41586-025-09089-6. Epub 2025 Jun 4.

Flexible updating of reward and punishment contingencies by VTA GABA neurons.

Curr Biol. 2025 Aug 18;35(16):3973-3985.e3. doi: 10.1016/j.cub.2025.07.021. Epub 2025 Jul 31.

Natural behaviour is learned through dopamine-mediated reinforcement.

Nature. 2025 May;641(8063):699-706. doi: 10.1038/s41586-025-08729-1. Epub 2025 Mar 12.

Short-Term Memory Impairment

Multi-timescale reinforcement learning in the brain.

Nature. 2025 Jun 4. doi: 10.1038/s41586-025-08929-9.

Medial septum activation improves strategy switching once strategies are well-learned via bidirectional regulation of dopamine neuron population activity.

Neuropsychopharmacology. 2022 Nov;47(12):2090-2100. doi: 10.1038/s41386-022-01387-1. Epub 2022 Jul 23.

Cited by

Mesolimbic dopamine ramps reflect environmental timescales.

Elife. 2025 Aug 29;13:RP98666. doi: 10.7554/eLife.98666.

Striatal dopamine signals errors in prediction across different informational domains.

Sci Adv. 2025 Jul 11;11(28):eadq9684. doi: 10.1126/sciadv.adq9684. Epub 2025 Jul 9.

Nucleus accumbens dopamine release reflects Bayesian inference during instrumental learning.

PLoS Comput Biol. 2025 Jul 2;21(7):e1013226. doi: 10.1371/journal.pcbi.1013226. eCollection 2025 Jul.

A multidimensional distributional map of future reward in dopamine neurons.

Nature. 2025 Jun;642(8068):691-699. doi: 10.1038/s41586-025-09089-6. Epub 2025 Jun 4.

Prospective contingency explains behavior and dopamine signals during associative learning.

Nat Neurosci. 2025 Mar 18. doi: 10.1038/s41593-025-01915-4.

The devilish details affecting TDRL models in dopamine research.

Trends Cogn Sci. 2025 May;29(5):434-447. doi: 10.1016/j.tics.2025.02.001. Epub 2025 Feb 26.

The curious case of dopaminergic prediction errors and learning associative information beyond value.

Nat Rev Neurosci. 2025 Mar;26(3):169-178. doi: 10.1038/s41583-024-00898-8. Epub 2025 Jan 8.

Dopaminergic responses to identity prediction errors depend differently on the orbitofrontal cortex and hippocampus.

bioRxiv. 2024 Dec 17:2024.12.11.628003. doi: 10.1101/2024.12.11.628003.

Generalized cue reactivity in rat dopamine neurons after opioids.

Nat Commun. 2025 Jan 2;16(1):321. doi: 10.1038/s41467-024-55504-3.

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types.

PLoS Comput Biol. 2024 Nov 19;20(11):e1012580. doi: 10.1371/journal.pcbi.1012580. eCollection 2024 Nov.

References

A Unified Framework for Dopamine Signals across Timescales.

Cell. 2020 Dec 10;183(6):1600-1616.e25. doi: 10.1016/j.cell.2020.11.013. Epub 2020 Nov 27.

Dopamine signals as temporal difference errors: recent advances.

Curr Opin Neurobiol. 2021 Apr;67:95-105. doi: 10.1016/j.conb.2020.08.014. Epub 2020 Nov 10.

Dopamine neuron ensembles signal the content of sensory prediction errors.

Elife. 2019 Nov 1;8:e49315. doi: 10.7554/eLife.49315.

Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions.

Curr Biol. 2019 Jan 7;29(1):93-103.e3. doi: 10.1016/j.cub.2018.11.050. Epub 2018 Dec 20.

Identity prediction errors in the human midbrain update reward-identity expectations in the orbitofrontal cortex.

Nat Commun. 2018 Apr 23;9(1):1611. doi: 10.1038/s41467-018-04055-5.

The Medial Prefrontal Cortex Shapes Dopamine Reward Prediction Errors under State Uncertainty.

Neuron. 2018 May 2;98(3):616-629.e6. doi: 10.1016/j.neuron.2018.03.036. Epub 2018 Apr 12.

Optogenetic Blockade of Dopamine Transients Prevents Learning Induced by Changes in Reward Features.

Curr Biol. 2017 Nov 20;27(22):3480-3486.e3. doi: 10.1016/j.cub.2017.09.049. Epub 2017 Nov 2.

Model-based predictions for dopamine.

Curr Opin Neurobiol. 2018 Apr;49:1-7. doi: 10.1016/j.conb.2017.10.006. Epub 2017 Oct 31.

Dopamine Neurons Respond to Errors in the Prediction of Sensory Features of Expected Rewards.

Neuron. 2017 Sep 13;95(6):1395-1405.e3. doi: 10.1016/j.neuron.2017.08.025.

Neural Circuitry of Reward Prediction Error.

Annu Rev Neurosci. 2017 Jul 25;40:373-394. doi: 10.1146/annurev-neuro-072116-031109. Epub 2017 Apr 24.
