Nucleus accumbens dopamine release reflects Bayesian inference during instrumental learning.

Authors

Qü Albert J, Tai Lung-Hao, Hall Christopher D, Tu Emilie M, Eckstein Maria K, Mishchanchuk Karyna, Lin Wan Chen, Chase Juliana B, MacAskill Andrew F, Collins Anne G E, Gershman Samuel J, Wilbrecht Linda

Affiliations

Department of Psychology, University of California, Berkeley, CA, 94720, USA.

Center for Computational Biology, University of California, Berkeley, CA, 94720, USA.

Publication information

bioRxiv. 2024 Sep 13:2023.11.10.566306. doi: 10.1101/2023.11.10.566306.

DOI:10.1101/2023.11.10.566306

PMID:38014354

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10680647/

Abstract

Dopamine release in the nucleus accumbens has been hypothesized to signal reward prediction error, the difference between observed and predicted reward, suggesting a biological implementation for reinforcement learning. Rigorous tests of this hypothesis require assumptions about how the brain maps sensory signals to reward predictions, yet this mapping is still poorly understood. In particular, the mapping is non-trivial when sensory signals provide ambiguous information about the hidden state of the environment. Previous work using classical conditioning tasks has suggested that reward predictions are generated conditional on probabilistic beliefs about the hidden state, such that dopamine implicitly reflects these beliefs. Here we test this hypothesis in the context of an instrumental task (a two-armed bandit), where the hidden state switches repeatedly. We measured choice behavior and recorded dLight signals reflecting dopamine release in the nucleus accumbens core. Model comparison among a wide set of cognitive models based on the behavioral data favored models that used Bayesian updating of probabilistic beliefs. These same models also quantitatively matched the dopamine measurements better than non-Bayesian alternatives. We conclude that probabilistic belief computation contributes to instrumental task performance in mice and is reflected in mesolimbic dopamine signaling.
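
The two quantities the abstract relies on, a probabilistic belief about the hidden state and a reward prediction error computed from it, can be illustrated with a minimal sketch. This is not the authors' model: the function names, the reward probabilities (`P_HIGH`, `P_LOW`), and the per-trial switch probability (`HAZARD`) are hypothetical values chosen for illustration, and the sketch assumes that exactly one arm of the bandit is the high-reward arm at any time.

```python
# Illustrative parameters (assumptions, not taken from the paper).
P_HIGH = 0.75   # reward probability on the currently "good" arm
P_LOW = 0.25    # reward probability on the other arm
HAZARD = 0.05   # probability that the hidden state switches between trials


def expected_reward(belief_left: float, chose_left: bool) -> float:
    """Predicted reward for the chosen arm, given P(left arm is good)."""
    belief_chosen_good = belief_left if chose_left else 1.0 - belief_left
    return belief_chosen_good * P_HIGH + (1.0 - belief_chosen_good) * P_LOW


def reward_prediction_error(belief_left: float, chose_left: bool,
                            rewarded: bool) -> float:
    """Observed reward minus the belief-weighted predicted reward."""
    return float(rewarded) - expected_reward(belief_left, chose_left)


def update_belief(belief_left: float, chose_left: bool,
                  rewarded: bool) -> float:
    """One trial of Bayesian updating of P(left arm is good)."""
    # Reward probability of the chosen arm under each hidden state.
    p_rew_left_good = P_HIGH if chose_left else P_LOW
    p_rew_right_good = P_LOW if chose_left else P_HIGH
    # Likelihood of the observed outcome under each hidden state.
    like_left = p_rew_left_good if rewarded else 1.0 - p_rew_left_good
    like_right = p_rew_right_good if rewarded else 1.0 - p_rew_right_good
    # Bayes' rule: posterior over the hidden state after this outcome.
    evidence = like_left * belief_left + like_right * (1.0 - belief_left)
    posterior = like_left * belief_left / evidence
    # Allow for a possible state switch before the next trial.
    return posterior * (1.0 - HAZARD) + (1.0 - posterior) * HAZARD


if __name__ == "__main__":
    belief = 0.5  # start with no information about which arm is good
    for chose_left, rewarded in [(True, True), (True, True), (True, False)]:
        rpe = reward_prediction_error(belief, chose_left, rewarded)
        belief = update_belief(belief, chose_left, rewarded)
        print(f"RPE={rpe:+.3f}  P(left good)={belief:.3f}")
```

In this sketch the prediction error shrinks as the belief in the chosen arm grows, which is the sense in which a dopamine-like error signal would implicitly carry information about the belief.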

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe2/11423117/0c6bf9651ee2/nihpp-2023.11.10.566306v2-f0001.jpg

Similar articles

Nucleus accumbens dopamine release reflects Bayesian inference during instrumental learning.

PLoS Comput Biol. 2025 Jul 2;21(7):e1013226. doi: 10.1371/journal.pcbi.1013226. eCollection 2025 Jul.
