Dopamine neurons encode trial-by-trial subjective reward value in an auction-like task.

Authors

Hill Daniel F, Hickman Robert W, Al-Mohammad Alaa, Stasiak Arkadiusz, Schultz Wolfram

Affiliation

Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3DY, United Kingdom.

Publication

bioRxiv. 2024 May 10:2023.01.20.524896. doi: 10.1101/2023.01.20.524896.

DOI: 10.1101/2023.01.20.524896
PMID: 36711724
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9882283/
Abstract

The dopamine reward prediction error signal is known to be subjective but has so far only been assessed in aggregate choices. However, personal choices fluctuate across trials and thus reflect the instantaneous subjective reward value. In the well-established Becker-DeGroot-Marschak (BDM) auction-like mechanism, participants are encouraged to place bids that accurately reveal their instantaneous subjective reward value; inaccurate bidding results in suboptimal reward ('incentive compatibility'). In our experiment, male rhesus monkeys became experienced over several years to place accurate BDM bids for juice rewards without specific external constraints. Their bids for physically identical rewards varied trial by trial and increased overall for larger rewards. In these highly experienced animals, responses of midbrain dopamine neurons followed the trial-by-trial variations of bids despite constant, explicitly predicted reward amounts. Inversely, dopamine responses were similar with similar bids for different physical reward amounts. Support Vector Regression demonstrated accurate prediction of the animals' bids by as few as twenty dopamine neurons. Thus, the phasic dopamine reward signal reflects instantaneous subjective reward value.
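
The incentive compatibility of the BDM mechanism mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' task code. It assumes a simplified setting that the abstract does not specify: the competing computer price is drawn uniformly from 0 to the budget, the bidder wins when the bid is at least that price, and the winner pays the price rather than the bid. The names `expected_payoff` and `best_bid` and the normalized units are hypothetical.

```python
def expected_payoff(bid, value, budget=1.0):
    """Expected net payoff of a BDM bid (illustrative assumption:
    the computer price c is uniform on [0, budget])."""
    bid = min(max(bid, 0.0), budget)  # bids are limited to the budget
    # The bidder wins whenever c <= bid and then pays c, not the bid.
    # Integrating (value - c) / budget over c in [0, bid] gives:
    return (value * bid - 0.5 * bid * bid) / budget


def best_bid(value, budget=1.0, steps=100):
    """Grid search for the bid with the highest expected payoff."""
    grid = [budget * i / steps for i in range(steps + 1)]
    return max(grid, key=lambda b: expected_payoff(b, value, budget))


if __name__ == "__main__":
    for value in (0.2, 0.5, 0.8):
        print(value, best_bid(value))
```

Under these assumptions the expected payoff is a downward-opening parabola in the bid with its peak at the subjective value, so bidding above or below one's own value lowers the expected outcome. This is the sense in which inaccurate bidding results in suboptimal reward.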

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac6e/11105861/94d1a50f8c2a/nihpp-2023.01.20.524896v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac6e/11105861/2e749431cd8e/nihpp-2023.01.20.524896v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac6e/11105861/769d6571da7c/nihpp-2023.01.20.524896v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac6e/11105861/b9509006c63b/nihpp-2023.01.20.524896v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac6e/11105861/a1e893f67217/nihpp-2023.01.20.524896v2-f0005.jpg

Similar articles

1
Dopamine neurons encode trial-by-trial subjective reward value in an auction-like task.
bioRxiv. 2024 May 10:2023.01.20.524896. doi: 10.1101/2023.01.20.524896.
2
Dopamine neurons encode trial-by-trial subjective reward value in an auction-like task.
Nat Commun. 2024 Sep 17;15(1):8138. doi: 10.1038/s41467-024-52311-8.
3
Reward Value Revealed by Auction in Rhesus Monkeys.
J Neurosci. 2022 Feb 23;42(8):1510-1528. doi: 10.1523/JNEUROSCI.1275-21.2021. Epub 2021 Dec 22.
4
Dopamine prediction error responses integrate subjective value from different reward dimensions.
Proc Natl Acad Sci U S A. 2014 Feb 11;111(6):2343-8. doi: 10.1073/pnas.1321596111. Epub 2014 Jan 22.
5
A comparison of reward processing during Becker-DeGroot-Marschak and Vickrey auctions: An ERP study.
Psychophysiology. 2023 Sep;60(9):e14313. doi: 10.1111/psyp.14313. Epub 2023 Apr 19.
6
Monetary, Food, and Social Rewards Induce Similar Pavlovian-to-Instrumental Transfer Effects.
Front Behav Neurosci. 2017 Jan 4;10:247. doi: 10.3389/fnbeh.2016.00247. eCollection 2016.
7
Dopamine signals for reward value and risk: basic and recent data.
Behav Brain Funct. 2010 Apr 23;6:24. doi: 10.1186/1744-9081-6-24.
8
Midbrain dopamine neurons encode a quantitative reward prediction error signal.
Neuron. 2005 Jul 7;47(1):129-41. doi: 10.1016/j.neuron.2005.05.020.
9
Dopamine neurons learn to encode the long-term value of multiple future rewards.
Proc Natl Acad Sci U S A. 2011 Sep 13;108(37):15462-7. doi: 10.1073/pnas.1014457108. Epub 2011 Sep 6.
10
Dopamine neurons learn relative chosen value from probabilistic rewards.
Elife. 2016 Oct 27;5:e18044. doi: 10.7554/eLife.18044.
