• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

政策复杂性会抑制多巴胺反应。

Policy complexity suppresses dopamine responses.

作者信息

Gershman Samuel J, Lak Armin

机构信息

Department of Psychology and Center for Brain Science, Harvard University.

Department of Physiology, Anatomy and Genetics, University of Oxford.

出版信息

bioRxiv. 2024 Sep 16:2024.09.15.613150. doi: 10.1101/2024.09.15.613150.

DOI:10.1101/2024.09.15.613150
PMID:39345642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11429712/
Abstract

Limits on information processing capacity impose limits on task performance. We show that animals achieve performance on a perceptual decision task that is near-optimal given their capacity limits, as measured by policy complexity (the mutual information between states and actions). This behavioral profile could be achieved by reinforcement learning with a penalty on high complexity policies, realized through modulation of dopaminergic learning signals. In support of this hypothesis, we find that policy complexity suppresses midbrain dopamine responses to reward outcomes, thereby reducing behavioral sensitivity to these outcomes. Our results suggest that policy compression shapes basic mechanisms of reinforcement learning in the brain.

摘要

信息处理能力的限制会对任务表现施加限制。我们表明,动物在感知决策任务上的表现接近其能力限制下的最优水平,这一能力限制通过策略复杂性(状态与动作之间的互信息)来衡量。这种行为表现可以通过对高复杂性策略施加惩罚的强化学习来实现,这种惩罚通过多巴胺能学习信号的调制来实现。为支持这一假设,我们发现策略复杂性会抑制中脑多巴胺对奖励结果的反应,从而降低行为对这些结果的敏感性。我们的结果表明,策略压缩塑造了大脑中强化学习的基本机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/165fce8a003c/nihpp-2024.09.15.613150v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/6ec483d60fc9/nihpp-2024.09.15.613150v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/c3f42ee834fa/nihpp-2024.09.15.613150v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/1dc2d483cf87/nihpp-2024.09.15.613150v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/165fce8a003c/nihpp-2024.09.15.613150v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/6ec483d60fc9/nihpp-2024.09.15.613150v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/c3f42ee834fa/nihpp-2024.09.15.613150v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/1dc2d483cf87/nihpp-2024.09.15.613150v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf91/11429712/165fce8a003c/nihpp-2024.09.15.613150v1-f0004.jpg

相似文献

1
Policy complexity suppresses dopamine responses.政策复杂性会抑制多巴胺反应。
bioRxiv. 2024 Sep 16:2024.09.15.613150. doi: 10.1101/2024.09.15.613150.
2
Policy Complexity Suppresses Dopamine Responses.政策复杂性抑制多巴胺反应。
J Neurosci. 2025 Feb 26;45(9):e1756242024. doi: 10.1523/JNEUROSCI.1756-24.2024.
3
Undermatching Is a Consequence of Policy Compression.政策压缩导致不匹配。
J Neurosci. 2023 Jan 18;43(3):447-457. doi: 10.1523/JNEUROSCI.1003-22.2022. Epub 2022 Dec 6.
4
Human decision making balances reward maximization and policy compression.人类决策平衡了奖励最大化和策略压缩。
PLoS Comput Biol. 2024 Apr 26;20(4):e1012057. doi: 10.1371/journal.pcbi.1012057. eCollection 2024 Apr.
5
Midbrain Dopamine Neurons Signal Belief in Choice Accuracy during a Perceptual Decision.中脑多巴胺神经元在知觉决策中对选择准确性的置信度进行信号传递。
Curr Biol. 2017 Mar 20;27(6):821-832. doi: 10.1016/j.cub.2017.02.026. Epub 2017 Mar 9.
6
From Prediction to Action: Dissociable Roles of Ventral Tegmental Area and Substantia Nigra Dopamine Neurons in Instrumental Reinforcement.从预测到行动:腹侧被盖区和黑质多巴胺神经元在工具性强化中的分离作用。
J Neurosci. 2023 May 24;43(21):3895-3908. doi: 10.1523/JNEUROSCI.0028-23.2023. Epub 2023 Apr 25.
7
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
8
Dopaminergic Balance between Reward Maximization and Policy Complexity.多巴胺能平衡奖励最大化和策略复杂性。
Front Syst Neurosci. 2011 May 9;5:22. doi: 10.3389/fnsys.2011.00022. eCollection 2011.
9
Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making.在基于奖励的决策过程中,人类纹状体中的强化学习信号可区分学习者和非学习者。
J Neurosci. 2007 Nov 21;27(47):12860-7. doi: 10.1523/JNEUROSCI.2496-07.2007.
10
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

本文引用的文献

1
Explaining dopamine through prediction errors and beyond.通过预测误差解释多巴胺及其他。
Nat Neurosci. 2024 Sep;27(9):1645-1655. doi: 10.1038/s41593-024-01705-4. Epub 2024 Jul 25.
2
Reward prediction error neurons implement an efficient code for reward.奖励预测误差神经元为奖励实施了一种有效的编码。
Nat Neurosci. 2024 Jul;27(7):1333-1339. doi: 10.1038/s41593-024-01671-x. Epub 2024 Jun 19.
3
Human decision making balances reward maximization and policy compression.人类决策平衡了奖励最大化和策略压缩。
PLoS Comput Biol. 2024 Apr 26;20(4):e1012057. doi: 10.1371/journal.pcbi.1012057. eCollection 2024 Apr.
4
Undermatching Is a Consequence of Policy Compression.政策压缩导致不匹配。
J Neurosci. 2023 Jan 18;43(3):447-457. doi: 10.1523/JNEUROSCI.1003-22.2022. Epub 2022 Dec 6.
5
Rational inattention and tonic dopamine.理性忽视与基础多巴胺。
PLoS Comput Biol. 2021 Mar 24;17(3):e1008659. doi: 10.1371/journal.pcbi.1008659. eCollection 2021 Mar.
6
Origin of perseveration in the trade-off between reward and complexity.在奖励和复杂性的权衡中坚持的起源。
Cognition. 2020 Nov;204:104394. doi: 10.1016/j.cognition.2020.104394. Epub 2020 Jul 14.
7
Dopaminergic and Prefrontal Basis of Learning from Sensory Confidence and Reward Value.从感觉置信度和奖励价值中学习的多巴胺能和前额叶基础。
Neuron. 2020 Feb 19;105(4):700-711.e6. doi: 10.1016/j.neuron.2019.11.018. Epub 2019 Dec 16.
8
Working Memory Load Strengthens Reward Prediction Errors.工作记忆负荷增强奖励预测误差。
J Neurosci. 2017 Apr 19;37(16):4332-4342. doi: 10.1523/JNEUROSCI.2700-16.2017. Epub 2017 Mar 20.
9
Dopamine Does Double Duty in Motivating Cognitive Effort.多巴胺在激发认知努力方面具有双重作用。
Neuron. 2016 Feb 17;89(4):695-710. doi: 10.1016/j.neuron.2015.12.029.
10
Dopamine, reward learning, and active inference.多巴胺、奖赏学习与主动推理
Front Comput Neurosci. 2015 Nov 4;9:136. doi: 10.3389/fncom.2015.00136. eCollection 2015.