
Dopaminergic action prediction errors serve as a value-free teaching signal.

Authors

Greenstreet Francesca, Vergara Hernando Martinez, Johansson Yvonne, Pati Sthitapranjya, Schwarz Laura, Lenzi Stephen C, Geerts Jesse P, Wisdom Matthew, Gubanova Alina, Rollik Lars B, Kaur Jasvin, Moskovitz Theodore, Cohen Joseph, Thompson Emmett, Margrie Troy W, Clopath Claudia, Stephenson-Jones Marcus

Affiliations

Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK.

Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain.

Publication information

Nature. 2025 May 14. doi: 10.1038/s41586-025-09008-9.

DOI:10.1038/s41586-025-09008-9
PMID:40369067
Abstract

Choice behaviour of animals is characterized by two main tendencies: taking actions that led to rewards and repeating past actions. Theory suggests that these strategies may be reinforced by different types of dopaminergic teaching signals: reward prediction error to reinforce value-based associations and movement-based action prediction errors to reinforce value-free repetitive associations. Here we use an auditory discrimination task in mice to show that movement-related dopamine activity in the tail of the striatum encodes the hypothesized action prediction error signal. Causal manipulations reveal that this prediction error serves as a value-free teaching signal that supports learning by reinforcing repeated associations. Computational modelling and experiments demonstrate that action prediction errors alone cannot support reward-guided learning, but when paired with the reward prediction error circuitry they serve to consolidate stable sound-action associations in a value-free manner. Together we show that there are two types of dopaminergic prediction errors that work in tandem to support learning, each reinforcing different types of association in different striatal areas.
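The contrast between the two teaching signals described in the abstract can be illustrated with a toy example. The following is a minimal sketch, not the authors' model: it assumes a simple delta-rule learner, and the function names (`rpe_update`, `ape_update`), the tables `q` and `h`, and the learning rate are all hypothetical. The value-based update moves a value estimate toward the received reward, while the value-free update moves an association strength toward the action that was actually taken, regardless of outcome.

```python
def rpe_update(q, state, action, reward, lr=0.1):
    """Value-based update: move q[state][action] toward the received reward."""
    delta = reward - q[state][action]  # reward prediction error
    q[state][action] += lr * delta
    return delta


def ape_update(h, state, action, lr=0.1):
    """Value-free update: move h[state] toward the action actually taken."""
    delta_taken = 0.0
    for a in range(len(h[state])):
        target = 1.0 if a == action else 0.0  # 1 for the taken action, else 0
        delta = target - h[state][a]          # action prediction error
        h[state][a] += lr * delta
        if a == action:
            delta_taken = delta
    return delta_taken


# Two sounds (states) and two actions; all names and sizes are illustrative.
q = [[0.0, 0.0], [0.0, 0.0]]  # value estimates
h = [[0.0, 0.0], [0.0, 0.0]]  # value-free association strengths

# Repeat the same action to sound 0 fifty times, never rewarded.
for _ in range(50):
    rpe_update(q, 0, 1, reward=0.0)
    last_ape = ape_update(h, 0, 1)

print(q[0][1], h[0][1], last_ape)
```

In this toy run the value estimate stays at zero because no reward is ever delivered, whereas the association strength for the repeated action approaches one and the action prediction error shrinks toward zero. That is the sense in which such a signal is value-free: it reinforces whatever is repeated, so on its own it cannot steer behaviour toward reward.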



