• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自发行为是由强化而不是明确的奖励来结构化的。

Spontaneous behaviour is structured by reinforcement without explicit reward.

机构信息

Department of Neurobiology, Harvard Medical School, Boston, MA, USA.

Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, USA.

出版信息

Nature. 2023 Feb;614(7946):108-117. doi: 10.1038/s41586-022-05611-2. Epub 2023 Jan 18.

DOI:10.1038/s41586-022-05611-2
PMID:36653449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9892006/
Abstract

Spontaneous animal behaviour is built from action modules that are concatenated by the brain into sequences. However, the neural mechanisms that guide the composition of naturalistic, self-motivated behaviour remain unknown. Here we show that dopamine systematically fluctuates in the dorsolateral striatum (DLS) as mice spontaneously express sub-second behavioural modules, despite the absence of task structure, sensory cues or exogenous reward. Photometric recordings and calibrated closed-loop optogenetic manipulations during open field behaviour demonstrate that DLS dopamine fluctuations increase sequence variation over seconds, reinforce the use of associated behavioural modules over minutes, and modulate the vigour with which modules are expressed, without directly influencing movement initiation or moment-to-moment kinematics. Although the reinforcing effects of optogenetic DLS dopamine manipulations vary across behavioural modules and individual mice, these differences are well predicted by observed variation in the relationships between endogenous dopamine and module use. Consistent with the possibility that DLS dopamine fluctuations act as a teaching signal, mice build sequences during exploration as if to maximize dopamine. Together, these findings suggest a model in which the same circuits and computations that govern action choices in structured tasks have a key role in sculpting the content of unconstrained, high-dimensional, spontaneous behaviour.

摘要

自发性动物行为是由大脑将动作模块串联而成的。然而,引导自然动机行为组成的神经机制仍然未知。在这里,我们表明,尽管没有任务结构、感官提示或外源性奖励,多巴胺在背外侧纹状体(DLS)中会系统地波动,因为老鼠会自发地表达亚秒级的行为模块。在开放场行为期间进行的光度记录和校准的闭环光遗传学操作表明,DLS 多巴胺波动会在数秒内增加序列变化,在数分钟内增强与行为模块相关的使用,调节模块表达的活力,而不会直接影响运动启动或瞬间运动学。尽管光遗传学 DLS 多巴胺操作的强化作用因行为模块和个体老鼠而异,但这些差异与内源性多巴胺与模块使用之间的关系的观察到的变化很好地预测。与 DLS 多巴胺波动作为教学信号的可能性一致,老鼠在探索期间构建序列,好像是为了最大化多巴胺。总之,这些发现表明了一个模型,即控制结构化任务中动作选择的相同电路和计算在塑造无约束、高维、自发行为的内容方面具有关键作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/99c920b0fd02/41586_2022_5611_Fig15_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/a7aa0f5a0c33/41586_2022_5611_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/88781288c9d2/41586_2022_5611_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f393fe6a3dc3/41586_2022_5611_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/954d1cc3c767/41586_2022_5611_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/5b2800753823/41586_2022_5611_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/93fc5035dd0d/41586_2022_5611_Fig6_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/aa5c77d55100/41586_2022_5611_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f6f83786c153/41586_2022_5611_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/de3b2ca06171/41586_2022_5611_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/7e9fef5b4fb6/41586_2022_5611_Fig10_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/2cbb7192a0f8/41586_2022_5611_Fig11_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f1c647064cde/41586_2022_5611_Fig12_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/2fdbaa6ec480/41586_2022_5611_Fig13_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/33bee2069d4f/41586_2022_5611_Fig14_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/99c920b0fd02/41586_2022_5611_Fig15_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/a7aa0f5a0c33/41586_2022_5611_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/88781288c9d2/41586_2022_5611_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f393fe6a3dc3/41586_2022_5611_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/954d1cc3c767/41586_2022_5611_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/5b2800753823/41586_2022_5611_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/93fc5035dd0d/41586_2022_5611_Fig6_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/aa5c77d55100/41586_2022_5611_Fig7_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f6f83786c153/41586_2022_5611_Fig8_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/de3b2ca06171/41586_2022_5611_Fig9_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/7e9fef5b4fb6/41586_2022_5611_Fig10_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/2cbb7192a0f8/41586_2022_5611_Fig11_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/f1c647064cde/41586_2022_5611_Fig12_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/2fdbaa6ec480/41586_2022_5611_Fig13_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/33bee2069d4f/41586_2022_5611_Fig14_ESM.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9efd/9892006/99c920b0fd02/41586_2022_5611_Fig15_ESM.jpg

相似文献

1
Spontaneous behaviour is structured by reinforcement without explicit reward.自发行为是由强化而不是明确的奖励来结构化的。
Nature. 2023 Feb;614(7946):108-117. doi: 10.1038/s41586-022-05611-2. Epub 2023 Jan 18.
2
Dynamic behaviour restructuring mediates dopamine-dependent credit assignment.动态行为重构介导多巴胺依赖的信用分配。
Nature. 2024 Feb;626(7999):583-592. doi: 10.1038/s41586-023-06941-5. Epub 2023 Dec 13.
3
Regionally distinct phasic dopamine release patterns in the striatum during reversal learning.在逆向学习过程中,纹状体内区域特异性的阶段性多巴胺释放模式。
Neuroscience. 2017 Mar 14;345:110-123. doi: 10.1016/j.neuroscience.2016.05.011. Epub 2016 May 13.
4
The Striatum Organizes 3D Behavior via Moment-to-Moment Action Selection.纹状体通过即时动作选择来组织 3D 行为。
Cell. 2018 Jun 28;174(1):44-58.e17. doi: 10.1016/j.cell.2018.04.019. Epub 2018 May 17.
5
Mesolimbic dopamine adapts the rate of learning from action.中脑边缘多巴胺适应动作学习的速度。
Nature. 2023 Feb;614(7947):294-302. doi: 10.1038/s41586-022-05614-z. Epub 2023 Jan 18.
6
Dopamine signaling in the dorsomedial striatum promotes compulsive behavior.背侧纹状体中的多巴胺信号促进强迫行为。
Curr Biol. 2022 Mar 14;32(5):1175-1188.e5. doi: 10.1016/j.cub.2022.01.055. Epub 2022 Feb 7.
7
Dissociable contributions of phasic dopamine activity to reward and prediction.相位多巴胺活动对奖励和预测的可分离贡献。
Cell Rep. 2021 Sep 7;36(10):109684. doi: 10.1016/j.celrep.2021.109684.
8
Sensory Cues Potentiate VTA Dopamine Mediated Reinforcement.感觉线索增强 VTA 多巴胺介导的强化。
eNeuro. 2024 Feb 15;11(2). doi: 10.1523/ENEURO.0421-23.2024. Print 2024 Feb.
9
Nigrostriatal dopamine signals sequence-specific action-outcome prediction errors.黑质纹状体多巴胺信号序列特异性动作-结果预测误差。
Curr Biol. 2021 Dec 6;31(23):5350-5363.e5. doi: 10.1016/j.cub.2021.09.040. Epub 2021 Oct 11.
10
Striatal dopamine ramping may indicate flexible reinforcement learning with forgetting in the cortico-basal ganglia circuits.纹状体多巴胺爬坡可能表明皮质基底神经节回路具有灵活的强化学习和遗忘能力。
Front Neural Circuits. 2014 Apr 9;8:36. doi: 10.3389/fncir.2014.00036. eCollection 2014.

引用本文的文献

1
Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning.正确性本身就是一种回报:在自我引导的强化学习中引导误差信号。
bioRxiv. 2025 Aug 19:2025.07.18.665446. doi: 10.1101/2025.07.18.665446.
2
Mesolimbic dopamine ramps reflect environmental timescales.中脑边缘多巴胺信号增强反映环境时间尺度。
Elife. 2025 Aug 29;13:RP98666. doi: 10.7554/eLife.98666.
3
A study of animal action segmentation algorithms across supervised, unsupervised, and semi-supervised learning paradigms.一项针对监督学习、无监督学习和半监督学习范式下动物动作分割算法的研究。
Neuron Behav Data Anal Theory. 2024;2024. doi: 10.51628/001c.127770. Epub 2024 Dec 20.
4
Individual differences in decision-making shape how mesolimbic dopamine regulates choice confidence and change-of-mind.决策过程中的个体差异塑造了中脑边缘多巴胺调节选择信心和改变想法的方式。
Nat Neurosci. 2025 Jul 30. doi: 10.1038/s41593-025-02015-z.
5
Subsecond Analysis of Locomotor Activity in Parkinsonian Mice.帕金森病小鼠运动活动的亚秒级分析
eNeuro. 2025 Aug 5;12(8). doi: 10.1523/ENEURO.0014-25.2025. Print 2025 Aug.
6
Spontaneous movements and their relationship to neural activity fluctuate with latent engagement states.自发运动及其与神经活动的关系会随着潜在参与状态而波动。
Neuron. 2025 Jun 30. doi: 10.1016/j.neuron.2025.06.001.
7
Automated detection of complex zebrafish seizure behavior at scale.大规模自动检测复杂的斑马鱼癫痫行为。
Commun Biol. 2025 Jun 5;8(1):872. doi: 10.1038/s42003-025-08310-6.
8
Alcohol effects on associative and sensorimotor cortico-thalamo-basal ganglia circuits alter decision making and alcohol intake.酒精对联合及感觉运动皮质-丘脑-基底神经节回路的影响会改变决策和酒精摄入量。
Alcohol. 2025 Sep;127:21-46. doi: 10.1016/j.alcohol.2025.05.005. Epub 2025 May 31.
9
Inferring neural population codes for acoustic communication.推断用于声学通信的神经群体编码。
Proc Natl Acad Sci U S A. 2025 May 27;122(21):e2417733122. doi: 10.1073/pnas.2417733122. Epub 2025 May 19.
10
Dopaminergic action prediction errors serve as a value-free teaching signal.多巴胺能动作预测误差作为一种无价值的教学信号。
Nature. 2025 May 14. doi: 10.1038/s41586-025-09008-9.