• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自然行为是通过多巴胺介导的强化作用习得的。

Natural behaviour is learned through dopamine-mediated reinforcement.

作者信息

Kasdin Jonathan, Duffy Alison, Nadler Nathan, Raha Arnav, Fairhall Adrienne L, Stachenfeld Kimberly L, Gadagkar Vikram

机构信息

Department of Neuroscience, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA.

Department of Neurobiology and Biophysics and Computational Neuroscience Center, University of Washington, Seattle, WA, USA.

出版信息

Nature. 2025 May;641(8063):699-706. doi: 10.1038/s41586-025-08729-1. Epub 2025 Mar 12.

DOI:10.1038/s41586-025-08729-1
PMID:40074908
Abstract

Many natural motor skills, such as speaking or locomotion, are acquired through a process of trial-and-error learning over the course of development. It has long been hypothesized, motivated by observations in artificial learning experiments, that dopamine has a crucial role in this process. Dopamine in the basal ganglia is thought to guide reward-based trial-and-error learning by encoding reward prediction errors, decreasing after worse-than-predicted reward outcomes and increasing after better-than-predicted ones. Our previous work in adult zebra finches-in which we changed the perceived song quality with distorted auditory feedback-showed that dopamine in Area X, the singing-related basal ganglia, encodes performance prediction error: dopamine is suppressed after worse-than-predicted (distorted syllables) and activated after better-than-predicted (undistorted syllables) performance. However, it remains unknown whether the learning of natural behaviours, such as developmental vocal learning, occurs through dopamine-based reinforcement. Here we tracked song learning trajectories in juvenile zebra finches and used fibre photometry to monitor concurrent dopamine activity in Area X. We found that dopamine was activated after syllable renditions that were closer to the eventual adult version of the song, compared with recent renditions, and suppressed after renditions that were further away. Furthermore, the relationship between dopamine and song fluctuations revealed that dopamine predicted the future evolution of song, suggesting that dopamine drives behaviour. Finally, dopamine activity was explained by the contrast between the quality of the current rendition and the recent history of renditions-consistent with dopamine's hypothesized role in encoding prediction errors in an actor-critic reinforcement-learning model. Reinforcement-learning algorithms have emerged as a powerful class of model to explain learning in reward-based laboratory tasks, as well as for driving autonomous learning in artificial intelligence. Our results suggest that complex natural behaviours in biological systems can also be acquired through dopamine-mediated reinforcement learning.

摘要

许多自然运动技能,如说话或移动,是在发育过程中通过试错学习过程获得的。长期以来,受人工学习实验观察结果的启发,人们一直假设多巴胺在这一过程中起着关键作用。基底神经节中的多巴胺被认为通过编码奖励预测误差来指导基于奖励的试错学习,在奖励结果比预期差时减少,在奖励结果比预期好时增加。我们之前在成年斑胸草雀身上的研究——我们通过扭曲的听觉反馈改变了感知到的歌声质量——表明,与唱歌相关的基底神经节X区域中的多巴胺编码表现预测误差:在表现比预期差(音节扭曲)后多巴胺被抑制,在表现比预期好(音节未扭曲)后被激活。然而,自然行为的学习,如发育性发声学习,是否通过基于多巴胺的强化来发生仍然未知。在这里,我们追踪了幼年斑胸草雀的歌声学习轨迹,并使用光纤光度法监测X区域中同时发生的多巴胺活动。我们发现,与最近的演唱相比,当音节演唱更接近歌曲最终的成年版本时,多巴胺会被激活,而在距离更远的演唱后会被抑制。此外,多巴胺与歌声波动之间的关系表明,多巴胺预测了歌声的未来演变,这表明多巴胺驱动行为。最后,多巴胺活动可以通过当前演唱质量与近期演唱历史之间的对比来解释——这与多巴胺在演员-评论家强化学习模型中编码预测误差的假设作用一致。强化学习算法已成为一类强大的模型,用于解释基于奖励的实验室任务中的学习,以及驱动人工智能中的自主学习。我们的结果表明,生物系统中的复杂自然行为也可以通过多巴胺介导的强化学习来获得。

相似文献

1
Natural behaviour is learned through dopamine-mediated reinforcement.自然行为是通过多巴胺介导的强化作用习得的。
Nature. 2025 May;641(8063):699-706. doi: 10.1038/s41586-025-08729-1. Epub 2025 Mar 12.
2
Dual neuromodulatory dynamics underlie birdsong learning.双重神经调节动力学是鸟鸣学习的基础。
Nature. 2025 May;641(8063):690-698. doi: 10.1038/s41586-025-08694-9. Epub 2025 Mar 12.
3
Short-Term Memory Impairment短期记忆障碍
4
Vocal constraints on song amplitude in star finches .星雀歌声振幅的发声限制
PeerJ. 2025 Jul 10;13:e19705. doi: 10.7717/peerj.19705. eCollection 2025.
5
Maternal behavior influences vocal practice and learning processes in the greater sac-winged bat.母性行为会影响大耳囊翼蝠的发声练习和学习过程。
Elife. 2025 May 13;13:RP99474. doi: 10.7554/eLife.99474.
6
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
A Spectrum of Understanding: A Qualitative Exploration of Autistic Adults' Understandings and Perceptions of Friendship(s).理解的光谱:对自闭症成年人对友谊的理解与认知的质性探索
Autism Adulthood. 2024 Dec 2;6(4):438-450. doi: 10.1089/aut.2023.0051. eCollection 2024 Dec.
9
Sexual Harassment and Prevention Training性骚扰与预防培训
10
Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力:系统评价和网络荟萃分析。
Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.

引用本文的文献

1
Correctness is its own reward: bootstrapping error signals in self-guided reinforcement learning.正确性本身就是一种回报:在自我引导的强化学习中引导误差信号。
bioRxiv. 2025 Aug 19:2025.07.18.665446. doi: 10.1101/2025.07.18.665446.

本文引用的文献

1
Transient sensorimotor projections in the developmental song learning period.发育期歌唱学习期间短暂的感觉运动投射。
Cell Rep. 2024 May 28;43(5):114196. doi: 10.1016/j.celrep.2024.114196. Epub 2024 May 7.
2
Learning the sound inventory of a complex vocal skill via an intrinsic reward.通过内在奖励学习复杂声音技能的音库。
Sci Adv. 2024 Mar 29;10(13):eadj3824. doi: 10.1126/sciadv.adj3824. Epub 2024 Mar 27.
3
Daily vocal exercise is necessary for peak performance singing in a songbird.日常发声练习对鸣禽的歌唱巅峰表现是必要的。
Nat Commun. 2023 Dec 12;14(1):7787. doi: 10.1038/s41467-023-43592-6.
4
Improved green and red GRAB sensors for monitoring dopaminergic activity in vivo.用于监测体内多巴胺能活动的改良绿色和红色 GRAB 传感器。
Nat Methods. 2024 Apr;21(4):680-691. doi: 10.1038/s41592-023-02100-w. Epub 2023 Nov 30.
5
Dopaminergic error signals retune to social feedback during courtship.多巴胺能错误信号在求爱期间重新调整到社会反馈。
Nature. 2023 Nov;623(7986):375-380. doi: 10.1038/s41586-023-06580-w. Epub 2023 Sep 27.
6
Spontaneous behaviour is structured by reinforcement without explicit reward.自发行为是由强化而不是明确的奖励来结构化的。
Nature. 2023 Feb;614(7946):108-117. doi: 10.1038/s41586-022-05611-2. Epub 2023 Jan 18.
7
Birdsong neuroscience and the evolutionary substrates of learned vocalization.鸟鸣神经科学与习得性发声的进化基础。
Trends Neurosci. 2023 Feb;46(2):97-99. doi: 10.1016/j.tins.2022.11.005. Epub 2022 Dec 12.
8
Discovering faster matrix multiplication algorithms with reinforcement learning.用强化学习发现更快的矩阵乘法算法。
Nature. 2022 Oct;610(7930):47-53. doi: 10.1038/s41586-022-05172-4. Epub 2022 Oct 5.
9
Dopamine neurons evaluate natural fluctuations in performance quality.多巴胺神经元评估表现质量的自然波动。
Cell Rep. 2022 Mar 29;38(13):110574. doi: 10.1016/j.celrep.2022.110574.
10
Fast and accurate annotation of acoustic signals with deep neural networks.使用深度神经网络快速准确地标注声信号。
Elife. 2021 Nov 1;10:e68837. doi: 10.7554/eLife.68837.