• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鸣禽习得强化学习模型中的权重转移

Weight Transfer in the Reinforcement Learning Model of Songbird Acquisition.

作者信息

Tran Khue, Koulakov Alexei

机构信息

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.

Graduate Program in Neuroscience, Stony Brook University, Stony Brook, NY, USA.

出版信息

bioRxiv. 2024 Dec 30:2024.12.30.628217. doi: 10.1101/2024.12.30.628217.

DOI:10.1101/2024.12.30.628217
PMID:39803499
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11722242/
Abstract

Song acquisition behavior observed in the songbird system provides a notable example of learning through trial- and-error which parallels human speech acquisition. Studying songbird vocal learning can offer insights into mechanisms underlying human language. We present a computational model of song learning that integrates reinforcement learning (RL) and Hebbian learning and agrees with known songbird circuitry. The song circuit outputs activity from nucleus RA, which receives two primary inputs: timing information from area HVC and stochastic activity from nucleus LMAN. Additionally, song learning relies on Area X, a basal ganglia area that receives dopaminergic inputs from VTA. In our model, song is first acquired in the HVC-to-Area X connectivity, employing an RL mechanism that involves node perturbation. This information is then consolidated into HVC-to-RA synapses through a Hebbian mechanism. The transfer of weights from Area X to RA takes place via the thalamus, utilizing a specific form of spike-timing-dependent plasticity (STDP). Thus, we present a computational model grounded in songbird circuitry in which the optimal policy is initially guided by RL and subsequently transferred to another circuit through Hebbian plasticity.

摘要

在鸣禽系统中观察到的歌曲习得行为提供了一个通过试错学习的显著例子,这与人类语言习得相似。研究鸣禽的声乐学习可以深入了解人类语言的潜在机制。我们提出了一种歌曲学习的计算模型,该模型整合了强化学习(RL)和赫布学习,并与已知的鸣禽神经回路一致。歌曲回路输出来自RA核的活动,RA核接收两个主要输入:来自HVC区域的时间信息和来自LMAN核的随机活动。此外,歌曲学习依赖于X区域,这是一个基底神经节区域,接收来自腹侧被盖区(VTA)的多巴胺能输入。在我们的模型中,歌曲首先在HVC到X区域的连接中习得,采用一种涉及节点扰动的强化学习机制。然后,这些信息通过赫布机制整合到HVC到RA的突触中。权重从X区域转移到RA是通过丘脑进行的,利用一种特定形式的尖峰时间依赖可塑性(STDP)。因此,我们提出了一个基于鸣禽神经回路的计算模型,其中最优策略最初由强化学习引导,随后通过赫布可塑性转移到另一个回路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/ff6f2c84cf39/nihpp-2024.12.30.628217v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/ac72349fa158/nihpp-2024.12.30.628217v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/8cbe5ff46fdc/nihpp-2024.12.30.628217v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/3fb06878a16d/nihpp-2024.12.30.628217v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/9a1d5012f512/nihpp-2024.12.30.628217v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/4ea04f2fa1c8/nihpp-2024.12.30.628217v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/ff6f2c84cf39/nihpp-2024.12.30.628217v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/ac72349fa158/nihpp-2024.12.30.628217v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/8cbe5ff46fdc/nihpp-2024.12.30.628217v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/3fb06878a16d/nihpp-2024.12.30.628217v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/9a1d5012f512/nihpp-2024.12.30.628217v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/4ea04f2fa1c8/nihpp-2024.12.30.628217v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5ed/11722242/ff6f2c84cf39/nihpp-2024.12.30.628217v1-f0007.jpg

相似文献

1
Weight Transfer in the Reinforcement Learning Model of Songbird Acquisition.鸣禽习得强化学习模型中的权重转移
bioRxiv. 2024 Dec 30:2024.12.30.628217. doi: 10.1101/2024.12.30.628217.
2
Mechanisms and time course of vocal learning and consolidation in the adult songbird.成年鸣禽的发声学习和巩固的机制和时程。
J Neurophysiol. 2011 Oct;106(4):1806-21. doi: 10.1152/jn.00311.2011. Epub 2011 Jul 6.
3
Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances.基于神经电导动态扰动的梯度估计的鸟鸣学习模型。
J Neurophysiol. 2007 Oct;98(4):2038-57. doi: 10.1152/jn.01311.2006. Epub 2007 Jul 25.
4
The Avian Basal Ganglia Are a Source of Rapid Behavioral Variation That Enables Vocal Motor Exploration.鸟类基底神经节是快速行为变化的来源,使发声运动探索成为可能。
J Neurosci. 2018 Nov 7;38(45):9635-9647. doi: 10.1523/JNEUROSCI.2915-17.2018. Epub 2018 Sep 24.
5
What Is the Role of Thalamostriatal Circuits in Learning Vocal Sequences?丘脑纹状体回路在学习声乐序列中的作用是什么?
Front Neural Circuits. 2021 Sep 22;15:724858. doi: 10.3389/fncir.2021.724858. eCollection 2021.
6
The HVC microcircuit: the synaptic basis for interactions between song motor and vocal plasticity pathways.HVC微回路:歌曲运动与发声可塑性通路之间相互作用的突触基础。
J Neurosci. 2005 Feb 23;25(8):1952-64. doi: 10.1523/JNEUROSCI.3726-04.2005.
7
Sleep-related neural activity in a premotor and a basal-ganglia pathway of the songbird.鸣禽前运动和基底神经节通路中与睡眠相关的神经活动。
J Neurophysiol. 2006 Aug;96(2):794-812. doi: 10.1152/jn.01064.2005. Epub 2006 Feb 22.
8
Electrophysiological analysis of a songbird basal ganglia circuit essential for vocal plasticity.对鸣禽发声可塑性至关重要的基底神经节回路的电生理分析。
Brain Res Bull. 2002;57(3-4):529-32. doi: 10.1016/s0361-9230(01)00690-6.
9
Connections of a motor cortical region in zebra finches: relation to pathways for vocal learning.斑胸草雀中一个运动皮层区域的连接:与发声学习通路的关系。
J Comp Neurol. 2000 May 1;420(2):244-60.
10
Lesions of an avian forebrain nucleus that disrupt song development alter synaptic connectivity and transmission in the vocal premotor pathway.破坏鸣唱发育的鸟类前脑核损伤会改变发声前运动通路中的突触连接和传递。
J Neurosci. 1999 Nov 1;19(21):9385-98. doi: 10.1523/JNEUROSCI.19-21-09385.1999.

本文引用的文献

1
Birdsong neuroscience and the evolutionary substrates of learned vocalization.鸟鸣神经科学与习得性发声的进化基础。
Trends Neurosci. 2023 Feb;46(2):97-99. doi: 10.1016/j.tins.2022.11.005. Epub 2022 Dec 12.
2
Actor-critic reinforcement learning in the songbird.鸣禽中的演员-批评家强化学习。
Curr Opin Neurobiol. 2020 Dec;65:1-9. doi: 10.1016/j.conb.2020.08.005. Epub 2020 Sep 6.
3
Dopamine neurons encode performance error in singing birds.多巴胺神经元编码鸣禽的行为误差。
Science. 2016 Dec 9;354(6317):1278-1282. doi: 10.1126/science.aah6837. Epub 2016 Dec 8.
4
A hypothesis for basal ganglia-dependent reinforcement learning in the songbird.鸣禽基底神经节依赖的强化学习假说。
Neuroscience. 2011 Dec 15;198:152-70. doi: 10.1016/j.neuroscience.2011.09.069. Epub 2011 Oct 13.
5
Mechanisms and time course of vocal learning and consolidation in the adult songbird.成年鸣禽的发声学习和巩固的机制和时程。
J Neurophysiol. 2011 Oct;106(4):1806-21. doi: 10.1152/jn.00311.2011. Epub 2011 Jul 6.
6
The songbird as a model for the generation and learning of complex sequential behaviors.鸣禽作为复杂序列行为产生与学习的模型。
ILAR J. 2010;51(4):362-77. doi: 10.1093/ilar.51.4.362.
7
Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances.基于神经电导动态扰动的梯度估计的鸟鸣学习模型。
J Neurophysiol. 2007 Oct;98(4):2038-57. doi: 10.1152/jn.01311.2006. Epub 2007 Jul 25.
8
Neuromuscular control of vocalizations in birdsong: a model.鸟鸣发声的神经肌肉控制:一个模型
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 May;65(5 Pt 1):051921. doi: 10.1103/PhysRevE.65.051921. Epub 2002 May 20.
9
A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning.斑胸草雀鸣唱系统各部分损伤后行为缺陷的比较研究:对声乐学习的启示
J Neurosci. 1991 Sep;11(9):2896-913. doi: 10.1523/JNEUROSCI.11-09-02896.1991.