• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
A multiplicative reinforcement learning model capturing learning dynamics and interindividual variability in mice.一种乘法强化学习模型,可捕捉小鼠中的学习动态和个体间变异性。
Proc Natl Acad Sci U S A. 2013 Dec 3;110(49):19950-5. doi: 10.1073/pnas.1312125110. Epub 2013 Nov 19.
2
Cortical recruitment determines learning dynamics and strategy.皮质募集决定学习动态和策略。
Nat Commun. 2019 Apr 1;10(1):1479. doi: 10.1038/s41467-019-09450-0.
3
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
4
High-order feature-based mixture models of classification learning predict individual learning curves and enable personalized teaching.基于高阶特征的分类学习混合模型可预测个体学习曲线,并实现个性化教学。
Proc Natl Acad Sci U S A. 2013 Jan 8;110(2):684-9. doi: 10.1073/pnas.1211606110. Epub 2012 Dec 26.
5
Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning.在使用强化学习预测小鼠行为中应激、基因型和去甲肾上腺素的作用
Nat Neurosci. 2009 Sep;12(9):1180-6. doi: 10.1038/nn.2374. Epub 2009 Aug 16.
6
Variable behavior and repeated learning in two mouse strains: Developmental and genetic contributions.两种小鼠品系的可变行为与重复学习:发育和遗传因素
Behav Processes. 2018 Dec;157:509-518. doi: 10.1016/j.beproc.2018.06.007. Epub 2018 Jun 18.
7
Different levels of food restriction reveal genotype-specific differences in learning a visual discrimination task.不同程度的食物限制揭示了在学习视觉辨别任务方面的基因型特异性差异。
PLoS One. 2012;7(11):e48703. doi: 10.1371/journal.pone.0048703. Epub 2012 Nov 7.
8
1-Back reinforcement symbolic-matching by humans: How do they learn it?1-人类的反向强化符号匹配:他们是如何学习的?
Learn Behav. 2023 Sep;51(3):274-280. doi: 10.3758/s13420-022-00558-w. Epub 2023 Jan 3.
9
Discrimination difficulty, cognitive burden, and reversal impairments in a maternal immune activation model of schizophrenia risk.精神分裂症风险的母体免疫激活模型中的辨别困难、认知负担和逆转障碍。
Behav Processes. 2019 Oct;167:103936. doi: 10.1016/j.beproc.2019.103936. Epub 2019 Aug 2.
10
Exploration biases forelimb reaching strategies.探索偏见影响前肢伸展策略。
Cell Rep. 2024 Apr 23;43(4):113958. doi: 10.1016/j.celrep.2024.113958. Epub 2024 Mar 22.

引用本文的文献

1
Acquisition of auditory discrimination mediated by different processes through two distinct circuits linked to the lateral striatum.通过与外侧纹状体相连的两条不同回路,由不同过程介导的听觉辨别能力的获得。
Elife. 2025 Jul 11;13:RP97326. doi: 10.7554/eLife.97326.
2
Stimulus uncertainty and relative reward rates determine adaptive responding in perceptual decision-making.刺激不确定性和相对奖励率决定了知觉决策中的适应性反应。
PLoS Comput Biol. 2025 May 27;21(5):e1012636. doi: 10.1371/journal.pcbi.1012636. eCollection 2025 May.
3
A spatial code for temporal information is necessary for efficient sensory learning.高效的感觉学习需要一种用于时间信息的空间编码。
Sci Adv. 2025 Jan 10;11(2):eadr6214. doi: 10.1126/sciadv.adr6214. Epub 2025 Jan 8.
4
Prior experience modifies acquisition trajectories via response-strategy sampling.先前经验通过反应策略抽样来改变习得轨迹。
Anim Cogn. 2023 Jul;26(4):1217-1239. doi: 10.1007/s10071-023-01769-y. Epub 2023 Apr 10.
5
CoBeL-RL: A neuroscience-oriented simulation framework for complex behavior and learning.CoBeL-RL:一个面向神经科学的复杂行为与学习模拟框架。
Front Neuroinform. 2023 Mar 9;17:1134405. doi: 10.3389/fninf.2023.1134405. eCollection 2023.
6
A model of hippocampal replay driven by experience and environmental structure facilitates spatial learning.经验和环境结构驱动的海马体重放模型促进了空间学习。
Elife. 2023 Mar 14;12:e82301. doi: 10.7554/eLife.82301.
7
Inter-Individual Differences in Cognitive Tasks: Focusing on the Shaping of Decision-Making Strategies.认知任务中的个体差异:聚焦于决策策略的形成
Front Behav Neurosci. 2022 Mar 29;16:818746. doi: 10.3389/fnbeh.2022.818746. eCollection 2022.
8
Unique features of stimulus-based probabilistic reversal learning.基于刺激的概率性逆转学习的独特特征。
Behav Neurosci. 2021 Aug;135(4):550-570. doi: 10.1037/bne0000474.
9
Striatal low-threshold spiking interneurons locally gate dopamine.纹状体中阈下尖峰放电中间神经元局部控制多巴胺。
Curr Biol. 2021 Sep 27;31(18):4139-4147.e6. doi: 10.1016/j.cub.2021.06.081. Epub 2021 Jul 23.
10
Stability of motor cortex network states during learning-associated neural reorganizations.运动皮层网络状态在与学习相关的神经重组过程中的稳定性。
J Neurophysiol. 2020 Nov 1;124(5):1327-1342. doi: 10.1152/jn.00061.2020. Epub 2020 Sep 16.

本文引用的文献

1
Increased axonal bouton dynamics in the aging mouse cortex.衰老小鼠皮层中轴突末梢动态增加。
Proc Natl Acad Sci U S A. 2013 Apr 16;110(16):E1514-23. doi: 10.1073/pnas.1218731110. Epub 2013 Mar 29.
2
Discrete neocortical dynamics predict behavioral categorization of sounds.离散的新皮层动力学可预测声音的行为分类。
Neuron. 2012 Oct 18;76(2):435-49. doi: 10.1016/j.neuron.2012.07.008. Epub 2012 Oct 17.
3
Exploring a latent cause theory of classical conditioning.探索经典条件作用的潜在原因理论。
Learn Behav. 2012 Sep;40(3):255-68. doi: 10.3758/s13420-012-0080-8.
4
Early life manipulations alter learning and memory in rats.早期生活的人为操作会改变大鼠的学习和记忆能力。
Neurosci Biobehav Rev. 2012 Oct;36(9):1985-2006. doi: 10.1016/j.neubiorev.2012.07.003. Epub 2012 Jul 20.
5
Neuron-type-specific signals for reward and punishment in the ventral tegmental area.腹侧被盖区中与奖赏和惩罚相关的神经元类型特异性信号。
Nature. 2012 Jan 18;482(7383):85-8. doi: 10.1038/nature10754.
6
Attention, the presolution period, and choice accuracy in pigeons.鸽子的注意力、预解决期和选择准确性。
Behav Processes. 2012 Mar;89(3):225-31. doi: 10.1016/j.beproc.2011.11.003. Epub 2011 Nov 29.
7
Juvenile mice show greater flexibility in multiple choice reversal learning than adults.幼年老鼠在多项选择反转学习中比成年老鼠表现出更大的灵活性。
Dev Cogn Neurosci. 2011 Oct;1(4):540-51. doi: 10.1016/j.dcn.2011.05.008.
8
Multiplicative dynamics underlie the emergence of the log-normal distribution of spine sizes in the neocortex in vivo.倍增动力学是体内新皮层中树突棘大小的对数正态分布出现的基础。
J Neurosci. 2011 Jun 29;31(26):9481-8. doi: 10.1523/JNEUROSCI.6130-10.2011.
9
Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning.在使用强化学习预测小鼠行为中应激、基因型和去甲肾上腺素的作用
Nat Neurosci. 2009 Sep;12(9):1180-6. doi: 10.1038/nn.2374. Epub 2009 Aug 16.
10
Correlated connectivity and the distribution of firing rates in the neocortex.新皮层中相关连接性与放电率分布
J Neurosci. 2009 Mar 25;29(12):3685-94. doi: 10.1523/JNEUROSCI.4500-08.2009.

一种乘法强化学习模型,可捕捉小鼠中的学习动态和个体间变异性。

A multiplicative reinforcement learning model capturing learning dynamics and interindividual variability in mice.

机构信息

Research Institute of Molecular Pathology, 1030 Vienna, Austria.

出版信息

Proc Natl Acad Sci U S A. 2013 Dec 3;110(49):19950-5. doi: 10.1073/pnas.1312125110. Epub 2013 Nov 19.

DOI:10.1073/pnas.1312125110
PMID:24255115
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3856837/
Abstract

Both in humans and in animals, different individuals may learn the same task with strikingly different speeds; however, the sources of this variability remain elusive. In standard learning models, interindividual variability is often explained by variations of the learning rate, a parameter indicating how much synapses are updated on each learning event. Here, we theoretically show that the initial connectivity between the neurons involved in learning a task is also a strong determinant of how quickly the task is learned, provided that connections are updated in a multiplicative manner. To experimentally test this idea, we trained mice to perform an auditory Go/NoGo discrimination task followed by a reversal to compare learning speed when starting from naive or already trained synaptic connections. All mice learned the initial task, but often displayed sigmoid-like learning curves, with a variable delay period followed by a steep increase in performance, as often observed in operant conditioning. For all mice, learning was much faster in the subsequent reversal training. An accurate fit of all learning curves could be obtained with a reinforcement learning model endowed with a multiplicative learning rule, but not with an additive rule. Surprisingly, the multiplicative model could explain a large fraction of the interindividual variability by variations in the initial synaptic weights. Altogether, these results demonstrate the power of multiplicative learning rules to account for the full dynamics of biological learning and suggest an important role of initial wiring in the brain for predispositions to different tasks.

摘要

在人类和动物中,不同个体可能以惊人不同的速度学习相同的任务;然而,这种可变性的来源仍然难以捉摸。在标准学习模型中,个体间的可变性通常可以通过学习率的变化来解释,学习率是一个表示每个学习事件中突触更新多少的参数。在这里,我们从理论上表明,参与学习任务的神经元之间的初始连接也是决定任务学习速度的一个重要因素,前提是连接以乘法方式更新。为了实验验证这一想法,我们训练老鼠执行听觉 Go/NoGo 辨别任务,然后进行反转,比较从幼稚或已经训练过的突触连接开始时的学习速度。所有的老鼠都学会了初始任务,但通常表现出类 S 型的学习曲线,有一个可变的延迟期,随后是性能的急剧增加,这在操作性条件反射中经常观察到。对于所有老鼠来说,在随后的反转训练中,学习速度要快得多。一个带有乘法学习规则的强化学习模型可以准确地拟合所有的学习曲线,但不能用加法规则。令人惊讶的是,乘法模型可以通过初始突触权重的变化来解释很大一部分个体间的可变性。总之,这些结果表明乘法学习规则在解释生物学习的全部动态方面具有强大的能力,并表明初始连接在大脑中对不同任务的倾向具有重要作用。