• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

青少年和成年小鼠在学习并发刺激-动作关联时会同时使用增量强化学习和短期记忆。

Adolescent and adult mice use both incremental reinforcement learning and short term memory when learning concurrent stimulus-action associations.

作者信息

Chase Juliana, Xia Liyu, Tai Lung-Hao, Lin Wan Chen, Collins Anne G E, Wilbrecht Linda

机构信息

Department of Psychology, University of California, Berkeley, Berkeley, California, United States of America.

Department of Neuroscience, University of California, Berkeley, Berkeley, California, United States of America.

出版信息

PLoS Comput Biol. 2024 Dec 23;20(12):e1012667. doi: 10.1371/journal.pcbi.1012667. eCollection 2024 Dec.

DOI:10.1371/journal.pcbi.1012667
PMID:39715285
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11706416/
Abstract

Computational modeling has revealed that human research participants use both rapid working memory (WM) and incremental reinforcement learning (RL) (RL+WM) to solve a simple instrumental learning task, relying on WM when the number of stimuli is small and supplementing with RL when the number of stimuli exceeds WM capacity. Inspired by this work, we examined which learning systems and strategies are used by adolescent and adult mice when they first acquire a conditional associative learning task. In a version of the human RL+WM task translated for rodents, mice were required to associate odor stimuli (from a set of 2 or 4 odors) with a left or right port to receive reward. Using logistic regression and computational models to analyze the first 200 trials per odor, we determined that mice used both incremental RL and stimulus-insensitive, one-back strategies to solve the task. While these one-back strategies may be a simple form of short-term or working memory, they did not approximate the boost to learning performance that has been observed in human participants using WM in a comparable task. Adolescent and adult mice also showed comparable performance, with no change in learning rate or softmax beta parameters with adolescent development and task experience. However, reliance on a one-back perseverative, win-stay strategy increased with development in males in both odor set sizes, but was not dependent on gonadal hormones. Our findings advance a simple conditional associative learning task and new models to enable the isolation and quantification of reinforcement learning alongside other strategies mice use while learning to associate stimuli with rewards within a single behavioral session. These data and methods can inform and aid comparative study of reinforcement learning across species.

摘要

计算模型表明,人类研究参与者使用快速工作记忆(WM)和增量强化学习(RL)(RL+WM)来解决简单的工具性学习任务,当刺激数量较少时依赖WM,当刺激数量超过WM容量时则补充RL。受这项工作的启发,我们研究了青春期和成年小鼠首次习得条件性联想学习任务时使用了哪些学习系统和策略。在为啮齿动物翻译的人类RL+WM任务版本中,小鼠需要将气味刺激(从一组2种或4种气味中选择)与左端口或右端口相关联以获得奖励。使用逻辑回归和计算模型分析每种气味的前200次试验,我们确定小鼠使用增量RL和对刺激不敏感的一步回溯策略来解决任务。虽然这些一步回溯策略可能是短期或工作记忆的一种简单形式,但它们并没有近似于在类似任务中使用WM的人类参与者所观察到的对学习表现的提升。青春期和成年小鼠也表现出相当的性能,随着青春期发育和任务经验的增加,学习率或softmaxβ参数没有变化。然而,在两种气味集大小中,雄性小鼠对一步回溯坚持性赢留策略的依赖随着发育而增加,但不依赖于性腺激素。我们的研究结果推进了一个简单的条件性联想学习任务和新模型,以能够在单个行为会话中学习将刺激与奖励相关联时,分离和量化强化学习以及小鼠使用的其他策略。这些数据和方法可以为跨物种强化学习的比较研究提供信息和帮助。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/2a0e49dbcd47/pcbi.1012667.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/619260e5308c/pcbi.1012667.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/97d5c8715954/pcbi.1012667.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/2e90d00334d6/pcbi.1012667.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/5b8f70e92c5a/pcbi.1012667.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/8b4a7d896940/pcbi.1012667.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/5ac1dbf23270/pcbi.1012667.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/476521aafb9c/pcbi.1012667.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/e8cbf22e8764/pcbi.1012667.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/133445f11cd1/pcbi.1012667.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/2a0e49dbcd47/pcbi.1012667.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/619260e5308c/pcbi.1012667.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/97d5c8715954/pcbi.1012667.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/2e90d00334d6/pcbi.1012667.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/5b8f70e92c5a/pcbi.1012667.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/8b4a7d896940/pcbi.1012667.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/5ac1dbf23270/pcbi.1012667.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/476521aafb9c/pcbi.1012667.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/e8cbf22e8764/pcbi.1012667.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/133445f11cd1/pcbi.1012667.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfdc/11706416/2a0e49dbcd47/pcbi.1012667.g010.jpg

相似文献

1
Adolescent and adult mice use both incremental reinforcement learning and short term memory when learning concurrent stimulus-action associations.青少年和成年小鼠在学习并发刺激-动作关联时会同时使用增量强化学习和短期记忆。
PLoS Comput Biol. 2024 Dec 23;20(12):e1012667. doi: 10.1371/journal.pcbi.1012667. eCollection 2024 Dec.
2
How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis.强化学习中有多少是工作记忆,而不是强化学习?一项行为、计算和神经遗传学分析。
Eur J Neurosci. 2012 Apr;35(7):1024-35. doi: 10.1111/j.1460-9568.2011.07980.x.
3
Distentangling the systems contributing to changes in learning during adolescence.厘清导致青少年学习变化的系统。
Dev Cogn Neurosci. 2020 Feb;41:100732. doi: 10.1016/j.dcn.2019.100732. Epub 2019 Nov 14.
4
Neural Index of Reinforcement Learning Predicts Improved Stimulus-Response Retention under High Working Memory Load.神经强化学习指数预测在高工作记忆负荷下改善刺激-反应保持。
J Neurosci. 2023 Apr 26;43(17):3131-3143. doi: 10.1523/JNEUROSCI.1274-22.2023. Epub 2023 Mar 17.
5
Working memory contributions to reinforcement learning impairments in schizophrenia.工作记忆对精神分裂症强化学习障碍的影响
J Neurosci. 2014 Oct 8;34(41):13747-56. doi: 10.1523/JNEUROSCI.0989-14.2014.
6
The neurocognitive role of working memory load when Pavlovian motivational control affects instrumental learning.当巴甫洛夫式动机控制影响工具性学习时,工作记忆负荷的神经认知作用。
PLoS Comput Biol. 2023 Dec 8;19(12):e1011692. doi: 10.1371/journal.pcbi.1011692. eCollection 2023 Dec.
7
Working memory as a representational template for reinforcement learning.工作记忆作为强化学习的代表性模板。
Sci Rep. 2024 Nov 12;14(1):27660. doi: 10.1038/s41598-024-79119-2.
8
Working Memory Load Strengthens Reward Prediction Errors.工作记忆负荷增强奖励预测误差。
J Neurosci. 2017 Apr 19;37(16):4332-4342. doi: 10.1523/JNEUROSCI.2700-16.2017. Epub 2017 Mar 20.
9
Interactions Among Working Memory, Reinforcement Learning, and Effort in Value-Based Choice: A New Paradigm and Selective Deficits in Schizophrenia.基于价值的选择中工作记忆、强化学习和努力之间的相互作用:一种新范式及精神分裂症中的选择性缺陷。
Biol Psychiatry. 2017 Sep 15;82(6):431-439. doi: 10.1016/j.biopsych.2017.05.017. Epub 2017 May 31.
10
Relevance of working memory for reinforcement learning in older adults varies with timescale of learning.工作记忆与老年人强化学习的相关性随学习时间尺度而异。
Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2020 Sep;27(5):654-676. doi: 10.1080/13825585.2019.1664389. Epub 2019 Sep 22.

引用本文的文献

1
Genetic changes linked to two different syndromic forms of autism enhance reinforcement learning in adolescent male but not female mice.与两种不同综合征形式自闭症相关的基因变化增强了青春期雄性小鼠而非雌性小鼠的强化学习能力。
bioRxiv. 2025 Jan 15:2025.01.15.633099. doi: 10.1101/2025.01.15.633099.

本文引用的文献

1
Goal-directed learning in adolescence: neurocognitive development and contextual influences.青少年的目标导向学习:神经认知发展与环境影响。
Nat Rev Neurosci. 2024 Mar;25(3):176-194. doi: 10.1038/s41583-023-00783-w. Epub 2024 Jan 23.
2
Mixtures of strategies underlie rodent behavior during reversal learning.策略混合是啮齿动物在反转学习过程中行为的基础。
PLoS Comput Biol. 2023 Sep 14;19(9):e1011430. doi: 10.1371/journal.pcbi.1011430. eCollection 2023 Sep.
3
Reinforcement learning and Bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal.
强化学习和贝叶斯推断为青少年在随机反转中的独特优势提供了互补的模型。
Dev Cogn Neurosci. 2022 Jun;55:101106. doi: 10.1016/j.dcn.2022.101106. Epub 2022 Apr 22.
4
The estrous cycle and 17β-estradiol modulate the electrophysiological properties of rat nucleus accumbens core medium spiny neurons.动情周期和 17β-雌二醇调节大鼠伏隔核核心中间神经元的电生理特性。
J Neuroendocrinol. 2022 Jun;34(6):e13122. doi: 10.1111/jne.13122. Epub 2022 Apr 2.
5
Opponent control of behavior by dorsomedial striatal pathways depends on task demands and internal state.背内侧纹状体通路对行为的对抗性控制取决于任务需求和内部状态。
Nat Neurosci. 2022 Mar;25(3):345-357. doi: 10.1038/s41593-022-01021-9. Epub 2022 Mar 7.
6
Making sense of strengths and weaknesses observed in adolescent laboratory rodents.理解青少年实验鼠身上观察到的优点和缺点。
Curr Opin Psychol. 2022 Jun;45:101297. doi: 10.1016/j.copsyc.2021.12.009. Epub 2022 Jan 6.
7
Mice alternate between discrete strategies during perceptual decision-making.小鼠在感知决策过程中会在不同策略之间交替。
Nat Neurosci. 2022 Feb;25(2):201-212. doi: 10.1038/s41593-021-01007-z. Epub 2022 Feb 7.
8
How Working Memory and Reinforcement Learning Are Intertwined: A Cognitive, Neural, and Computational Perspective.工作记忆与强化学习如何相互交织:认知、神经及计算视角
J Cogn Neurosci. 2022 Mar 5;34(4):551-568. doi: 10.1162/jocn_a_01808.
9
Divergence of rodent and primate medial frontal cortex functional connectivity.啮齿动物和灵长类动物内侧前额叶皮层功能连接的发散。
Proc Natl Acad Sci U S A. 2020 Sep 1;117(35):21681-21689. doi: 10.1073/pnas.2003181117. Epub 2020 Aug 17.
10
Reinforcement Learning during Adolescence in Rats.大鼠青春期的强化学习。
J Neurosci. 2020 Jul 22;40(30):5857-5870. doi: 10.1523/JNEUROSCI.0910-20.2020. Epub 2020 Jun 29.