• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

策略混合是啮齿动物在反转学习过程中行为的基础。

Mixtures of strategies underlie rodent behavior during reversal learning.

机构信息

Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.

Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America.

出版信息

PLoS Comput Biol. 2023 Sep 14;19(9):e1011430. doi: 10.1371/journal.pcbi.1011430. eCollection 2023 Sep.

DOI:10.1371/journal.pcbi.1011430
PMID:37708113
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10501641/
Abstract

In reversal learning tasks, the behavior of humans and animals is often assumed to be uniform within single experimental sessions to facilitate data analysis and model fitting. However, behavior of agents can display substantial variability in single experimental sessions, as they execute different blocks of trials with different transition dynamics. Here, we observed that in a deterministic reversal learning task, mice display noisy and sub-optimal choice transitions even at the expert stages of learning. We investigated two sources of the sub-optimality in the behavior. First, we found that mice exhibit a high lapse rate during task execution, as they reverted to unrewarded directions after choice transitions. Second, we unexpectedly found that a majority of mice did not execute a uniform strategy, but rather mixed between several behavioral modes with different transition dynamics. We quantified the use of such mixtures with a state-space model, block Hidden Markov Model (block HMM), to dissociate the mixtures of dynamic choice transitions in individual blocks of trials. Additionally, we found that blockHMM transition modes in rodent behavior can be accounted for by two different types of behavioral algorithms, model-free or inference-based learning, that might be used to solve the task. Combining these approaches, we found that mice used a mixture of both exploratory, model-free strategies and deterministic, inference-based behavior in the task, explaining their overall noisy choice sequences. Together, our combined computational approach highlights intrinsic sources of noise in rodent reversal learning behavior and provides a richer description of behavior than conventional techniques, while uncovering the hidden states that underlie the block-by-block transitions.

摘要

在反转学习任务中,为了便于数据分析和模型拟合,通常假设人类和动物的行为在单个实验会话内是一致的。然而,由于代理在执行具有不同转换动态的不同试验块时,可以显示出相当大的行为变异性,因此其行为会显示出很大的可变性。在这里,我们观察到,在确定性反转学习任务中,即使在学习的专家阶段,老鼠的选择转换也会显示出嘈杂和次优的行为。我们研究了行为次优性的两个来源。首先,我们发现老鼠在任务执行过程中表现出很高的失误率,因为它们在选择转换后又回到未受奖励的方向。其次,我们出人意料地发现,大多数老鼠并没有执行统一的策略,而是在几种具有不同转换动态的行为模式之间混合。我们使用状态空间模型(block Hidden Markov Model,block HMM)来量化这种混合,以区分个体试验块中的混合动态选择转换。此外,我们发现啮齿动物行为中的 blockHMM 转换模式可以由两种不同类型的行为算法来解释,即无模型或基于推理的学习,这两种算法可能用于解决任务。结合这些方法,我们发现老鼠在任务中同时使用了探索性的无模型策略和确定性的基于推理的行为的混合,这解释了它们整体嘈杂的选择序列。总的来说,我们的组合计算方法突出了啮齿动物反转学习行为中的内在噪声源,并提供了比传统技术更丰富的行为描述,同时揭示了块到块转换背后的隐藏状态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/43e8cb0aefc0/pcbi.1011430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/82f03cae9530/pcbi.1011430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/bbbe7621bf9e/pcbi.1011430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/42dab0423d6b/pcbi.1011430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/2cb105247f6b/pcbi.1011430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/3b5a693f6a1e/pcbi.1011430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/43e8cb0aefc0/pcbi.1011430.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/82f03cae9530/pcbi.1011430.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/bbbe7621bf9e/pcbi.1011430.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/42dab0423d6b/pcbi.1011430.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/2cb105247f6b/pcbi.1011430.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/3b5a693f6a1e/pcbi.1011430.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e79/10501641/43e8cb0aefc0/pcbi.1011430.g006.jpg

相似文献

1
Mixtures of strategies underlie rodent behavior during reversal learning.策略混合是啮齿动物在反转学习过程中行为的基础。
PLoS Comput Biol. 2023 Sep 14;19(9):e1011430. doi: 10.1371/journal.pcbi.1011430. eCollection 2023 Sep.
2
Impairments in operant probabilistic reversal learning in BTBR T+tf/J male and female mice.BTBR T+tf/J 雄性和雌性小鼠在操作性概率反转学习中的损伤。
Behav Brain Res. 2023 Feb 2;437:114111. doi: 10.1016/j.bbr.2022.114111. Epub 2022 Sep 12.
3
The Role of Frontal Cortical and Medial-Temporal Lobe Brain Areas in Learning a Bayesian Prior Belief on Reversals.额叶皮质和内侧颞叶脑区在学习关于反转的贝叶斯先验信念中的作用。
J Neurosci. 2015 Aug 19;35(33):11751-60. doi: 10.1523/JNEUROSCI.1594-15.2015.
4
Spatiotemporal Pavlovian head-fixed reversal learning task for mice.用于小鼠的时空 Pavlovian 头固定反转学习任务。
Mol Brain. 2022 Sep 7;15(1):78. doi: 10.1186/s13041-022-00952-5.
5
Deliberation and Procedural Automation on a Two-Step Task for Rats.大鼠两步任务中的思考与程序自动化
Front Integr Neurosci. 2018 Aug 3;12:30. doi: 10.3389/fnint.2018.00030. eCollection 2018.
6
Striatal dysfunction during reversal learning in unmedicated schizophrenia patients.未服药的精神分裂症患者在逆向学习过程中的纹状体功能障碍。
Neuroimage. 2014 Apr 1;89(100):171-80. doi: 10.1016/j.neuroimage.2013.11.034. Epub 2013 Nov 27.
7
Dissociable effects of 5-HT2C receptor antagonism and genetic inactivation on perseverance and learned non-reward in an egocentric spatial reversal task.5-HT2C 受体拮抗和基因失活对以自我为中心的空间反转任务中坚持和习得性无奖励的可分离影响。
PLoS One. 2013 Oct 30;8(10):e77762. doi: 10.1371/journal.pone.0077762. eCollection 2013.
8
Mice adaptively generate choice variability in a deterministic task.老鼠在确定性任务中适应性地产生选择变异性。
Commun Biol. 2020 Jan 21;3(1):34. doi: 10.1038/s42003-020-0759-x.
9
Reversal learning and dopamine: a bayesian perspective.逆向学习与多巴胺:贝叶斯视角
J Neurosci. 2015 Feb 11;35(6):2407-16. doi: 10.1523/JNEUROSCI.1989-14.2015.
10
Reinforcement Learning during Adolescence in Rats.大鼠青春期的强化学习。
J Neurosci. 2020 Jul 22;40(30):5857-5870. doi: 10.1523/JNEUROSCI.0910-20.2020. Epub 2020 Jun 29.

引用本文的文献

1
Structured experience shapes strategy learning and neural dynamics in the medial entorhinal cortex.结构化体验塑造内嗅皮层中的策略学习和神经动力学。
Res Sq. 2025 May 28:rs.3.rs-6658028. doi: 10.21203/rs.3.rs-6658028/v1.
2
Structured experience shapes strategy learning and neural dynamics in the medial entorhinal cortex.结构化体验塑造内嗅皮层中的策略学习和神经动力学。
bioRxiv. 2025 May 13:2025.05.13.653873. doi: 10.1101/2025.05.13.653873.
3
Adolescent and adult mice use both incremental reinforcement learning and short term memory when learning concurrent stimulus-action associations.

本文引用的文献

1
Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys.在老鼠和猴子中,对奖励环境中不同类型不确定性的调整机制。
Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23.
2
Shared and specialized coding across posterior cortical areas for dynamic navigation decisions.用于动态导航决策的后皮质区域的共享和专门编码。
Neuron. 2022 Aug 3;110(15):2484-2502.e16. doi: 10.1016/j.neuron.2022.05.012. Epub 2022 Jun 8.
3
Reinforcement learning and Bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal.
青少年和成年小鼠在学习并发刺激-动作关联时会同时使用增量强化学习和短期记忆。
PLoS Comput Biol. 2024 Dec 23;20(12):e1012667. doi: 10.1371/journal.pcbi.1012667. eCollection 2024 Dec.
4
Active reinforcement learning versus action bias and hysteresis: control with a mixture of experts and nonexperts.主动强化学习与动作偏差和滞后的比较:混合专家与非专家的控制。
PLoS Comput Biol. 2024 Mar 29;20(3):e1011950. doi: 10.1371/journal.pcbi.1011950. eCollection 2024 Mar.
5
Behavioral strategy shapes activation of the Vip-Sst disinhibitory circuit in visual cortex.行为策略塑造视觉皮层中 Vip-Sst 去抑制回路的激活。
Neuron. 2024 Jun 5;112(11):1876-1890.e4. doi: 10.1016/j.neuron.2024.02.008. Epub 2024 Mar 5.
6
Enhancement of mediodorsal thalamus rescues aberrant belief dynamics in a mouse model with schizophrenia-associated mutation.增强背内侧丘脑可挽救具有精神分裂症相关突变的小鼠模型中的异常信念动态。
bioRxiv. 2024 Feb 14:2024.01.08.574745. doi: 10.1101/2024.01.08.574745.
7
IntelliCage: the development and perspectives of a mouse- and user-friendly automated behavioral test system.智能鼠笼:一种对小鼠和用户友好的自动化行为测试系统的开发与前景
Front Behav Neurosci. 2024 Jan 3;17:1270538. doi: 10.3389/fnbeh.2023.1270538. eCollection 2023.
强化学习和贝叶斯推断为青少年在随机反转中的独特优势提供了互补的模型。
Dev Cogn Neurosci. 2022 Jun;55:101106. doi: 10.1016/j.dcn.2022.101106. Epub 2022 Apr 22.
4
Mice alternate between discrete strategies during perceptual decision-making.小鼠在感知决策过程中会在不同策略之间交替。
Nat Neurosci. 2022 Feb;25(2):201-212. doi: 10.1038/s41593-021-01007-z. Epub 2022 Feb 7.
5
Serotonin neurons modulate learning rate through uncertainty.血清素神经元通过不确定性来调节学习率。
Curr Biol. 2022 Feb 7;32(3):586-599.e7. doi: 10.1016/j.cub.2021.12.006. Epub 2021 Dec 21.
6
Context-dependent persistency as a coding mechanism for robust and widely distributed value coding.上下文相关的持续作为一种稳健且广泛分布的价值编码的编码机制。
Neuron. 2022 Feb 2;110(3):502-515.e11. doi: 10.1016/j.neuron.2021.11.001. Epub 2021 Nov 23.
7
Sex differences in learning from exploration.从探索中学习的性别差异。
Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.
8
Unique features of stimulus-based probabilistic reversal learning.基于刺激的概率性逆转学习的独特特征。
Behav Neurosci. 2021 Aug;135(4):550-570. doi: 10.1037/bne0000474.
9
Mice in a labyrinth show rapid learning, sudden insight, and efficient exploration.实验室里的老鼠表现出快速的学习能力、突然的洞察力和高效的探索能力。
Elife. 2021 Jul 1;10:e66175. doi: 10.7554/eLife.66175.
10
Lapses in perceptual decisions reflect exploration.感知决策的失误反映了探索。
Elife. 2021 Jan 11;10:e55490. doi: 10.7554/eLife.55490.