• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

老鼠在确定性任务中适应性地产生选择变异性。

Mice adaptively generate choice variability in a deterministic task.

机构信息

Sorbonne Université, CNRS, Institut des Systèmes Intelligents et de Robotique (ISIR), 75005, Paris, France.

Sorbonne Université, INSERM, CNRS, Neuroscience Paris Seine - Institut de Biologie Paris Seine (NPS - IBPS), 75005, Paris, France.

出版信息

Commun Biol. 2020 Jan 21;3(1):34. doi: 10.1038/s42003-020-0759-x.

DOI:10.1038/s42003-020-0759-x
PMID:31965053
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6972896/
Abstract

Can decisions be made solely by chance? Can variability be intrinsic to the decision-maker or is it inherited from environmental conditions? To investigate these questions, we designed a deterministic setting in which mice are rewarded for non-repetitive choice sequences, and modeled the experiment using reinforcement learning. We found that mice progressively increased their choice variability. Although an optimal strategy based on sequences learning was theoretically possible and would be more rewarding, animals used a pseudo-random selection which ensures high success rate. This was not the case if the animal is exposed to a uniform probabilistic reward delivery. We also show that mice were blind to changes in the temporal structure of reward delivery once they learned to choose at random. Overall, our results demonstrate that a decision-making process can self-generate variability and randomness, even when the rules governing reward delivery are neither stochastic nor volatile.

摘要

决策能否仅凭偶然做出?决策的可变性是内在的,还是由环境条件决定的?为了探究这些问题,我们设计了一个确定性环境,在此环境中,老鼠因非重复的选择序列而获得奖励,并使用强化学习对实验进行建模。我们发现,老鼠的选择变异性逐渐增加。尽管基于序列学习的最优策略在理论上是可能的,并且会带来更高的回报,但动物采用了一种伪随机选择,以确保高成功率。如果动物暴露于均匀概率的奖励分配中,则不会出现这种情况。我们还表明,一旦老鼠学会随机选择,它们就会对奖励分配的时间结构变化视而不见。总的来说,我们的结果表明,即使奖励分配的规则既不是随机的也不是不稳定的,决策过程也可以自我产生可变性和随机性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/0b48879dc97d/42003_2020_759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/86e54e97f65a/42003_2020_759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/ed0309955ce0/42003_2020_759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/feeae692fb7c/42003_2020_759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/0b48879dc97d/42003_2020_759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/86e54e97f65a/42003_2020_759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/ed0309955ce0/42003_2020_759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/feeae692fb7c/42003_2020_759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6806/6972896/0b48879dc97d/42003_2020_759_Fig4_HTML.jpg

相似文献

1
Mice adaptively generate choice variability in a deterministic task.老鼠在确定性任务中适应性地产生选择变异性。
Commun Biol. 2020 Jan 21;3(1):34. doi: 10.1038/s42003-020-0759-x.
2
Mice exhibit stochastic and efficient action switching during probabilistic decision making.在进行概率决策时,老鼠表现出随机且有效的动作转换。
Proc Natl Acad Sci U S A. 2022 Apr 12;119(15):e2113961119. doi: 10.1073/pnas.2113961119. Epub 2022 Apr 6.
3
Mouse tracking reveals structure knowledge in the absence of model-based choice.鼠标追踪揭示了在没有基于模型的选择的情况下的结构知识。
Nat Commun. 2020 Apr 20;11(1):1893. doi: 10.1038/s41467-020-15696-w.
4
Sex differences in learning from exploration.从探索中学习的性别差异。
Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.
5
Modeling the violation of reward maximization and invariance in reinforcement schedules.强化程序中奖励最大化和不变性违背的建模
PLoS Comput Biol. 2008 Aug 8;4(8):e1000131. doi: 10.1371/journal.pcbi.1000131.
6
The Role of Frontal Cortical and Medial-Temporal Lobe Brain Areas in Learning a Bayesian Prior Belief on Reversals.额叶皮质和内侧颞叶脑区在学习关于反转的贝叶斯先验信念中的作用。
J Neurosci. 2015 Aug 19;35(33):11751-60. doi: 10.1523/JNEUROSCI.1594-15.2015.
7
The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs.学习对知觉决策的影响及其对速度-准确性权衡的意义。
Nat Commun. 2020 Jun 2;11(1):2757. doi: 10.1038/s41467-020-16196-7.
8
Benchmarking for Bayesian Reinforcement Learning.贝叶斯强化学习的基准测试
PLoS One. 2016 Jun 15;11(6):e0157088. doi: 10.1371/journal.pone.0157088. eCollection 2016.
9
Divergent Strategies for Learning in Males and Females.男性和女性学习的差异化策略。
Curr Biol. 2021 Jan 11;31(1):39-50.e4. doi: 10.1016/j.cub.2020.09.075. Epub 2020 Oct 29.
10
Time elapsed between choices in a probabilistic task correlates with repeating the same decision.在概率任务中,选择之间的时间流逝与重复相同的决策相关。
Eur J Neurosci. 2021 Apr;53(8):2639-2654. doi: 10.1111/ejn.15144. Epub 2021 Mar 2.

引用本文的文献

1
Retrieval of contextual memory can be predicted by CA3 remapping and is differentially influenced by NMDAR activity in rat hippocampus subregions.上下文记忆的检索可以通过 CA3 重映射来预测,并且在大鼠海马亚区中受到 NMDA 受体活性的不同影响。
PLoS Biol. 2024 Jul 1;22(7):e3002706. doi: 10.1371/journal.pbio.3002706. eCollection 2024 Jul.
2
Strategy inference during learning via cognitive activity-based credit assignment models.通过基于认知活动的信用分配模型进行学习期间的策略推断。
Sci Rep. 2023 Jun 9;13(1):9408. doi: 10.1038/s41598-023-33604-2.
3
Instruction effects on randomness in sequence generation.

本文引用的文献

1
Computational noise in reward-guided learning drives behavioral variability in volatile environments.奖励导向学习中的计算噪声驱动易变环境中的行为可变性。
Nat Neurosci. 2019 Dec;22(12):2066-2077. doi: 10.1038/s41593-019-0518-9. Epub 2019 Oct 28.
2
Dopamine blockade impairs the exploration-exploitation trade-off in rats.多巴胺阻断会损害大鼠的探索-利用权衡。
Sci Rep. 2019 May 1;9(1):6770. doi: 10.1038/s41598-019-43245-z.
3
Managing competing goals - a key role for the frontopolar cortex.管理相互竞争的目标——前额叶皮层的关键作用。
指令对序列生成中随机性的影响。
Front Psychol. 2023 Mar 22;14:1113654. doi: 10.3389/fpsyg.2023.1113654. eCollection 2023.
4
Mice exhibit stochastic and efficient action switching during probabilistic decision making.在进行概率决策时,老鼠表现出随机且有效的动作转换。
Proc Natl Acad Sci U S A. 2022 Apr 12;119(15):e2113961119. doi: 10.1073/pnas.2113961119. Epub 2022 Apr 6.
5
A theory of memory for binary sequences: Evidence for a mental compression algorithm in humans.一种用于二进制序列的记忆理论:人类心理压缩算法的证据。
PLoS Comput Biol. 2021 Jan 19;17(1):e1008598. doi: 10.1371/journal.pcbi.1008598. eCollection 2021 Jan.
Nat Rev Neurosci. 2017 Nov;18(11):645-657. doi: 10.1038/nrn.2017.111. Epub 2017 Sep 29.
4
Increased locus coeruleus tonic activity causes disengagement from a patch-foraging task.蓝斑紧张性活动增强会导致从斑块觅食任务中脱离。
Cogn Affect Behav Neurosci. 2017 Dec;17(6):1073-1083. doi: 10.3758/s13415-017-0531-y.
5
Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking.腹侧被盖区的烟碱型乙酰胆碱受体促进不确定性寻求。
Nat Neurosci. 2016 Mar;19(3):471-8. doi: 10.1038/nn.4223. Epub 2016 Jan 18.
6
A Time Series Approach to Random Number Generation: Using Recurrence Quantification Analysis to Capture Executive Behavior.一种用于随机数生成的时间序列方法:利用递归量化分析捕捉执行行为。
Front Hum Neurosci. 2015 Jun 5;9:319. doi: 10.3389/fnhum.2015.00319. eCollection 2015.
7
Humans use directed and random exploration to solve the explore-exploit dilemma.人类利用有向探索和随机探索来解决探索与利用的两难困境。
J Exp Psychol Gen. 2014 Dec;143(6):2074-81. doi: 10.1037/a0038199. Epub 2014 Oct 27.
8
Behavioral variability through stochastic choice and its gating by anterior cingulate cortex.通过随机选择产生的行为可变性及其由前扣带皮层控制。
Cell. 2014 Sep 25;159(1):21-32. doi: 10.1016/j.cell.2014.08.037.
9
Temporal structure of motor variability is dynamically regulated and predicts motor learning ability.运动变异性的时间结构是动态调节的,并能预测运动学习能力。
Nat Neurosci. 2014 Feb;17(2):312-21. doi: 10.1038/nn.3616. Epub 2014 Jan 12.
10
Network resets in medial prefrontal cortex mark the onset of behavioral uncertainty.内侧前额叶皮层的网络重置标志着行为不确定性的开始。
Science. 2012 Oct 5;338(6103):135-9. doi: 10.1126/science.1226518.