• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在一种用于小鼠的新型概率反转学习任务中分离概率学习和反转学习

Separating Probability and Reversal Learning in a Novel Probabilistic Reversal Learning Task for Mice.

作者信息

Metha Jeremy A, Brian Maddison L, Oberrauch Sara, Barnes Samuel A, Featherby Travis J, Bossaerts Peter, Murawski Carsten, Hoyer Daniel, Jacobson Laura H

机构信息

Sleep and Cognition, The Florey Institute of Neuroscience and Mental Health, Parkville, VIC, Australia.

Translational Neuroscience, Department of Pharmacology and Therapeutics, School of Biomedical Sciences, Faculty of Medicine, Dentistry and Health Sciences, The University of Melbourne, Parkville, VIC, Australia.

出版信息

Front Behav Neurosci. 2020 Jan 9;13:270. doi: 10.3389/fnbeh.2019.00270. eCollection 2019.

DOI:10.3389/fnbeh.2019.00270
PMID:31998088
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6962304/
Abstract

The exploration/exploitation tradeoff - pursuing a known reward vs. sampling from lesser known options in the hope of finding a better payoff - is a fundamental aspect of learning and decision making. In humans, this has been studied using multi-armed bandit tasks. The same processes have also been studied using simplified probabilistic reversal learning (PRL) tasks with binary choices. Our investigations suggest that protocols previously used to explore PRL in mice may prove beyond their cognitive capacities, with animals performing at a no-better-than-chance level. We sought a novel probabilistic learning task to improve behavioral responding in mice, whilst allowing the investigation of the exploration/exploitation tradeoff in decision making. To achieve this, we developed a two-lever operant chamber task with levers corresponding to different probabilities (high/low) of receiving a saccharin reward, reversing the reward contingencies associated with levers once animals reached a threshold of 80% responding at the high rewarding lever. We found that, unlike in existing PRL tasks, mice are able to learn and behave near optimally with 80% high/20% low reward probabilities. Altering the reward contingencies towards equality showed that some mice displayed preference for the high rewarding lever with probabilities as close as 60% high/40% low. Additionally, we show that animal choice behavior can be effectively modelled using reinforcement learning (RL) models incorporating learning rates for positive and negative prediction error, a perseveration parameter, and a noise parameter. This new decision task, coupled with RL analyses, advances access to investigate the neuroscience of the exploration/exploitation tradeoff in decision making.

摘要

探索/利用权衡——追求已知奖励与从不太知名的选项中进行抽样以期望获得更好的回报——是学习和决策的一个基本方面。在人类中,这已通过多臂赌博任务进行研究。同样的过程也使用具有二元选择的简化概率反转学习(PRL)任务进行了研究。我们的研究表明,先前用于在小鼠中探索PRL的方案可能超出了它们的认知能力,动物的表现仅处于随机水平。我们寻求一种新颖的概率学习任务来改善小鼠的行为反应,同时允许对决策中的探索/利用权衡进行研究。为了实现这一点,我们开发了一种双杠杆操作性条件反射箱任务,其中杠杆对应于获得糖精奖励的不同概率(高/低),一旦动物在高奖励杠杆上的反应达到80%的阈值,就反转与杠杆相关的奖励偶然性。我们发现,与现有的PRL任务不同,小鼠能够在80%高/20%低奖励概率的情况下近乎最优地学习和表现。将奖励偶然性改变为相等表明,一些小鼠在高奖励杠杆的概率低至60%高/40%低时仍表现出偏好。此外,我们表明,使用包含正、负预测误差的学习率、坚持参数和噪声参数的强化学习(RL)模型可以有效地模拟动物的选择行为。这个新的决策任务,再加上RL分析,为研究决策中探索/利用权衡的神经科学提供了便利。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/b55398098f86/fnbeh-13-00270-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/9e0741062a1e/fnbeh-13-00270-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/31e65aa9f8af/fnbeh-13-00270-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/75d449886dfd/fnbeh-13-00270-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/b55398098f86/fnbeh-13-00270-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/9e0741062a1e/fnbeh-13-00270-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/31e65aa9f8af/fnbeh-13-00270-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/75d449886dfd/fnbeh-13-00270-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e17/6962304/b55398098f86/fnbeh-13-00270-g004.jpg

相似文献

1
Separating Probability and Reversal Learning in a Novel Probabilistic Reversal Learning Task for Mice.在一种用于小鼠的新型概率反转学习任务中分离概率学习和反转学习
Front Behav Neurosci. 2020 Jan 9;13:270. doi: 10.3389/fnbeh.2019.00270. eCollection 2019.
2
Orbitofrontal cortex reflects changes in response-outcome contingencies during probabilistic reversal learning.眶额皮质在概率性逆转学习过程中反映了反应-结果偶联的变化。
Neuroscience. 2017 Mar 14;345:27-37. doi: 10.1016/j.neuroscience.2016.03.034. Epub 2016 Mar 17.
3
Sex differences in learning from exploration.从探索中学习的性别差异。
Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.
4
Effects of environmental enrichment on exploratory behavior, win-stay and lose-shift performance, motor sequence learning, and reversal learning during the three-lever operant task in mice.环境富集对小鼠三杆作业任务中探索行为、赢留输移表现、运动序列学习和反转学习的影响。
Behav Brain Res. 2022 Jul 5;429:113904. doi: 10.1016/j.bbr.2022.113904. Epub 2022 Apr 22.
5
A role for neurogenesis in probabilistic reward learning.神经发生在概率性奖励学习中的作用。
Behav Neurosci. 2020 Aug;134(4):283-295. doi: 10.1037/bne0000370. Epub 2020 May 7.
6
Modulation of value-based decision making behavior by subregions of the rat prefrontal cortex.大鼠前额皮质亚区对基于价值的决策行为的调节。
Psychopharmacology (Berl). 2020 May;237(5):1267-1280. doi: 10.1007/s00213-020-05454-7. Epub 2020 Feb 6.
7
Probabilistic reversal learning is impaired in Parkinson's disease.帕金森病患者的概率反转学习受损。
Neuroscience. 2009 Nov 10;163(4):1092-101. doi: 10.1016/j.neuroscience.2009.07.033. Epub 2009 Jul 21.
8
Altered Statistical Learning and Decision-Making in Methamphetamine Dependence: Evidence from a Two-Armed Bandit Task.甲基苯丙胺成瘾中统计学习与决策的改变:来自双臂赌博任务的证据
Front Psychol. 2015 Dec 18;6:1910. doi: 10.3389/fpsyg.2015.01910. eCollection 2015.
9
Dual learning processes underlying human decision-making in reversal learning tasks: functional significance and evidence from the model fit to human behavior.人类在反转学习任务中决策的双重学习过程:功能意义及模型拟合人类行为的证据。
Front Psychol. 2014 Aug 12;5:871. doi: 10.3389/fpsyg.2014.00871. eCollection 2014.
10
Time elapsed between choices in a probabilistic task correlates with repeating the same decision.在概率任务中,选择之间的时间流逝与重复相同的决策相关。
Eur J Neurosci. 2021 Apr;53(8):2639-2654. doi: 10.1111/ejn.15144. Epub 2021 Mar 2.

引用本文的文献

1
Acute isolation is associated with increased reward seeking and reward learning in human adolescents.急性隔离与人类青少年寻求奖励和奖励学习的增加有关。
Commun Psychol. 2025 Sep 5;3(1):135. doi: 10.1038/s44271-025-00306-6.
2
Transient DREADD Manipulation of the Dorsal Dentate Gyrus in Rats Impairs Initial Learning of Place-Outcome Associations.对大鼠背侧齿状回进行短暂的DREADD操作会损害位置-结果关联的初始学习。
Hippocampus. 2025 May;35(3):e70014. doi: 10.1002/hipo.70014.
3
A review on exploration-exploitation trade-off in psychiatric disorders.

本文引用的文献

1
A Primer on Foraging and the Explore/Exploit Trade-Off for Psychiatry Research.精神病学研究中的觅食及探索/利用权衡入门
Neuropsychopharmacology. 2017 Sep;42(10):1931-1939. doi: 10.1038/npp.2017.108. Epub 2017 May 29.
2
The effects of reduced dopamine transporter function and chronic lithium on motivation, probabilistic learning, and neurochemistry in mice: Modeling bipolar mania.多巴胺转运体功能降低和慢性锂对小鼠动机、概率学习及神经化学的影响:双相躁狂症的模型构建
Neuropharmacology. 2017 Feb;113(Pt A):260-270. doi: 10.1016/j.neuropharm.2016.07.030. Epub 2016 Oct 11.
3
Multifaceted Contributions by Different Regions of the Orbitofrontal and Medial Prefrontal Cortex to Probabilistic Reversal Learning.
关于精神疾病中探索-利用权衡的综述。
BMC Psychiatry. 2025 Apr 26;25(1):420. doi: 10.1186/s12888-025-06837-w.
4
Design and validation of novel brain-penetrant HCN channel inhibitors to ameliorate social stress-induced susceptible phenotype.新型脑渗透性HCN通道抑制剂的设计与验证,以改善社会应激诱导的易感表型。
Mol Psychiatry. 2025 Apr 8. doi: 10.1038/s41380-025-02972-8.
5
Characterisation of behaviours relevant to apathy syndrome in the aged male rat.老年雄性大鼠与淡漠综合征相关行为的特征。
Behav Brain Res. 2024 May 28;466:114977. doi: 10.1016/j.bbr.2024.114977. Epub 2024 Apr 1.
6
Approach-avoidance reinforcement learning as a translational and computational model of anxiety-related avoidance.趋近-回避强化学习作为焦虑相关回避的转化和计算模型。
Elife. 2023 Nov 14;12:RP87720. doi: 10.7554/eLife.87720.
7
Activity in the Dorsomedial Striatum Underlies Serial Reversal Learning Performance Under Probabilistic Uncertainty.背内侧纹状体的活动是概率性不确定性下序列反转学习表现的基础。
Biol Psychiatry Glob Open Sci. 2022 Aug 26;3(4):1030-1041. doi: 10.1016/j.bpsgos.2022.08.005. eCollection 2023 Oct.
8
CADM2 is implicated in impulsive personality and numerous other traits by genome- and phenome-wide association studies in humans and mice.CADM2 通过全基因组和表型关联研究在人类和小鼠中与冲动型人格和许多其他特征有关。
Transl Psychiatry. 2023 May 12;13(1):167. doi: 10.1038/s41398-023-02453-y.
9
Reinforcement learning deficits exhibited by postnatal PCP-treated rats enable deep neural network classification.PCP 处理后的新生大鼠表现出的强化学习缺陷可实现深度神经网络分类。
Neuropsychopharmacology. 2023 Aug;48(9):1377-1385. doi: 10.1038/s41386-022-01514-y. Epub 2022 Dec 12.
10
Modulation of ventromedial orbitofrontal cortical glutamatergic activity affects the explore-exploit balance and influences value-based decision-making.调节腹内侧眶额皮质谷氨酸能活动会影响探索-利用平衡,并影响基于价值的决策。
Cereb Cortex. 2023 May 9;33(10):5783-5796. doi: 10.1093/cercor/bhac459.
眶额叶和内侧前额叶皮质不同区域对概率性反转学习的多方面贡献
J Neurosci. 2016 Feb 10;36(6):1996-2006. doi: 10.1523/JNEUROSCI.3366-15.2016.
4
Reversal learning and dopamine: a bayesian perspective.逆向学习与多巴胺:贝叶斯视角
J Neurosci. 2015 Feb 11;35(6):2407-16. doi: 10.1523/JNEUROSCI.1989-14.2015.
5
Risperidone and the 5-HT2A receptor antagonist M100907 improve probabilistic reversal learning in BTBR T + tf/J mice.利培酮和5-羟色胺2A受体拮抗剂M100907可改善BTBR T + tf/J小鼠的概率性逆向学习能力。
Autism Res. 2014 Oct;7(5):555-67. doi: 10.1002/aur.1395. Epub 2014 Jun 3.
6
Preferential involvement by nucleus accumbens shell in mediating probabilistic learning and reversal shifts.伏隔核壳部优先参与调节概率性学习和反转转变。
J Neurosci. 2014 Mar 26;34(13):4618-26. doi: 10.1523/JNEUROSCI.5058-13.2014.
7
Adaptive properties of differential learning rates for positive and negative outcomes.正性和负性结果的差异学习率的适应性特性。
Biol Cybern. 2013 Dec;107(6):711-9. doi: 10.1007/s00422-013-0571-5. Epub 2013 Oct 2.
8
Isolation rearing effects on probabilistic learning and cognitive flexibility in rats.隔离饲养对大鼠概率学习和认知灵活性的影响。
Cogn Affect Behav Neurosci. 2014 Mar;14(1):388-406. doi: 10.3758/s13415-013-0204-4.
9
Prefrontal mechanisms of behavioral flexibility, emotion regulation and value updating.前额叶的行为灵活性、情绪调节和价值更新机制。
Nat Neurosci. 2013 Aug;16(8):1140-5. doi: 10.1038/nn.3440. Epub 2013 Jun 23.
10
Establishing a probabilistic reversal learning test in mice: evidence for the processes mediating reward-stay and punishment-shift behaviour and for their modulation by serotonin.在小鼠中建立概率反转学习测试:中介奖励保持和惩罚转换行为的过程的证据,以及它们被血清素调节的证据。
Neuropharmacology. 2012 Nov;63(6):1012-21. doi: 10.1016/j.neuropharm.2012.07.025. Epub 2012 Jul 21.