Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys.

Affiliations

Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.

Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA.

Publication information

Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23.

Abstract

Despite being unpredictable and uncertain, reward environments often exhibit certain regularities, and animals navigating these environments try to detect and utilize such regularities to adapt their behavior. However, successful learning requires that animals also adjust to uncertainty associated with those regularities. Here, we analyzed choice data from two comparable dynamic foraging tasks in mice and monkeys to investigate mechanisms underlying adjustments to different types of uncertainty. In these tasks, animals selected between two choice options that delivered reward probabilistically, while baseline reward probabilities changed after a variable number (block) of trials without any cues to the animals. To measure adjustments in behavior, we applied multiple metrics based on information theory that quantify consistency in behavior, and fit choice data using reinforcement learning models. We found that in both species, learning and choice were affected by uncertainty about reward outcomes (in terms of determining the better option) and by expectation about when the environment may change. However, these effects were mediated through different mechanisms. First, more uncertainty about the better option resulted in slower learning and forgetting in mice, whereas it had no significant effect in monkeys. Second, expectation of block switches accompanied slower learning, faster forgetting, and increased stochasticity in choice in mice, whereas it only reduced learning rates in monkeys. Overall, while demonstrating the usefulness of metrics based on information theory in examining adaptive behavior, our study provides evidence for multiple types of adjustments in learning and choice behavior according to uncertainty in the reward environment.
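The kind of task and model described in the abstract can be illustrated with a minimal sketch. It assumes a two-option task with uncued reversals of the reward probabilities, an agent with a learning rate for the chosen option, a forgetting rate for the unchosen option, and a softmax choice rule, and it uses the conditional entropy of stay/switch given the previous outcome as one possible information-theoretic measure of behavioral consistency. All function names, parameter values, and the specific update and entropy formulas are assumptions made for illustration; they are not taken from the study and may differ from the models and metrics the authors actually used.

```python
import math
import random


def simulate(n_trials=2000, block_len=(40, 80), p_reward=(0.8, 0.2),
             alpha=0.4, zeta=0.1, beta=5.0, seed=0):
    """Simulate a simple RL agent on a two-option task with uncued reversals.

    Every default value here is an illustrative assumption, not a value
    reported in the study. Returns a list of (choice, rewarded) tuples.
    """
    rng = random.Random(seed)
    q = [0.5, 0.5]                     # value estimates for the two options
    probs = list(p_reward)             # current reward probabilities
    next_switch = rng.randint(*block_len)
    history = []
    for t in range(n_trials):
        if t == next_switch:           # block switch, with no cue to the agent
            probs.reverse()
            next_switch = t + rng.randint(*block_len)
        # Softmax over two options; smaller beta means more stochastic choice.
        p_choose_1 = 1.0 / (1.0 + math.exp(-beta * (q[1] - q[0])))
        choice = 1 if rng.random() < p_choose_1 else 0
        rewarded = rng.random() < probs[choice]
        # Chosen option: delta-rule update with learning rate alpha.
        q[choice] += alpha * ((1.0 if rewarded else 0.0) - q[choice])
        # Unchosen option: decay toward zero with forgetting rate zeta
        # (decaying toward zero is an assumption of this sketch).
        q[1 - choice] *= (1.0 - zeta)
        history.append((choice, rewarded))
    return history


def stay_entropy_given_reward(history):
    """Return H(stay/switch | previous outcome) in bits.

    0 bits means the previous outcome fully determines whether the agent
    stays or switches; 1 bit means stay/switch is maximally unpredictable.
    """
    counts = {}                        # (prev_rewarded, stayed) -> count
    for (c_prev, r_prev), (c_next, _) in zip(history, history[1:]):
        key = (r_prev, c_next == c_prev)
        counts[key] = counts.get(key, 0) + 1
    total = sum(counts.values())
    h = 0.0
    for (r_prev, _stayed), n in counts.items():
        n_r = counts.get((r_prev, True), 0) + counts.get((r_prev, False), 0)
        h -= (n / total) * math.log2(n / n_r)   # -p(s, r) * log2 p(s | r)
    return h


if __name__ == "__main__":
    data = simulate()
    print(f"H(stay | previous outcome) = {stay_entropy_given_reward(data):.3f} bits")
```

In a sketch like this, lowering `alpha`, raising `zeta`, or lowering `beta` corresponds loosely to the slower learning, faster forgetting, and increased choice stochasticity mentioned in the abstract, and the entropy measure gives a model-free way to see how such changes affect the consistency of behavior.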
