在老鼠和猴子中，对奖励环境中不同类型不确定性的调整机制。

Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys.

机构信息

Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.

Department of Psychology, University of California, Los Angeles, Los Angeles, CA, USA.

出版信息

Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23.

DOI:10.3758/s13415-022-01059-z

PMID:36823249

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10444905/

Abstract

Despite being unpredictable and uncertain, reward environments often exhibit certain regularities, and animals navigating these environments try to detect and utilize such regularities to adapt their behavior. However, successful learning requires that animals also adjust to uncertainty associated with those regularities. Here, we analyzed choice data from two comparable dynamic foraging tasks in mice and monkeys to investigate mechanisms underlying adjustments to different types of uncertainty. In these tasks, animals selected between two choice options that delivered reward probabilistically, while baseline reward probabilities changed after a variable number (block) of trials without any cues to the animals. To measure adjustments in behavior, we applied multiple metrics based on information theory that quantify consistency in behavior, and fit choice data using reinforcement learning models. We found that in both species, learning and choice were affected by uncertainty about reward outcomes (in terms of determining the better option) and by expectation about when the environment may change. However, these effects were mediated through different mechanisms. First, more uncertainty about the better option resulted in slower learning and forgetting in mice, whereas it had no significant effect in monkeys. Second, expectation of block switches accompanied slower learning, faster forgetting, and increased stochasticity in choice in mice, whereas it only reduced learning rates in monkeys. Overall, while demonstrating the usefulness of metrics based on information theory in examining adaptive behavior, our study provides evidence for multiple types of adjustments in learning and choice behavior according to uncertainty in the reward environment.

摘要

尽管奖励环境具有不可预测性和不确定性，但它们通常会表现出某些规律性，而动物在导航这些环境时会试图发现和利用这些规律性来适应自己的行为。然而，成功的学习要求动物也要适应与这些规律性相关的不确定性。在这里，我们分析了来自小鼠和猴子的两个类似的动态觅食任务的选择数据，以研究适应不同类型不确定性的机制。在这些任务中，动物在两个选择选项之间进行选择，这些选项以概率提供奖励，而基线奖励概率在没有任何动物线索的情况下，经过一定数量（块）的试验后发生变化。为了衡量行为的调整，我们应用了基于信息论的多种度量标准，这些标准量化了行为的一致性，并使用强化学习模型拟合选择数据。我们发现，在这两个物种中，学习和选择都受到奖励结果不确定性（确定更好选项）和环境可能何时变化的预期的影响。然而，这些影响是通过不同的机制介导的。首先，更好选项的不确定性增加导致小鼠学习和遗忘速度变慢，而在猴子中则没有显著影响。其次，对块切换的预期伴随着小鼠学习速度变慢、遗忘速度变快以及选择的随机性增加，而在猴子中，它仅降低了学习率。总的来说，虽然基于信息论的度量标准在检查适应性行为方面非常有用，但我们的研究提供了证据，表明根据奖励环境的不确定性，学习和选择行为会发生多种类型的调整。

相似文献

Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys.在老鼠和猴子中，对奖励环境中不同类型不确定性的调整机制。

Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23.

Entropy-based metrics for predicting choice behavior based on local response to reward.基于熵的指标，用于预测基于奖励局部响应的选择行为。

Nat Commun. 2021 Nov 12;12(1):6567. doi: 10.1038/s41467-021-26784-w.

Nutrient-Sensitive Reinforcement Learning in Monkeys.猴子的营养敏感强化学习。

J Neurosci. 2023 Mar 8;43(10):1714-1730. doi: 10.1523/JNEUROSCI.0752-22.2022. Epub 2023 Jan 20.

Flexible combination of reward information across primates.灵长类动物的奖励信息的灵活组合。

Nat Hum Behav. 2019 Nov;3(11):1215-1224. doi: 10.1038/s41562-019-0714-3. Epub 2019 Sep 9.

Multiple Mechanisms for Processing Reward Uncertainty in the Primate Basal Forebrain.灵长类动物基底前脑处理奖励不确定性的多种机制。

J Neurosci. 2016 Jul 27;36(30):7852-64. doi: 10.1523/JNEUROSCI.1123-16.2016.

Sex differences in learning from exploration.从探索中学习的性别差异。

Elife. 2021 Nov 19;10:e69748. doi: 10.7554/eLife.69748.

Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty.作为不确定性下适应性学习与选择的神经基础的元可塑性

Neuron. 2017 Apr 19;94(2):401-414.e6. doi: 10.1016/j.neuron.2017.03.044.

Mice exhibit stochastic and efficient action switching during probabilistic decision making.在进行概率决策时，老鼠表现出随机且有效的动作转换。

Proc Natl Acad Sci U S A. 2022 Apr 12;119(15):e2113961119. doi: 10.1073/pnas.2113961119. Epub 2022 Apr 6.

Multiple Choice Neurodynamical Model of the Uncertain Option Task.不确定选项任务的多重选择神经动力学模型。

PLoS Comput Biol. 2017 Jan 11;13(1):e1005250. doi: 10.1371/journal.pcbi.1005250. eCollection 2017 Jan.

Dopamine reward prediction error signal codes the temporal evaluation of a perceptual decision report.多巴胺奖赏预测误差信号编码了对感知决策报告的时间评估。

Proc Natl Acad Sci U S A. 2017 Nov 28;114(48):E10494-E10503. doi: 10.1073/pnas.1712479114. Epub 2017 Nov 13.

引用本文的文献

Stimulus uncertainty and relative reward rates determine adaptive responding in perceptual decision-making.刺激不确定性和相对奖励率决定了知觉决策中的适应性反应。

PLoS Comput Biol. 2025 May 27;21(5):e1012636. doi: 10.1371/journal.pcbi.1012636. eCollection 2025 May.

Foraging animals use dynamic Bayesian updating to model meta-uncertainty in environment representations.觅食动物利用动态贝叶斯更新来对环境表征中的元不确定性进行建模。

PLoS Comput Biol. 2025 Apr 30;21(4):e1012989. doi: 10.1371/journal.pcbi.1012989. eCollection 2025 Apr.

Reaching vigor tracks learned prediction error.达到活力追踪学习到的预测误差。

bioRxiv. 2025 Mar 25:2025.03.24.645035. doi: 10.1101/2025.03.24.645035.

Representation of Anticipated Rewards and Punishments in the Human Brain.人类大脑中预期奖励与惩罚的表征。

Annu Rev Psychol. 2025 Jan;76(1):197-226. doi: 10.1146/annurev-psych-022324-042614. Epub 2024 Dec 3.

Foraging Under Uncertainty Follows the Marginal Value Theorem with Bayesian Updating of Environment Representations.在不确定性下觅食遵循边际价值定理并对环境表征进行贝叶斯更新。

bioRxiv. 2024 Mar 31:2024.03.30.587253. doi: 10.1101/2024.03.30.587253.

Mixtures of strategies underlie rodent behavior during reversal learning.策略混合是啮齿动物在反转学习过程中行为的基础。

PLoS Comput Biol. 2023 Sep 14;19(9):e1011430. doi: 10.1371/journal.pcbi.1011430. eCollection 2023 Sep.

Neuronal Representation of a Working Memory-Based Decision Strategy in the Motor and Prefrontal Cortico-Basal Ganglia Loops.运动和前额皮质-基底神经节回路中基于工作记忆的决策策略的神经元表示。

eNeuro. 2023 Jun 20;10(6). doi: 10.1523/ENEURO.0413-22.2023. Print 2023 Jun.

本文引用的文献

Undermatching Is a Consequence of Policy Compression.政策压缩导致不匹配。

J Neurosci. 2023 Jan 18;43(3):447-457. doi: 10.1523/JNEUROSCI.1003-22.2022. Epub 2022 Dec 6.

The PRO model accounts for the anterior cingulate cortex role in risky decision-making and monitoring.PRO 模型考虑了前扣带皮层在风险决策和监测中的作用。

Cogn Affect Behav Neurosci. 2022 Oct;22(5):952-968. doi: 10.3758/s13415-022-00992-3. Epub 2022 Mar 24.

Serotonin neurons modulate learning rate through uncertainty.血清素神经元通过不确定性来调节学习率。

Curr Biol. 2022 Feb 7;32(3):586-599.e7. doi: 10.1016/j.cub.2021.12.006. Epub 2021 Dec 21.

Learning at Variable Attentional Load Requires Cooperation of Working Memory, Meta-learning, and Attention-augmented Reinforcement Learning.在可变注意力负荷下学习需要工作记忆、元学习和注意力增强的强化学习的合作。

J Cogn Neurosci. 2021 Dec 6;34(1):79-107. doi: 10.1162/jocn_a_01780.

A model for learning based on the joint estimation of stochasticity and volatility.基于随机波动联合估计的学习模型。

Nat Commun. 2021 Nov 15;12(1):6587. doi: 10.1038/s41467-021-26731-9.

Entropy-based metrics for predicting choice behavior based on local response to reward.基于熵的指标，用于预测基于奖励局部响应的选择行为。

Nat Commun. 2021 Nov 12;12(1):6567. doi: 10.1038/s41467-021-26784-w.

Control over patch encounters changes foraging behavior.对斑块相遇的控制会改变觅食行为。

iScience. 2021 Aug 20;24(9):103005. doi: 10.1016/j.isci.2021.103005. eCollection 2021 Sep 24.

Unique features of stimulus-based probabilistic reversal learning.基于刺激的概率性逆转学习的独特特征。

Behav Neurosci. 2021 Aug;135(4):550-570. doi: 10.1037/bne0000474.

Advances in modeling learning and decision-making in neuroscience.神经科学中学习和决策建模的进展。

Neuropsychopharmacology. 2022 Jan;47(1):104-118. doi: 10.1038/s41386-021-01126-y. Epub 2021 Aug 27.

Foraging with the frontal cortex: A cross-species evaluation of reward-guided behavior.用前额皮质觅食：奖赏引导行为的跨物种评估。

Neuropsychopharmacology. 2022 Jan;47(1):134-146. doi: 10.1038/s41386-021-01140-0. Epub 2021 Aug 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验