猴子在面对有噪声的刺激和不均等的奖励时能做出最优选择吗？

Can monkeys choose optimally when faced with noisy stimuli and unequal rewards?

作者信息

Feng Samuel, Holmes Philip, Rorie Alan, Newsome William T

机构信息

Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey, United States of America.

出版信息

PLoS Comput Biol. 2009 Feb;5(2):e1000284. doi: 10.1371/journal.pcbi.1000284. Epub 2009 Feb 13.

DOI:10.1371/journal.pcbi.1000284

PMID:19214201

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2631644/

Abstract

We review the leaky competing accumulator model for two-alternative forced-choice decisions with cued responses, and propose extensions to account for the influence of unequal rewards. Assuming that stimulus information is integrated until the cue to respond arrives and that firing rates of stimulus-selective neurons remain well within physiological bounds, the model reduces to an Ornstein-Uhlenbeck (OU) process that yields explicit expressions for the psychometric function that describes accuracy. From these we compute strategies that optimize the rewards expected over blocks of trials administered with mixed difficulty and reward contingencies. The psychometric function is characterized by two parameters: its midpoint slope, which quantifies a subject's ability to extract signal from noise, and its shift, which measures the bias applied to account for unequal rewards. We fit these to data from two monkeys performing the moving dots task with mixed coherences and reward schedules. We find that their behaviors averaged over multiple sessions are close to optimal, with shifts erring in the direction of smaller penalties. We propose two methods for biasing the OU process to produce such shifts.

摘要

我们回顾了用于有提示响应的二选一强制选择决策的泄漏竞争累加器模型，并提出了扩展模型以解释不等奖励的影响。假设刺激信息在响应提示到来之前进行整合，并且刺激选择性神经元的放电率保持在生理范围内，该模型简化为一个奥恩斯坦 - 乌伦贝克（OU）过程，该过程为描述准确性的心理测量函数产生明确的表达式。由此我们计算出在具有混合难度和奖励条件的试验块中优化预期奖励的策略。心理测量函数由两个参数表征：其中点斜率，量化了受试者从噪声中提取信号的能力；其偏移量，衡量为解释不等奖励而应用的偏差。我们将这些参数拟合到两只猴子执行具有混合相干性和奖励计划的移动点任务的数据中。我们发现，它们在多个会话中的平均行为接近最优，偏差朝着较小惩罚的方向出现误差。我们提出了两种使OU过程产生这种偏差的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d69/2631644/fa52fe952655/pcbi.1000284.g001.jpg

相似文献

Can monkeys choose optimally when faced with noisy stimuli and unequal rewards?猴子在面对有噪声的刺激和不均等的奖励时能做出最优选择吗？

PLoS Comput Biol. 2009 Feb;5(2):e1000284. doi: 10.1371/journal.pcbi.1000284. Epub 2009 Feb 13.

Neuronal Activity in the Premotor Cortex of Monkeys Reflects Both Cue Salience and Motivation for Action Generation and Inhibition.猴子前运动皮层的神经元活动既反映了线索的显著程度，也反映了产生和抑制动作的动机。

J Neurosci. 2021 Sep 8;41(36):7591-7606. doi: 10.1523/JNEUROSCI.0641-20.2021. Epub 2021 Jul 30.

Neural signals in the monkey ventral striatum related to motivation for juice and cocaine rewards.猴子腹侧纹状体中与果汁和可卡因奖励动机相关的神经信号。

J Neurophysiol. 1996 Mar;75(3):1061-73. doi: 10.1152/jn.1996.75.3.1061.

Auditory cortex reflects goal-directed movement but is not necessary for behavioral adaptation in sound-cued reward tracking.听觉皮层反映了目标导向的运动，但对于声音提示奖励跟踪中的行为适应并非必需。

J Neurophysiol. 2020 Oct 1;124(4):1056-1071. doi: 10.1152/jn.00736.2019. Epub 2020 Aug 26.

Cue and reward signals carried by monkey entorhinal cortex neurons during reward schedules.奖励计划期间猴子内嗅皮层神经元携带的线索和奖励信号。

Exp Brain Res. 2007 Aug;181(2):267-76. doi: 10.1007/s00221-007-0926-z. Epub 2007 Mar 30.

Response differences in monkey TE and perirhinal cortex: stimulus association related to reward schedules.猴子颞下回和嗅周皮层的反应差异：与奖励计划相关的刺激关联

J Neurophysiol. 2000 Mar;83(3):1677-92. doi: 10.1152/jn.2000.83.3.1677.

Prefrontal coding of temporally discounted values during intertemporal choice.跨期选择过程中时间折扣价值的前额叶编码

Neuron. 2008 Jul 10;59(1):161-72. doi: 10.1016/j.neuron.2008.05.010.

Dopamine neuronal responses in monkeys performing visually cued reward schedules.执行视觉线索奖励计划的猴子的多巴胺神经元反应。

Eur J Neurosci. 2006 Jul;24(1):277-90. doi: 10.1111/j.1460-9568.2006.04905.x.

Modulation of Tonically Active Neurons of the Monkey Striatum by Events Carrying Different Force and Reward Information.携带不同力和奖励信息的事件对猕猴纹状体紧张性活动神经元的调制

J Neurosci. 2015 Nov 11;35(45):15214-26. doi: 10.1523/JNEUROSCI.0039-15.2015.

Neuronal activity during a cued strategy task: comparison of dorsolateral, orbital, and polar prefrontal cortex.在提示策略任务期间的神经元活动：背外侧、眶额和极前额皮质的比较。

J Neurosci. 2012 Aug 8;32(32):11017-31. doi: 10.1523/JNEUROSCI.1230-12.2012.

引用本文的文献

A solvable neural circuit model revealing the dynamical principle of non-optimal temporal weighting in perceptual decision making.一种可求解的神经回路模型揭示了知觉决策中非最优时间加权的动力学原理。

J Comput Neurosci. 2025 Sep;53(3):441-458. doi: 10.1007/s10827-025-00910-9. Epub 2025 Jul 29.

Stimulus uncertainty and relative reward rates determine adaptive responding in perceptual decision-making.刺激不确定性和相对奖励率决定了知觉决策中的适应性反应。

PLoS Comput Biol. 2025 May 27;21(5):e1012636. doi: 10.1371/journal.pcbi.1012636. eCollection 2025 May.

Rewarding animals based on their subjective percepts is enabled by online Bayesian estimation of perceptual biases.基于动物主观感知对其进行奖励可通过对感知偏差的在线贝叶斯估计来实现。

PLoS Biol. 2025 May 20;23(5):e3002764. doi: 10.1371/journal.pbio.3002764. eCollection 2025 May.

How to reward animals based on their subjective percepts: A Bayesian approach to online estimation of perceptual biases.如何基于动物的主观感知来奖励它们：一种用于在线估计感知偏差的贝叶斯方法。

bioRxiv. 2025 Mar 7:2024.07.25.605047. doi: 10.1101/2024.07.25.605047.

Sensory choices as logistic classification.感觉选择作为逻辑分类。

Neuron. 2024 Sep 4;112(17):2854-2868.e1. doi: 10.1016/j.neuron.2024.06.016. Epub 2024 Jul 15.

Sensory choices as logistic classification.作为逻辑分类的感官选择

bioRxiv. 2024 Jun 27:2024.01.17.576029. doi: 10.1101/2024.01.17.576029.

Neural Representations of Post-Decision Accuracy and Reward Expectation in the Caudate Nucleus and Frontal Eye Field.纹状体和额眼区中决策后准确性和奖励预期的神经表示。

J Neurosci. 2024 Jan 10;44(2):e0902232023. doi: 10.1523/JNEUROSCI.0902-23.2023.

Stable sound decoding despite modulated sound representation in the auditory cortex.尽管听觉皮层中声音表现形式发生调制，声音依然稳定解码。

Curr Biol. 2023 Oct 23;33(20):4470-4483.e7. doi: 10.1016/j.cub.2023.09.031. Epub 2023 Oct 5.

Stable sound decoding despite modulated sound representation in the auditory cortex.尽管听觉皮层中的声音表征存在调制，但声音解码仍保持稳定。

bioRxiv. 2023 Sep 15:2023.01.31.526457. doi: 10.1101/2023.01.31.526457.

Multiphasic value biases in fast-paced decisions.多阶段价值偏见在快节奏决策中的表现。

Elife. 2023 Feb 13;12:e67711. doi: 10.7554/eLife.67711.

本文引用的文献

Robust versus optimal strategies for two-alternative forced choice tasks.用于二选一强制选择任务的稳健策略与最优策略

J Math Psychol. 2010 Apr 1;54(2):230-246. doi: 10.1016/j.jmp.2009.12.004. Epub 2010 Jan 13.

Dynamic integration of reward and stimulus information in perceptual decision-making.在感知决策中，奖励和刺激信息的动态整合。

PLoS One. 2011 Mar 3;6(3):e16749. doi: 10.1371/journal.pone.0016749.

Reward rate optimization in two-alternative decision making: empirical tests of theoretical predictions.双选择决策中的奖励率优化：理论预测的实证检验。

J Exp Psychol Hum Percept Perform. 2009 Dec;35(6):1865-97. doi: 10.1037/a0016926.

Risk assessment in man and mouse.人和小鼠的风险评估。

Proc Natl Acad Sci U S A. 2009 Feb 17;106(7):2459-63. doi: 10.1073/pnas.0812709106. Epub 2009 Feb 2.

On diffusion processes with variable drift rates as models for decision making during learning.以具有可变漂移率的扩散过程作为学习过程中决策的模型

New J Phys. 2008 Jan 31;10(15006):nihpa49499. doi: 10.1088/1367-2630/10/1/015006.

Neurobiological models of two-choice decision making can be reduced to a one-dimensional nonlinear diffusion equation.双选决策的神经生物学模型可以简化为一维非线性扩散方程。

PLoS Comput Biol. 2008 Mar 28;4(3):e1000046. doi: 10.1371/journal.pcbi.1000046.

The diffusion decision model: theory and data for two-choice decision tasks.扩散决策模型：二选一决策任务的理论与数据

Neural Comput. 2008 Apr;20(4):873-922. doi: 10.1162/neco.2008.12-06-420.

The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks.最优决策的物理学：对二选一强制选择任务中表现模型的形式分析。

Psychol Rev. 2006 Oct;113(4):700-65. doi: 10.1037/0033-295X.113.4.700.

Microstimulation of macaque area LIP affects decision-making in a motion discrimination task.对猕猴外侧顶内沟区进行微刺激会影响运动辨别任务中的决策过程。

Nat Neurosci. 2006 May;9(5):682-9. doi: 10.1038/nn1683. Epub 2006 Apr 9.

A recurrent network mechanism of time integration in perceptual decisions.感知决策中时间整合的循环网络机制。

J Neurosci. 2006 Jan 25;26(4):1314-28. doi: 10.1523/JNEUROSCI.3733-05.2006.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

猴子在面对有噪声的刺激和不均等的奖励时能做出最优选择吗？

Can monkeys choose optimally when faced with noisy stimuli and unequal rewards?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献