Feng Samuel, Holmes Philip, Rorie Alan, Newsome William T
Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey, United States of America.
PLoS Comput Biol. 2009 Feb;5(2):e1000284. doi: 10.1371/journal.pcbi.1000284. Epub 2009 Feb 13.
We review the leaky competing accumulator model for two-alternative forced-choice decisions with cued responses, and propose extensions to account for the influence of unequal rewards. Assuming that stimulus information is integrated until the cue to respond arrives and that firing rates of stimulus-selective neurons remain well within physiological bounds, the model reduces to an Ornstein-Uhlenbeck (OU) process that yields explicit expressions for the psychometric function that describes accuracy. From these we compute strategies that optimize the rewards expected over blocks of trials administered with mixed difficulty and reward contingencies. The psychometric function is characterized by two parameters: its midpoint slope, which quantifies a subject's ability to extract signal from noise, and its shift, which measures the bias applied to account for unequal rewards. We fit these to data from two monkeys performing the moving dots task with mixed coherences and reward schedules. We find that their behaviors averaged over multiple sessions are close to optimal, with shifts erring in the direction of smaller penalties. We propose two methods for biasing the OU process to produce such shifts.
我们回顾了用于有提示响应的二选一强制选择决策的泄漏竞争累加器模型,并提出了扩展模型以解释不等奖励的影响。假设刺激信息在响应提示到来之前进行整合,并且刺激选择性神经元的放电率保持在生理范围内,该模型简化为一个奥恩斯坦 - 乌伦贝克(OU)过程,该过程为描述准确性的心理测量函数产生明确的表达式。由此我们计算出在具有混合难度和奖励条件的试验块中优化预期奖励的策略。心理测量函数由两个参数表征:其中点斜率,量化了受试者从噪声中提取信号的能力;其偏移量,衡量为解释不等奖励而应用的偏差。我们将这些参数拟合到两只猴子执行具有混合相干性和奖励计划的移动点任务的数据中。我们发现,它们在多个会话中的平均行为接近最优,偏差朝着较小惩罚的方向出现误差。我们提出了两种使OU过程产生这种偏差的方法。