Suppr超能文献

双选择决策中的奖励率优化:理论预测的实证检验。

Reward rate optimization in two-alternative decision making: empirical tests of theoretical predictions.

机构信息

Princeton Neuroscience Institute, Princeton University, USA.

出版信息

J Exp Psychol Hum Percept Perform. 2009 Dec;35(6):1865-97. doi: 10.1037/a0016926.

Abstract

The drift-diffusion model (DDM) implements an optimal decision procedure for stationary, 2-alternative forced-choice tasks. The height of a decision threshold applied to accumulating information on each trial determines a speed-accuracy tradeoff (SAT) for the DDM, thereby accounting for a ubiquitous feature of human performance in speeded response tasks. However, little is known about how participants settle on particular tradeoffs. One possibility is that they select SATs that maximize a subjective rate of reward earned for performance. For the DDM, there exist unique, reward-rate-maximizing values for its threshold and starting point parameters in free-response tasks that reward correct responses (R. Bogacz, E. Brown, J. Moehlis, P. Holmes, & J. D. Cohen, 2006). These optimal values vary as a function of response-stimulus interval, prior stimulus probability, and relative reward magnitude for correct responses. We tested the resulting quantitative predictions regarding response time, accuracy, and response bias under these task manipulations and found that grouped data conformed well to the predictions of an optimally parameterized DDM.

摘要

漂移-扩散模型(DDM)为静态、二择一强制选择任务实施了最优决策程序。在每个试验上积累信息时应用的决策阈值的高度决定了 DDM 的速度-准确性权衡(SAT),从而解释了人类在快速反应任务中的普遍表现特征。然而,参与者如何确定特定的权衡取舍知之甚少。一种可能性是他们选择 SAT,以使表现获得的主观奖励率最大化。对于 DDM,在奖励正确反应的自由反应任务中,其阈值和起始点参数存在唯一的、奖励率最大化的值(R. Bogacz、E. Brown、J. Moehlis、P. Holmes 和 J. D. Cohen,2006)。这些最优值随反应-刺激间隔、先验刺激概率和正确反应的相对奖励幅度而变化。我们根据这些任务操作检验了关于反应时、准确性和反应偏差的定量预测,发现分组数据与最优参数化 DDM 的预测非常吻合。

相似文献

2
Do humans produce the speed-accuracy trade-off that maximizes reward rate?人类是否会产生使奖励率最大化的速度-准确性权衡?
Q J Exp Psychol (Hove). 2010 May;63(5):863-91. doi: 10.1080/17470210903091643. Epub 2009 Sep 10.
3
Optimal decision making in neural inhibition models.神经抑制模型中的最优决策。
Psychol Rev. 2012 Jan;119(1):201-15. doi: 10.1037/a0026275. Epub 2011 Nov 21.
4
Rapid decision threshold modulation by reward rate in a neural network.神经网络中奖励率对快速决策阈值的调制
Neural Netw. 2006 Oct;19(8):1013-26. doi: 10.1016/j.neunet.2006.05.038. Epub 2006 Sep 20.
6
Explicit melioration by a neural diffusion model.神经扩散模型的显式改进。
Brain Res. 2009 Nov 24;1299:95-117. doi: 10.1016/j.brainres.2009.07.017. Epub 2009 Jul 30.

引用本文的文献

6
Modelling decision-making biases.决策偏差建模
Front Comput Neurosci. 2023 Oct 20;17:1222924. doi: 10.3389/fncom.2023.1222924. eCollection 2023.
7
Predictions and rewards affect decision-making but not subjective experience.预测和奖励会影响决策,但不会影响主观体验。
Proc Natl Acad Sci U S A. 2023 Oct 31;120(44):e2220749120. doi: 10.1073/pnas.2220749120. Epub 2023 Oct 25.
8
Contributions of the Basal Ganglia to Visual Perceptual Decisions.基底神经节对视觉感知决策的贡献。
Annu Rev Vis Sci. 2023 Sep 15;9:385-407. doi: 10.1146/annurev-vision-111022-123804.
10
Visuo-vestibular heading perception: a model system to study multi-sensory decision making.视-前庭头动感知:用于研究多感觉决策的模型系统。
Philos Trans R Soc Lond B Biol Sci. 2023 Sep 25;378(1886):20220334. doi: 10.1098/rstb.2022.0334. Epub 2023 Aug 7.

本文引用的文献

1
Robust versus optimal strategies for two-alternative forced choice tasks.用于二选一强制选择任务的稳健策略与最优策略
J Math Psychol. 2010 Apr 1;54(2):230-246. doi: 10.1016/j.jmp.2009.12.004. Epub 2010 Jan 13.
2
Do humans produce the speed-accuracy trade-off that maximizes reward rate?人类是否会产生使奖励率最大化的速度-准确性权衡?
Q J Exp Psychol (Hove). 2010 May;63(5):863-91. doi: 10.1080/17470210903091643. Epub 2009 Sep 10.
7
Rapid decision threshold modulation by reward rate in a neural network.神经网络中奖励率对快速决策阈值的调制
Neural Netw. 2006 Oct;19(8):1013-26. doi: 10.1016/j.neunet.2006.05.038. Epub 2006 Sep 20.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验