Suppr超能文献

决策标准的获取:奖励率最终胜过准确性。

Acquisition of decision making criteria: reward rate ultimately beats accuracy.

作者信息

Balci Fuat, Simen Patrick, Niyogi Ritwik, Saxe Andrew, Hughes Jessica A, Holmes Philip, Cohen Jonathan D

机构信息

Princeton Neuroscience Institute, Princeton University, Green Hall, Princeton, NJ 08544, USA.

出版信息

Atten Percept Psychophys. 2011 Feb;73(2):640-57. doi: 10.3758/s13414-010-0049-7.

Abstract

Speed-accuracy trade-offs strongly influence the rate of reward that can be earned in many decision-making tasks. Previous reports suggest that human participants often adopt suboptimal speed-accuracy trade-offs in single session, two-alternative forced-choice tasks. We investigated whether humans acquired optimal speed-accuracy trade-offs when extensively trained with multiple signal qualities. When performance was characterized in terms of decision time and accuracy, our participants eventually performed nearly optimally in the case of higher signal qualities. Rather than adopting decision criteria that were individually optimal for each signal quality, participants adopted a single threshold that was nearly optimal for most signal qualities. However, setting a single threshold for different coherence conditions resulted in only negligible decrements in the maximum possible reward rate. Finally, we tested two hypotheses regarding the possible sources of suboptimal performance: (1) favoring accuracy over reward rate and (2) misestimating the reward rate due to timing uncertainty. Our findings provide support for both hypotheses, but also for the hypothesis that participants can learn to approach optimality. We find specifically that an accuracy bias dominates early performance, but diminishes greatly with practice. The residual discrepancy between optimal and observed performance can be explained by an adaptive response to uncertainty in time estimation.

摘要

速度-准确性权衡在许多决策任务中对可获得的奖励率有强烈影响。先前的报告表明,人类参与者在单次会话的二选一强制选择任务中常常采用次优的速度-准确性权衡。我们研究了人类在接受多种信号质量的广泛训练后是否会获得最优的速度-准确性权衡。当根据决策时间和准确性来表征表现时,我们的参与者在较高信号质量的情况下最终表现近乎最优。参与者并非采用针对每种信号质量各自最优的决策标准,而是采用了一个对大多数信号质量近乎最优的单一阈值。然而,为不同的连贯性条件设置单一阈值只会导致最大可能奖励率出现可忽略不计的下降。最后,我们测试了关于次优表现可能来源的两个假设:(1)相较于奖励率更倾向于准确性;(2)由于时间不确定性而错误估计奖励率。我们的研究结果为这两个假设提供了支持,同时也为参与者能够学会接近最优性这一假设提供了支持。我们特别发现,准确性偏差在早期表现中占主导,但随着练习会大幅减少。最优表现与观察到的表现之间的残余差异可以通过对时间估计不确定性的适应性反应来解释。

相似文献

1
Acquisition of decision making criteria: reward rate ultimately beats accuracy.
Atten Percept Psychophys. 2011 Feb;73(2):640-57. doi: 10.3758/s13414-010-0049-7.
2
Do humans produce the speed-accuracy trade-off that maximizes reward rate?
Q J Exp Psychol (Hove). 2010 May;63(5):863-91. doi: 10.1080/17470210903091643. Epub 2009 Sep 10.
3
Reward rate optimization in two-alternative decision making: empirical tests of theoretical predictions.
J Exp Psychol Hum Percept Perform. 2009 Dec;35(6):1865-97. doi: 10.1037/a0016926.
7
Optimal temporal risk assessment.
Front Integr Neurosci. 2011 Sep 27;5:56. doi: 10.3389/fnint.2011.00056. eCollection 2011.
8
Time-based reward maximization.
Philos Trans R Soc Lond B Biol Sci. 2014 Jan 20;369(1637):20120461. doi: 10.1098/rstb.2012.0461. Print 2014 Mar 5.
9
Decision-making impairments in the context of intact reward sensitivity in schizophrenia.
Biol Psychiatry. 2008 Jul 1;64(1):62-9. doi: 10.1016/j.biopsych.2008.02.015. Epub 2008 Apr 2.

引用本文的文献

1
Normative evidence weighing and accumulation in correlated environments.
Elife. 2025 Jul 14;13:RP100258. doi: 10.7554/eLife.100258.
2
Time and memory costs jointly determine a speed-accuracy trade-off and set-size effects.
J Exp Psychol Gen. 2025 Jun;154(6):1611-1627. doi: 10.1037/xge0001760. Epub 2025 Apr 7.
3
Mice monitor their timing errors.
Sci Rep. 2024 Oct 7;14(1):23356. doi: 10.1038/s41598-024-71921-2.
4
Normative evidence weighing and accumulation in correlated environments.
bioRxiv. 2025 Feb 8:2024.05.29.596489. doi: 10.1101/2024.05.29.596489.
5
How cortico-basal ganglia-thalamic subnetworks can shift decision policies to maximize reward rate.
bioRxiv. 2025 Jan 5:2024.05.21.595174. doi: 10.1101/2024.05.21.595174.
6
Decision-making processes in perceptual learning depend on effectors.
Sci Rep. 2024 Mar 7;14(1):5644. doi: 10.1038/s41598-024-55508-5.
8
Capacity Limits Lead to Information Bottlenecks in Ongoing Rapid Motor Behaviors.
eNeuro. 2023 Mar 13;10(3). doi: 10.1523/ENEURO.0289-22.2023. Print 2023 Mar.
9
Strategically managing learning during perceptual decision making.
Elife. 2023 Feb 14;12:e64978. doi: 10.7554/eLife.64978.
10
Normative decision rules in changing environments.
Elife. 2022 Oct 25;11:e79824. doi: 10.7554/eLife.79824.

本文引用的文献

1
Robust versus optimal strategies for two-alternative forced choice tasks.
J Math Psychol. 2010 Apr 1;54(2):230-246. doi: 10.1016/j.jmp.2009.12.004. Epub 2010 Jan 13.
3
Increasing Speed of Processing With Action Video Games.
Curr Dir Psychol Sci. 2009;18(6):321-326. doi: 10.1111/j.1467-8721.2009.01660.x.
4
Reward rate optimization in two-alternative decision making: empirical tests of theoretical predictions.
J Exp Psychol Hum Percept Perform. 2009 Dec;35(6):1865-97. doi: 10.1037/a0016926.
5
A diffusion model decomposition of the practice effect.
Psychon Bull Rev. 2009 Dec;16(6):1026-36. doi: 10.3758/16.6.1026.
6
Do humans produce the speed-accuracy trade-off that maximizes reward rate?
Q J Exp Psychol (Hove). 2010 May;63(5):863-91. doi: 10.1080/17470210903091643. Epub 2009 Sep 10.
7
Risk assessment in man and mouse.
Proc Natl Acad Sci U S A. 2009 Feb 17;106(7):2459-63. doi: 10.1073/pnas.0812709106. Epub 2009 Feb 2.
8
Diffusion model analysis with MATLAB: a DMAT primer.
Behav Res Methods. 2008 Feb;40(1):61-72. doi: 10.3758/brm.40.1.61.
9
Cross-domain transfer of quantitative discriminations: is it all a matter of proportion?
Psychon Bull Rev. 2006 Aug;13(4):636-42. doi: 10.3758/bf03193974.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验