Balci Fuat, Simen Patrick, Niyogi Ritwik, Saxe Andrew, Hughes Jessica A, Holmes Philip, Cohen Jonathan D
Princeton Neuroscience Institute, Princeton University, Green Hall, Princeton, NJ 08544, USA.
Atten Percept Psychophys. 2011 Feb;73(2):640-57. doi: 10.3758/s13414-010-0049-7.
Speed-accuracy trade-offs strongly influence the rate of reward that can be earned in many decision-making tasks. Previous reports suggest that human participants often adopt suboptimal speed-accuracy trade-offs in single session, two-alternative forced-choice tasks. We investigated whether humans acquired optimal speed-accuracy trade-offs when extensively trained with multiple signal qualities. When performance was characterized in terms of decision time and accuracy, our participants eventually performed nearly optimally in the case of higher signal qualities. Rather than adopting decision criteria that were individually optimal for each signal quality, participants adopted a single threshold that was nearly optimal for most signal qualities. However, setting a single threshold for different coherence conditions resulted in only negligible decrements in the maximum possible reward rate. Finally, we tested two hypotheses regarding the possible sources of suboptimal performance: (1) favoring accuracy over reward rate and (2) misestimating the reward rate due to timing uncertainty. Our findings provide support for both hypotheses, but also for the hypothesis that participants can learn to approach optimality. We find specifically that an accuracy bias dominates early performance, but diminishes greatly with practice. The residual discrepancy between optimal and observed performance can be explained by an adaptive response to uncertainty in time estimation.
速度-准确性权衡在许多决策任务中对可获得的奖励率有强烈影响。先前的报告表明,人类参与者在单次会话的二选一强制选择任务中常常采用次优的速度-准确性权衡。我们研究了人类在接受多种信号质量的广泛训练后是否会获得最优的速度-准确性权衡。当根据决策时间和准确性来表征表现时,我们的参与者在较高信号质量的情况下最终表现近乎最优。参与者并非采用针对每种信号质量各自最优的决策标准,而是采用了一个对大多数信号质量近乎最优的单一阈值。然而,为不同的连贯性条件设置单一阈值只会导致最大可能奖励率出现可忽略不计的下降。最后,我们测试了关于次优表现可能来源的两个假设:(1)相较于奖励率更倾向于准确性;(2)由于时间不确定性而错误估计奖励率。我们的研究结果为这两个假设提供了支持,同时也为参与者能够学会接近最优性这一假设提供了支持。我们特别发现,准确性偏差在早期表现中占主导,但随着练习会大幅减少。最优表现与观察到的表现之间的残余差异可以通过对时间估计不确定性的适应性反应来解释。