Moran Rani
School of Psychological Sciences, Tel Aviv University, Ramat Aviv, POB 39040, Tel-Aviv, 69978, Israel,
Psychon Bull Rev. 2015 Feb;22(1):38-53. doi: 10.3758/s13423-014-0669-3.
The issue of optimal performance in speeded two-choice tasks has played a substantial role in the development and evaluation of decision making theories. For difficulty-homogeneous environments, the means to achieve optimality are prescribed by the sequential probability ratio test (SPRT), or equivalently, by the drift diffusion model (DDM). Biases in the external environments are easily accommodated into these models by adopting a prior integration bias. However, for difficulty-heterogeneous environments, the issue is more elusive. I show that in such cases, the SPRT and the DDM are no longer equivalent and both are suboptimal. Optimality is achieved by a diffusion-like accumulation of evidence while adjusting the choice thresholds during the time course of a trial. In the second part of the paper, assuming that decisions are made according to the popular DDM, I show that optimal performance in biased environments mandates incorporating a dynamic-bias component (a shift in the drift threshold) in addition to the prior bias (a shift in the starting point) into the model. These conclusions support a conjecture by Hanks, Mazurek, Kiani, Hopp, and Shadlen, (The Journal of Neuroscience, 31(17), 6339-6352, 2011) and contradict a recent attempt to refute this conjecture by arguing that optimality is achieved with the aid of prior bias alone (van Ravenzwaaij et al., 2012). The psychological plausibility of such "mathematically optimal" strategies is discussed. The current paper contributes to the ongoing effort to understand optimal behavior in biased and heterogeneous environments and corrects prior conclusions with respect to optimality in such conditions.
在限时二选一任务中的最优表现问题,在决策理论的发展和评估中发挥了重要作用。对于难度均匀的环境,实现最优性的方法由序贯概率比检验(SPRT)规定,或者等效地,由漂移扩散模型(DDM)规定。通过采用先验整合偏差,外部环境中的偏差很容易纳入这些模型。然而,对于难度异质的环境,问题更加难以捉摸。我表明,在这种情况下,SPRT和DDM不再等效,并且两者都是次优的。最优性是通过类似扩散的证据积累来实现的,同时在试验过程中调整选择阈值。在本文的第二部分,假设决策是根据流行的DDM做出的,我表明在有偏差的环境中实现最优表现要求除了先验偏差(起点的偏移)之外,还将动态偏差成分(漂移阈值的偏移)纳入模型。这些结论支持了汉克斯、马祖雷克、基亚尼、霍普和沙德伦的猜想(《神经科学杂志》,31(17),6339 - 6352,2011),并与最近试图反驳这一猜想的观点相矛盾,后者认为仅借助先验偏差就能实现最优性(范·拉文兹瓦伊等,2012)。讨论了这种“数学上最优”策略的心理合理性。本文有助于正在进行的理解有偏差和异质环境中最优行为的努力,并纠正了关于此类条件下最优性的先前结论。