Karmali Faisal, Chaudhuri Shomesh E, Yi Yongwoo, Merfeld Daniel M
Jenks Vestibular Physiology Lab, Massachusetts Eye and Ear Infirmary, 243 Charles St., Boston, MA, 02114, USA.
Department of Otology and Laryngology, Harvard Medical School, Boston, MA, USA.
Exp Brain Res. 2016 Mar;234(3):773-89. doi: 10.1007/s00221-015-4501-8. Epub 2015 Dec 8.
When measuring thresholds, careful selection of stimulus amplitude can increase efficiency by increasing the precision of psychometric fit parameters (e.g., decreasing the fit parameter error bars). To find efficient adaptive algorithms for psychometric threshold ("sigma") estimation, we combined analytic approaches, Monte Carlo simulations, and human experiments for a one-interval, binary forced-choice, direction-recognition task. To our knowledge, this is the first time analytic results have been combined and compared with either simulation or human results. Human performance was consistent with theory and not significantly different from simulation predictions. Our analytic approach provides a bound on efficiency, which we compared against the efficiency of standard staircase algorithms, a modified staircase algorithm with asymmetric step sizes, and a maximum likelihood estimation (MLE) procedure. Simulation results suggest that optimal efficiency at determining threshold is provided by the MLE procedure targeting a fraction correct level of 0.92, an asymmetric 4-down, 1-up staircase targeting between 0.86 and 0.92 or a standard 6-down, 1-up staircase. Psychometric test efficiency, computed by comparing simulation and analytic results, was between 41 and 58% for 50 trials for these three algorithms, reaching up to 84% for 200 trials. These approaches were 13-21% more efficient than the commonly used 3-down, 1-up symmetric staircase. We also applied recent advances to reduce accuracy errors using a bias-reduced fitting approach. Taken together, the results lend confidence that the assumptions underlying each approach are reasonable and that human threshold forced-choice decision making is modeled well by detection theory models and mimics simulations based on detection theory models.
在测量阈值时,仔细选择刺激幅度可以通过提高心理测量拟合参数的精度(例如,减小拟合参数误差线)来提高效率。为了找到用于心理测量阈值(“sigma”)估计的高效自适应算法,我们将分析方法、蒙特卡罗模拟和人体实验结合起来,用于单区间、二项迫选、方向识别任务。据我们所知,这是首次将分析结果与模拟或人体结果进行结合和比较。人体表现与理论一致,与模拟预测没有显著差异。我们的分析方法提供了一个效率界限,我们将其与标准阶梯算法、具有不对称步长的改进阶梯算法以及最大似然估计(MLE)程序的效率进行了比较。模拟结果表明,针对正确分数水平为0.92的MLE程序、针对0.86至0.92之间的不对称4降1升阶梯或标准6降1升阶梯,在确定阈值时提供了最佳效率。通过比较模拟和分析结果计算得出的心理测量测试效率,对于这三种算法在50次试验时为41%至58%,在200次试验时高达84%。这些方法比常用的3降1升对称阶梯效率高13%至21%。我们还应用了最近的进展,使用偏差减少拟合方法来减少准确性误差。综上所述,这些结果让人相信每种方法背后的假设是合理的,并且人类阈值迫选决策可以通过检测理论模型很好地建模,并模仿基于检测理论模型的模拟。