Faculty of Psychology and Educational Sciences, KU Leuven-University of Leuven, Leuven, Belgium.
Behav Res Methods. 2018 Apr;50(2):557-575. doi: 10.3758/s13428-017-0885-7.
The conditional power (CP) of the randomization test (RT) was investigated in a simulation study in which three different single-case effect size (ES) measures were used as the test statistics: the mean difference (MD), the percentage of nonoverlapping data (PND), and the nonoverlap of all pairs (NAP). Furthermore, we studied the effect of the experimental design on the RT's CP for three different single-case designs with rapid treatment alternation: the completely randomized design (CRD), the randomized block design (RBD), and the restricted randomized alternation design (RRAD). As a third goal, we evaluated the CP of the RT for three types of simulated data: data generated from a standard normal distribution, data generated from a uniform distribution, and data generated from a first-order autoregressive Gaussian process. The results showed that the MD and NAP perform very similarly in terms of CP, whereas the PND performs substantially worse. Furthermore, the RRAD yielded marginally higher power in the RT, followed by the CRD and then the RBD. Finally, the power of the RT was almost unaffected by the type of the simulated data. On the basis of the results of the simulation study, we recommend at least 20 measurement occasions for single-case designs with a randomized treatment order that are to be evaluated with an RT using a 5% significance level. Furthermore, we do not recommend use of the PND, because of its low power in the RT.
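To make the three test statistics concrete, the following is a minimal sketch of a one-sided RT for a CRD, not the authors' simulation code. It assumes that the CRD fixes the number of measurement occasions per condition, that higher scores are expected under treatment (B) than under baseline (A), and that all admissible assignments can be enumerated exhaustively. The function names and the example data are hypothetical.

```python
from itertools import combinations
from statistics import mean


def mean_difference(a, b):
    """MD: mean of the B scores minus mean of the A scores."""
    return mean(b) - mean(a)


def pnd(a, b):
    """PND: percentage of B scores that exceed the highest A score."""
    ceiling = max(a)
    return 100.0 * sum(y > ceiling for y in b) / len(b)


def nap(a, b):
    """NAP: share of (A, B) pairs in which B exceeds A; ties count one half."""
    wins = sum((y > x) + 0.5 * (y == x) for x in a for y in b)
    return wins / (len(a) * len(b))


def randomization_test(scores, labels, statistic):
    """One-sided RT for a CRD (assumed: fixed number of B occasions).

    Returns the observed statistic and the proportion of admissible
    assignments whose statistic is at least as large as the observed one.
    """
    n = len(scores)
    n_b = labels.count("B")

    def split(b_positions):
        a = [scores[i] for i in range(n) if i not in b_positions]
        b = [scores[i] for i in range(n) if i in b_positions]
        return a, b

    observed_positions = {i for i, lab in enumerate(labels) if lab == "B"}
    observed = statistic(*split(observed_positions))

    # Reference distribution: the statistic under every possible assignment.
    values = [statistic(*split(set(c))) for c in combinations(range(n), n_b)]
    extreme = sum(v >= observed - 1e-12 for v in values)  # tolerance for floats
    return observed, extreme / len(values)


# Hypothetical data: 8 measurement occasions, 4 per condition.
labels = list("ABBABAAB")
scores = [2, 6, 7, 3, 8, 1, 4, 5]

results = {
    stat.__name__: randomization_test(scores, labels, stat)
    for stat in (mean_difference, pnd, nap)
}
for name, (observed, p_value) in results.items():
    print(f"{name}: observed = {observed}, p = {p_value:.4f}")
```

In this example the B scores are perfectly separated from the A scores, so only one of the 70 admissible assignments reaches the observed value of each statistic, and the p value is 1/70 for all three. With 4 occasions per condition this is also the smallest attainable p value, which illustrates why a short series limits the power of the RT at a 5% significance level.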