Watson Samuel I, Girling Alan, Hemming Karla
Institute of Applied Health Research, University of Birmingham, Birmingham, UK.
Stat Med. 2021 Feb 28;40(5):1133-1146. doi: 10.1002/sim.8828. Epub 2020 Nov 30.
In this article, we review and evaluate a number of methods used in the design and analysis of small three-arm parallel cluster randomized trials. We conduct a simulation-based study to evaluate restricted randomization methods including covariate-constrained randomization and a novel method for matched-group cluster randomization. We also evaluate the appropriate modelling of the data and small sample inferential methods for a variety of treatment effects relevant to three-arm trials. Our results indicate that small-sample corrections are required for high (0.05) but not low (0.001) values of the intraclass correlation coefficient and their performance can depend on trial design, number of clusters, and the nature of the hypothesis being tested. The Satterthwaite correction generally performed best at an ICC of 0.05 with a nominal type I error rate for single-period trials, and in trials with repeated measures type I error rates were between 0.04 and 0.06. Restricted randomization methods produce little benefit in trials with repeated measures but in trials with single post-intervention design can provide relatively large gains in power when compared to the most unbalanced possible allocations. Matched-group randomization improves power but is not as effective as covariate-constrained randomization. For model-based analysis, adjusting for fewer covariates than were used in a restricted randomization process under any design can produce non-nominal type I error rates and reductions in power. Where comparisons to two-arm cluster trials are possible, the performance of the methods is qualitatively very similar.
在本文中,我们回顾并评估了一些用于小型三臂平行整群随机试验设计与分析的方法。我们开展了一项基于模拟的研究,以评估受限随机化方法,包括协变量约束随机化和一种用于匹配组整群随机化的新方法。我们还评估了数据的适当建模以及针对与三臂试验相关的各种治疗效果的小样本推断方法。我们的结果表明,对于组内相关系数的高值(0.05)而非低值(0.001),需要进行小样本校正,并且它们的性能可能取决于试验设计、整群数量以及所检验假设的性质。在单周期试验中,当组内相关系数为0.05时,萨特思韦特校正总体表现最佳,名义I型错误率符合要求;在重复测量试验中,I型错误率在0.04至0.06之间。受限随机化方法在重复测量试验中益处不大,但在单干预后设计试验中,与最不平衡的可能分配相比,可在检验效能方面提供相对较大的提升。匹配组随机化可提高检验效能,但不如协变量约束随机化有效。对于基于模型的分析,在任何设计下,若调整的协变量数量少于受限随机化过程中使用的协变量数量,可能会产生非名义I型错误率并降低检验效能。在可以与双臂整群试验进行比较的情况下,这些方法的性能在定性上非常相似。