Westgate Philip M
Department of Biostatistics, College of Public Health, University of Kentucky, 725 Rose Street, Lexington, KY 40536, USA.
Biom J. 2013 Sep;55(5):789-806. doi: 10.1002/bimj.201200237. Epub 2013 Jul 15.
Group randomized trials (GRTs) randomize groups, or clusters, of people to intervention or control arms. To test for the effectiveness of the intervention when subject-level outcomes are binary, and while fitting a marginal model that adjusts for cluster-level covariates and utilizes a logistic link, we develop a pseudo-Wald statistic to improve inference. Alternative Wald statistics could employ bias-corrected empirical sandwich standard error estimates, which have received limited attention in the GRT literature despite their broad utility and applicability in our settings of interest. The test could also be carried out using popular approaches based upon cluster-level summary outcomes. A simulation study covering a variety of realistic GRT settings is used to compare the accuracy of these methods in terms of producing nominal test sizes. Tests based upon the pseudo-Wald statistic and a cluster-level summary approach utilizing the natural log of observed cluster-level odds worked best. Due to weighting, some popular cluster-level summary approaches were found to lead to invalid inference in many settings. Finally, although use of bias-corrected empirical sandwich standard error estimates did not consistently result in nominal sizes, they did work well, thus supporting the applicability of marginal models in GRT settings.
群组随机试验(GRTs)将人群分组或聚类,随机分配至干预组或对照组。为了在个体水平结局为二元变量时检验干预效果,同时拟合一个调整聚类水平协变量并使用逻辑链接的边际模型,我们开发了一种伪 Wald 统计量来改进推断。替代的 Wald 统计量可以采用偏差校正的经验三明治标准误差估计,尽管它们在我们感兴趣的设置中具有广泛的效用和适用性,但在 GRT 文献中受到的关注有限。该检验也可以使用基于聚类水平汇总结局的常用方法进行。一项涵盖各种现实 GRT 设置的模拟研究用于比较这些方法在产生名义检验规模方面的准确性。基于伪 Wald 统计量和使用观察到的聚类水平优势比自然对数的聚类水平汇总方法的检验效果最佳。由于加权,发现一些流行的聚类水平汇总方法在许多设置中会导致无效推断。最后,尽管使用偏差校正的经验三明治标准误差估计并没有始终产生名义规模,但它们确实效果良好,从而支持了边际模型在 GRT 设置中的适用性。