Braschel Melissa C, Svec Ivana, Darlington Gerarda A, Donner Allan
Department of Mathematics & Statistics, University of Guelph, Guelph, ON, Canada
Department of Mathematics & Statistics, University of Guelph, Guelph, ON, Canada.
Clin Trials. 2016 Apr;13(2):180-7. doi: 10.1177/1740774515606377. Epub 2015 Sep 28.
Many investigators rely on previously published point estimates of the intraclass correlation coefficient rather than on their associated confidence intervals to determine the required size of a newly planned cluster randomized trial. Although confidence interval methods for the intraclass correlation coefficient that can be applied to community-based trials have been developed for a continuous outcome variable, fewer methods exist for a binary outcome variable. The aim of this study is to evaluate confidence interval methods for the intraclass correlation coefficient applied to binary outcomes in community intervention trials enrolling a small number of large clusters. Existing methods for confidence interval construction are examined and compared to a new ad hoc approach based on dividing clusters into a large number of smaller sub-clusters and subsequently applying existing methods to the resulting data.
Monte Carlo simulation is used to assess the width and coverage of confidence intervals for the intraclass correlation coefficient based on Smith's large sample approximation of the standard error of the one-way analysis of variance estimator, an inverted modified Wald test for the Fleiss-Cuzick estimator, and intervals constructed using a bootstrap-t applied to a variance-stabilizing transformation of the intraclass correlation coefficient estimate. In addition, a new approach is applied in which clusters are randomly divided into a large number of smaller sub-clusters with the same methods applied to these data (with the exception of the bootstrap-t interval, which assumes large cluster sizes). These methods are also applied to a cluster randomized trial on adolescent tobacco use for illustration.
When applied to a binary outcome variable in a small number of large clusters, existing confidence interval methods for the intraclass correlation coefficient provide poor coverage. However, confidence intervals constructed using the new approach combined with Smith's method provide nominal or close to nominal coverage when the intraclass correlation coefficient is small (<0.05), as is the case in most community intervention trials.
This study concludes that when a binary outcome variable is measured in a small number of large clusters, confidence intervals for the intraclass correlation coefficient may be constructed by dividing existing clusters into sub-clusters (e.g. groups of 5) and using Smith's method. The resulting confidence intervals provide nominal or close to nominal coverage across a wide range of parameters when the intraclass correlation coefficient is small (<0.05). Application of this method should provide investigators with a better understanding of the uncertainty associated with a point estimator of the intraclass correlation coefficient used for determining the sample size needed for a newly designed community-based trial.
许多研究者在确定新计划的整群随机试验所需样本量时,依赖于先前发表的组内相关系数的点估计值,而非其相关的置信区间。尽管已针对连续结局变量开发出可应用于社区试验的组内相关系数置信区间方法,但针对二分类结局变量的方法较少。本研究的目的是评估在纳入少量大群组的社区干预试验中,应用于二分类结局的组内相关系数置信区间方法。研究对现有的置信区间构建方法进行了检验,并与一种基于将群组划分为大量较小子群组并随后将现有方法应用于所得数据的新的临时方法进行比较。
采用蒙特卡罗模拟,基于方差分析估计量标准误的史密斯大样本近似、Fleiss-Cuzick估计量的反向修正Wald检验以及应用于组内相关系数估计量方差稳定变换的自抽样t法,来评估组内相关系数置信区间的宽度和覆盖率。此外,应用一种新方法,即将群组随机划分为大量较小子群组,并将相同方法应用于这些数据(自抽样t区间除外,其假定群组规模较大)。这些方法也应用于一项关于青少年烟草使用的整群随机试验以作说明。
当应用于少量大群组中的二分类结局变量时,现有的组内相关系数置信区间方法覆盖率较差。然而,当组内相关系数较小时(<0.05),如大多数社区干预试验的情况,使用新方法结合史密斯方法构建的置信区间提供了标称或接近标称的覆盖率。
本研究得出结论,当在少量大群组中测量二分类结局变量时,组内相关系数的置信区间可通过将现有群组划分为子群组(如每组5个)并使用史密斯方法来构建。当组内相关系数较小时(<0.05),所得置信区间在广泛的参数范围内提供标称或接近标称的覆盖率。应用此方法应能让研究者更好地理解与用于确定新设计的社区试验所需样本量的组内相关系数点估计值相关的不确定性。