在群组随机试验和拟合二项结局的 GEE 型边缘模型的背景下，一种现成的改进方法，可以用于估计群组内相关性。

A readily available improvement over method of moments for intra-cluster correlation estimation in the context of cluster randomized trials and fitting a GEE-type marginal model for binary outcomes.

机构信息

Department of Biostatistics, College of Public Health, University of Kentucky, Lexington, KY, USA.

出版信息

Clin Trials. 2019 Feb;16(1):41-51. doi: 10.1177/1740774518803635. Epub 2018 Oct 8.

Abstract

BACKGROUND/AIMS: Cluster randomized trials are popular in health-related research due to the need or desire to randomize clusters of subjects to different trial arms as opposed to randomizing each subject individually. As outcomes from subjects within the same cluster tend to be more alike than outcomes from subjects within other clusters, an exchangeable correlation arises that is measured via the intra-cluster correlation coefficient. Intra-cluster correlation coefficient estimation is especially important due to the increasing awareness of the need to publish such values from studies in order to help guide the design of future cluster randomized trials. Therefore, numerous methods have been proposed to accurately estimate the intra-cluster correlation coefficient, with much attention given to binary outcomes. As marginal models are often of interest, we focus on intra-cluster correlation coefficient estimation in the context of fitting such a model with binary outcomes using generalized estimating equations. Traditionally, intra-cluster correlation coefficient estimation with generalized estimating equations has been based on the method of moments, although such estimators can be negatively biased. Furthermore, alternative estimators that work well, such as the analysis of variance estimator, are not as readily applicable in the context of practical data analyses with generalized estimating equations. Therefore, in this article we assess, in terms of bias, the readily available residual pseudo-likelihood approach to intra-cluster correlation coefficient estimation with the GLIMMIX procedure of SAS (SAS Institute, Cary, NC). Furthermore, we study a possible corresponding approach to confidence interval construction for the intra-cluster correlation coefficient.

METHODS

We utilize a simulation study and application example to assess bias in intra-cluster correlation coefficient estimates obtained from GLIMMIX using residual pseudo-likelihood. This estimator is contrasted with method of moments and analysis of variance estimators which are standards of comparison. The approach to confidence interval construction is assessed by examining coverage probabilities.

RESULTS

Overall, the residual pseudo-likelihood estimator performs very well. It has considerably less bias than moment estimators, which are its competitor for general generalized estimating equation-based analyses, and therefore, it is a major improvement in practice. Furthermore, it works almost as well as analysis of variance estimators when they are applicable. Confidence intervals have near-nominal coverage when the intra-cluster correlation coefficient estimate has negligible bias.

CONCLUSION

Our results show that the residual pseudo-likelihood estimator is a good option for intra-cluster correlation coefficient estimation when conducting a generalized estimating equation-based analysis of binary outcome data arising from cluster randomized trials. The estimator is practical in that it is simply a result from fitting a marginal model with GLIMMIX, and a confidence interval can be easily obtained. An additional advantage is that, unlike most other options for performing generalized estimating equation-based analyses, GLIMMIX provides analysts the option to utilize small-sample adjustments that ensure valid inference.

摘要

背景/目的：由于需要或希望将受试者的群组随机分配到不同的试验臂，而不是对每个受试者进行单独随机分配，因此群组随机试验在与健康相关的研究中很受欢迎。由于同一群组内的受试者的结果往往比其他群组内的受试者的结果更相似，因此会产生可交换的相关性，这种相关性可以通过组内相关系数来衡量。由于越来越意识到需要公布此类研究的结果值，以帮助指导未来群组随机试验的设计，因此组内相关系数的估计尤其重要。因此，已经提出了许多方法来准确估计组内相关系数，并且非常关注二进制结果。由于边缘模型通常很感兴趣，因此我们专注于使用广义估计方程拟合具有二进制结果的此类模型的情况下的组内相关系数估计。传统上，使用广义估计方程的组内相关系数估计是基于矩法的，尽管此类估计可能存在负偏差。此外，方差分析估计器等效果良好的替代估计器在使用广义估计方程进行实际数据分析的情况下并不那么容易应用。因此，在本文中，我们根据偏差评估了 SAS 的 GLIMMIX 过程（SAS Institute，Cary，NC）中使用剩余拟似然法进行组内相关系数估计的情况。此外，我们研究了一种可能的对应方法来构建组内相关系数的置信区间。

方法

我们利用模拟研究和应用实例来评估使用剩余拟似然法从 GLIMMIX 获得的组内相关系数估计值的偏差。该估计器与矩法估计器和方差分析估计器进行对比，这两种估计器是比较的标准。通过检查覆盖概率来评估置信区间构建的方法。

结果

总体而言，剩余拟似然估计器的性能非常好。它的偏差明显小于矩法估计器，后者是其用于一般基于广义估计方程的分析的竞争对手，因此在实践中是一个重大改进。此外，当它们适用时，它的效果几乎与方差分析估计器一样好。当组内相关系数估计值偏差可忽略不计时，置信区间的覆盖率接近名义值。

结论

我们的结果表明，当对来自群组随机试验的二进制结果数据进行基于广义估计方程的分析时，剩余拟似然估计器是组内相关系数估计的一个不错选择。该估计器很实用，因为它只是使用 GLIMMIX 拟合边缘模型的结果，并且可以轻松获得置信区间。另一个优点是，与执行基于广义估计方程的分析的大多数其他选项不同，GLIMMIX 为分析人员提供了使用小样本调整的选项，以确保有效的推断。