Gallis John A, Li Fan, Turner Elizabeth L
Department of Biostatistics and Bioinformatics, Duke University, Duke Global Health Institute, Durham, NC.
Department of Biostatistics, Yale School of Public Health, New Haven, CT.
Stata J. 2020 Jun;20(2):363-381. doi: 10.1177/1536867x20931001. Epub 2020 Jun 19.
Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.
整群随机试验是将整群(例如学校或诊所)随机分配到各个比较组,但对个体进行测量,常用于评估公共卫生、教育和社会科学领域的干预措施。分析通常针对个体层面的结果进行,且此类分析方法必须考虑到同一整群成员的结果往往比其他整群成员的结果更相似。一种常用的个体层面分析技术是广义估计方程(GEE)。然而,通常整群的随机分组数量较少(例如30个或更少),在这种情况下,从三明治方差估计器获得的GEE标准误将产生偏差,导致I型错误膨胀。已经提出并研究了一些偏差校正标准误来解决这种有限样本偏差问题,但在Stata中尚未实现。在本文中,我们描述了几种对稳健三明治方差的常用偏差校正方法。然后我们介绍新创建的命令xtgeebcv,它将使Stata用户能够轻松地对从GEE模型获得的标准误应用有限样本校正。接着我们提供示例来演示xtgeebcv的用法。最后,我们讨论关于在何种情况下使用哪种有限样本校正的建议,并考虑未来可能改进xtgeebcv的研究领域。