CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Eval Rev. 2024 Dec;48(6):1088-1114. doi: 10.1177/0193841X231169557. Epub 2024 Jan 17.
When planning a cluster randomized trial, evaluators often have access to an enumerated cohort representing the target population of clusters. Practicalities of conducting the trial, such as the need to oversample clusters with certain characteristics in order to improve trial economy or support inferences about subgroups of clusters, may preclude simple random sampling from the cohort into the trial, and thus interfere with the goal of producing generalizable inferences about the target population. We describe a nested trial design where the randomized clusters are embedded within a cohort of trial-eligible clusters from the target population and where clusters are selected for inclusion in the trial with known sampling probabilities that may depend on cluster characteristics (e.g., allowing clusters to be chosen to facilitate trial conduct or to examine hypotheses related to their characteristics). We develop and evaluate methods for analyzing data from this design to generalize causal inferences to the target population underlying the cohort. We present identification and estimation results for the expectation of the average potential outcome and for the average treatment effect, in the entire target population of clusters and in its non-randomized subset. In simulation studies, we show that all the estimators have low bias but markedly different precision. Cluster randomized trials where clusters are selected for inclusion with known sampling probabilities that depend on cluster characteristics, combined with efficient estimation methods, can precisely quantify treatment effects in the target population, while addressing objectives of trial conduct that require oversampling clusters on the basis of their characteristics.
当规划集群随机试验时,评估人员通常可以访问代表集群目标人群的枚举队列。进行试验的实际情况,例如需要对具有某些特征的集群进行过采样,以提高试验经济性或支持对集群子组的推断,可能会阻止从队列中简单随机抽样到试验中,从而干扰对目标人群产生可推广推断的目标。我们描述了一种嵌套试验设计,其中随机分组嵌套在目标人群中试验合格分组的队列中,并且分组是根据已知的抽样概率选择纳入试验的,这些概率可能取决于分组特征(例如,允许选择分组来方便试验进行或检验与分组特征相关的假设)。我们开发并评估了从这种设计中分析数据的方法,以便将因果推断推广到队列所基于的目标人群。我们提出了整个目标人群中以及非随机分组子集的平均潜在结果和平均处理效果的期望的识别和估计结果。在模拟研究中,我们表明所有估计量的偏差都很小,但精度却有很大的不同。对于那些根据分组特征选择纳入且具有已知抽样概率的分组的集群随机试验,结合高效的估计方法,可以精确量化目标人群中的治疗效果,同时满足基于分组特征进行过采样的试验进行目标。