School of Computing Sciences, University of East Anglia, Norwich, UK.
Department of Population and Quantitative Health Sciences, University of Massachusetts Medical School, Worcester, Massachusetts, USA.
Res Synth Methods. 2021 Nov;12(6):711-730. doi: 10.1002/jrsm.1491. Epub 2021 Jul 6.
The conventional Q statistic, using estimated inverse-variance (IV) weights, underlies a variety of problems in random-effects meta-analysis. In previous work on standardized mean difference and log-odds-ratio, we found superior performance with an estimator of the overall effect whose weights use only group-level sample sizes. The Q statistic with those weights has the form proposed by DerSimonian and Kacker. The distribution of this Q and the Q with IV weights must generally be approximated. We investigate approximations for those distributions, as a basis for testing and estimating the between-study variance (τ ). A simulation study, with mean difference as the effect measure, provides a framework for assessing accuracy of the approximations, level and power of the tests, and bias in estimating τ . Two examples illustrate estimation of τ and the overall mean difference. Use of Q with sample-size-based weights and its exact distribution (available for mean difference and evaluated by Farebrother's algorithm) provides precise levels even for very small and unbalanced sample sizes. The corresponding estimator of τ is almost unbiased for 10 or more small studies. This performance compares favorably with the extremely liberal behavior of the standard tests of heterogeneity and the largely biased estimators based on inverse-variance weights.
传统的 Q 统计量使用估计的倒数方差(IV)权重,这是随机效应荟萃分析中存在的各种问题的基础。在之前关于标准化均数差和对数优势比的研究中,我们发现,使用仅使用组级样本量的效应总体估计量的表现更好。使用这些权重的 Q 统计量具有 DerSimonian 和 Kacker 提出的形式。这些权重的 Q 统计量和 IV 权重的 Q 统计量通常必须进行近似。我们研究了这些分布的近似值,作为检验和估计研究间方差(τ)的基础。以均数差作为效应量的模拟研究为评估这些近似值的准确性、检验的水平和功效以及估计τ的偏差提供了一个框架。两个示例说明了τ和总体均数差的估计。使用基于样本量的权重的 Q 及其精确分布(可用于均数差,并通过 Farebrother 的算法进行评估),即使对于非常小和不平衡的样本量,也能提供精确的水平。对于 10 个或更多小研究,τ 的相应估计值几乎无偏。这种性能与异质性标准检验的极端宽松行为以及基于倒数方差权重的基本有偏估计值相比具有优势。