Heart Institute (InCor), Laboratory of Genetics and Molecular Cardiology, São Paulo, 05403-000, Brazil.
Department of Biochemistry and Molecular Biology, Federal University of São Paulo, São Paulo, 05403-000, Brazil.
Res Synth Methods. 2010 Apr;1(2):149-61. doi: 10.1002/jrsm.13. Epub 2010 Oct 28.
We describe how an appropriate interpretation of the Q-test depends on its power to detect a given typical amount of between-study variance (τ(2)) as well as prior beliefs on heterogeneity. We illustrate these concepts in an evaluation of 1011 meta-analyses of clinical trials with ⩾4 studies and binary outcomes. These concepts can be seen as an application of the Bayes theorem. Across the 1011 meta-analyses, power to detect typical heterogeneity was low in most situations. Thus, usually a non-significant Q test did not change perceptibly prior convictions on heterogeneity. Conversely, significant results for the Q test typically augmented considerably the probability of heterogeneity. The posterior probability of heterogeneity depends on what τ(2) we want to detect. With the same approach, one may also estimate the posterior probability for the presence of heterogeneity that is large enough to annul statistically significant summary effects; that is half the average within-study variance of the combined studies; and that is able to change the summary effect estimate of the meta-analysis by 20%. The discussed analyses are exploratory, and may depend heavily on prior assumptions when power for the Q-test is low. Statistical heterogeneity in meta-analyses should be cautiously interpreted considering the power to detect a specific τ(2) and prior assumptions about the presence of heterogeneity. Copyright © 2010 John Wiley & Sons, Ltd.
我们描述了 Q 检验的适当解释取决于其检测给定的研究间方差(τ(2))以及对异质性的先验信念的能力。我们在对 1011 项具有 ⩾4 项研究和二项结局的临床试验的荟萃分析评估中说明了这些概念。这些概念可以被视为贝叶斯定理的应用。在 1011 项荟萃分析中,在大多数情况下,检测典型异质性的能力较低。因此,通常非显著性 Q 检验不会明显改变对异质性的先入为主的信念。相反,Q 检验的显著结果通常会极大地增加异质性的概率。异质性的后验概率取决于我们要检测的 τ(2)。通过相同的方法,人们还可以估计存在足够大的异质性的后验概率,以消除统计上显著的汇总效应;即合并研究的平均内部研究方差的一半;并且能够改变荟萃分析的汇总效应估计值 20%。所讨论的分析是探索性的,并且当 Q 检验的功效较低时,可能会严重依赖于先验假设。考虑到检测特定 τ(2)的能力和对异质性存在的先验假设,荟萃分析中的统计异质性应谨慎解释。版权所有 © 2010 约翰威立父子有限公司。