Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht, the Netherlands.
Clin Microbiol Infect. 2014 Feb;20(2):123-9. doi: 10.1111/1469-0691.12494.
Variance between studies in a meta-analysis will exist. This heterogeneity may be of clinical, methodological or statistical origin. The last of these is quantified by the I(2) -statistic. We investigated, using simulated studies, the accuracy of I(2) in the assessment of heterogeneity and the effects of heterogeneity on the predictive value of meta-analyses. The relevance of quantifying I(2) was determined according to the likely presence of heterogeneity between studies (low, high, or unknown) and the calculated I(2) (low or high). The findings were illustrated by published meta-analyses of selective digestive decontamination and weaning protocols. As expected, I(2) increases and the likelihood of drawing correct inferences from a meta-analysis decreases with increasing heterogeneity. With low levels of heterogeneity, I(2) does not appear to be predictive of the accuracy of the meta-analysis result. With high levels of heterogeneity, even meta-analyses with low I(2) -values have low predictive values. Most commonly, the level of heterogeneity in a meta-analysis will be unknown. In these scenarios, I(2) determination may help to identify estimates with low predictive values (high I(2) ). In this situation, the results of a meta-analysis will be unreliable. With low I(2) -values and unknown levels of heterogeneity, predictive values of pooled estimates may range extensively, and findings should be interpreted with caution. In conclusion, quantifying statistical heterogeneity through I(2) -statistics is only helpful when the amount of clinical heterogeneity is unknown and I(2) is high. Objective methods to quantify the levels of clinical and methodological heterogeneity are urgently needed to allow reliable determination of the accuracy of meta-analyses.
荟萃分析中会存在研究间的差异。这种异质性可能源于临床、方法学或统计学方面。最后一种情况可以通过 I(2) -统计量来量化。我们通过模拟研究,调查了 I(2)在评估异质性和荟萃分析预测价值方面的准确性。根据研究间异质性的可能存在情况(低、高或未知)和计算出的 I(2)(低或高)来确定量化 I(2)的相关性。通过选择性消化道去污染和脱机方案的荟萃分析实例来说明这些发现。正如预期的那样,随着异质性的增加,从荟萃分析中得出正确推论的可能性降低,I(2)也会增加。在低水平的异质性下,I(2)似乎不能预测荟萃分析结果的准确性。在高水平的异质性下,即使 I(2)值较低的荟萃分析也具有较低的预测值。通常,荟萃分析中的异质性水平是未知的。在这种情况下,I(2)的确定可能有助于识别预测值较低的估计值(高 I(2) )。在这种情况下,荟萃分析的结果是不可靠的。在 I(2)值低且异质性水平未知的情况下,汇总估计值的预测值可能会广泛变化,应谨慎解释结果。总之,通过 I(2) -统计量量化统计异质性只有在临床异质性的程度未知且 I(2)较高时才有用。需要迫切需要客观的方法来量化临床和方法学异质性的程度,以可靠地确定荟萃分析的准确性。