Department of Clinical, Neuro and Developmental Psychology, Amsterdam Public Health research institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
Department of Psychology, Pennsylvania State University, Pennsylvania, USA.
Epidemiol Psychiatr Sci. 2021 Dec 2;30:e78. doi: 10.1017/S2045796021000664.
One of the most used methods to examine sources of heterogeneity in meta-analyses is the so-called 'subgroup analysis'. In a subgroup analysis, the included studies are divided into two or more subgroups, and it is tested whether the pooled effect sizes found in these subgroups differ significantly from each other. Subgroup analyses can be considered as a core component of most published meta-analyses. One important problem of subgroup analyses is the lack of statistical power to find significant differences between subgroups. In this paper, we explore the power problems of subgroup analyses in more detail, using 'metapower', a recently developed statistical package in R to examine power in meta-analyses, including subgroup analyses. We show that subgroup analyses require many more included studies in a meta-analysis than are needed for the main analyses. We work out an example of an 'average' meta-analysis, in which a subgroup analysis requires 3-4 times the number of studies that are needed for the main analysis to have sufficient power. This number of studies increases exponentially with decreasing effect sizes and when the studies are not evenly divided over the subgroups. Higher heterogeneity also requires increasing numbers of studies. We conclude that subgroup analyses remain an important method to examine potential sources of heterogeneity in meta-analyses, but that meta-analysts should keep in mind that power is very low for most subgroup analyses. As in any statistical evaluation, researchers should not rely on a test and p-value to interpret results, but should compare the confidence intervals and interpret results carefully.
一种最常用于元分析中检验异质性来源的方法是所谓的“亚组分析”。在亚组分析中,纳入的研究被分为两个或更多的亚组,并检验这些亚组中发现的汇总效应大小是否存在显著差异。亚组分析可以被认为是大多数已发表的元分析的核心组成部分。亚组分析的一个重要问题是缺乏统计能力来发现亚组之间的显著差异。在本文中,我们使用 R 中最近开发的统计软件包“metapower”更详细地探讨了亚组分析的功效问题,以检查包括亚组分析在内的元分析中的功效。我们表明,亚组分析在元分析中需要比主要分析多得多的纳入研究。我们制定了一个“平均”元分析的例子,其中亚组分析需要主要分析所需研究数量的 3-4 倍才有足够的功效。当研究在亚组之间不均匀分布时,这个研究数量会随着效应大小的减小呈指数级增加。更高的异质性也需要更多的研究。我们得出结论,亚组分析仍然是检验元分析中潜在异质性来源的重要方法,但元分析人员应牢记,大多数亚组分析的功效非常低。与任何统计评估一样,研究人员不应仅依赖检验和 p 值来解释结果,而应仔细比较置信区间并谨慎解释结果。