Department of Hygiene and Epidemiology, Clinical Trials and Evidence-Based Medicine Unit, University of Ioannina School of Medicine, Ioannina 45110, Greece.
J Clin Epidemiol. 2011 Oct;64(10):1060-9. doi: 10.1016/j.jclinepi.2010.12.012. Epub 2011 Mar 31.
To assess whether nominally statistically significant effects in meta-analyses of clinical trials are true and whether their magnitude is inflated.
Data from the Cochrane Database of Systematic Reviews 2005 (issue 4) and 2010 (issue 1) were used. We considered meta-analyses with binary outcomes and four or more trials in 2005 with P<0.05 for the random-effects odds ratio (OR). We examined whether any of these meta-analyses had updated counterparts in 2010. We estimated the credibility (true-positive probability) under different prior assumptions and inflation in OR estimates in 2005.
Four hundred sixty-one meta-analyses in 2005 were eligible, and 80 had additional trials included by 2010. The effect sizes (ORs) were smaller in the updating data (2005-2010) than in the respective meta-analyses in 2005 (median 0.85-fold, interquartile range [IQR]: 0.66-1.06), even more prominently for meta-analyses with less than 300 events in 2005 (median 0.67-fold, IQR: 0.54-0.96). Mean credibility of the 461 meta-analyses in 2005 was 63-84% depending on the assumptions made. Credibility estimates changed >20% in 19-31 (24-39%) of the 80 updated meta-analyses.
Most meta-analyses with nominally significant results pertain to truly nonnull effects, but exceptions are not uncommon. The magnitude of observed effects, especially in meta-analyses with limited evidence, is often inflated.
评估临床试验荟萃分析中名义上有统计学意义的效果是否真实,以及其大小是否被夸大。
使用了 Cochrane 系统评价数据库 2005 年(第 4 期)和 2010 年(第 1 期)的数据。我们考虑了 2005 年具有二元结局且有四个或更多试验的荟萃分析,其随机效应比值比(OR)具有<0.05 的统计学意义。我们检验了这些荟萃分析中是否有任何一个在 2010 年有更新的对应分析。我们在不同的先验假设下估计可信度(真阳性概率)和 2005 年 OR 估计的膨胀。
2005 年有 461 项荟萃分析符合条件,其中 80 项在 2010 年增加了试验。更新数据(2005-2010 年)中的效应大小(OR)比 2005 年各自的荟萃分析更小(中位数为 0.85 倍,四分位距 [IQR]:0.66-1.06),对于 2005 年少于 300 个事件的荟萃分析,这种差异更为明显(中位数为 0.67 倍,IQR:0.54-0.96)。2005 年 461 项荟萃分析的平均可信度取决于所做的假设,为 63-84%。在 80 项更新的荟萃分析中,有 19-31 项(24-39%)的可信度估计值变化超过 20%。
大多数具有名义显著结果的荟萃分析都涉及真正的非零效应,但也不罕见有例外。观察到的效应大小,尤其是在证据有限的荟萃分析中,往往被夸大。