Wetterslev Jørn, Thorlund Kristian, Brok Jesper, Gluud Christian
Copenhagen Trial Unit, Centre for Clinical Intervention Research, Rigshospitalet, Copenhagen University Hospital, Blegdamsvej 9, Copenhagen, Denmark.
J Clin Epidemiol. 2008 Jan;61(1):64-75. doi: 10.1016/j.jclinepi.2007.03.013. Epub 2007 Aug 23.
Cumulative meta-analyses are prone to produce spurious P<0.05 because of repeated testing of significance as trial data accumulate. Information size in a meta-analysis should at least equal the sample size of an adequately powered trial. Trial sequential analysis (TSA) corresponds to group sequential analysis of a single trial and may be applied to meta-analysis to evaluate the evidence.
Six randomly selected neonatal meta-analyses with at least five trials reporting a binary outcome were examined. Low-bias heterogeneity-adjusted information size and information size determined from an assumed intervention effect of 15% were calculated. These were used for constructing trial sequential monitoring boundaries. We assessed the cumulative z-curves' crossing of P=0.05 and the boundaries.
Five meta-analyses showed early potentially spurious P<0.05 values. In three significant meta-analyses the cumulative z-curves crossed both boundaries, establishing firm evidence of an intervention effect. In two nonsignificant meta-analyses the cumulative z-curves crossed P=0.05, but never the boundaries, demonstrating early potentially spurious P<0.05 values. In one nonsignificant meta-analysis the cumulative z-curves never crossed P=0.05 or the boundaries.
TSAs may establish when firm evidence is reached in meta-analysis.
由于随着试验数据的积累对显著性进行重复检验,累积荟萃分析容易产生虚假的P<0.05。荟萃分析中的信息量应至少等于一项功效充分的试验的样本量。试验序贯分析(TSA)相当于单个试验的成组序贯分析,可应用于荟萃分析以评估证据。
检查了六项随机选择的新生儿荟萃分析,这些分析中至少有五项试验报告了二元结局。计算了低偏倚异质性调整后的信息量以及根据15%的假定干预效应确定信息量。这些用于构建试验序贯监测界限。我们评估了累积z曲线与P=0.05及界限的交叉情况。
五项荟萃分析显示早期存在可能虚假的P<0.05值。在三项有显著结果的荟萃分析中,累积z曲线越过了两条界限,确立了干预效应的确凿证据。在两项无显著结果的荟萃分析中,累积z曲线越过了P=0.05,但从未越过界限,表明早期存在可能虚假的P<0.05值。在一项无显著结果的荟萃分析中,累积z曲线从未越过P=0.05或界限。
试验序贯分析可确定在荟萃分析中何时获得确凿证据。