Peter MacCallum Cancer Centre, East Melbourne, Victoria 3002, Australia.
Department of Anaesthesia, Austin Health, Heidelberg, Victoria 3084, Australia
Br J Anaesth. 2016 Oct;117(4):431-441. doi: 10.1093/bja/aew170. Epub 2016 Oct 17.
The reliability of meta-analysis (MA) in predicting the findings of subsequent large randomized controlled trials (RCTs) has not been assessed in perioperative medicine and anaesthesia.
Using Medline and PubMed, large RCTs (n≥1000) published since 2000 in the anaesthesia and perioperative medicine/critical care literature were identified. All previous MAs of RCTs investigating the same intervention and population were sourced. For all reported major morbid endpoints common to each, results (significant/non-significant P<0.05) were compared.
18 large RCTs and 44 prior MAs investigating the effects of 16 interventions were identified. Where endpoint results in the large RCTs were each compared with the single largest recent preceding MA, 35 of a total of 57 outcomes were predicted correctly by the MAs (61.4%). The odds ratio for a significant result from MA compared with the subsequent large RCT was 3.6, P=0.033 Bonferroni corrected. The positive predictive value of MAs was 22.7%; the negative predictive value was 85.7%, Kappa was 0.094 indicating slight agreement. The estimated power for each endpoint for large RCTs and MAs were similar, but the median study size for large RCTs was larger than that of the MAs, n=4,482 vs 1,389, P<0.0001.
There was a strong tendency towards positive findings in MA not substantiated by subsequent large RCTs, which was not attributable to differences in study power. This finding suggests caution in clinical decision-making in anaesthesia and perioperative medicine based on findings of meta-analysis.
元分析(MA)预测随后的大型随机对照试验(RCT)结果的可靠性在围手术期医学和麻醉学中尚未得到评估。
使用 Medline 和 PubMed,确定了 2000 年以来发表在麻醉和围手术期医学/重症监护文献中的大型 RCT(n≥1000)。所有之前研究相同干预措施和人群的 RCT 的 MA 均被溯源。对于每个报告的主要发病率终点,都比较了它们的结果(有意义/无意义 P<0.05)。
共确定了 18 项大型 RCT 和 44 项之前的 MA,这些研究调查了 16 种干预措施的效果。在大型 RCT 中的终点结果与单个最近的最大 MA 进行比较时,MA 预测正确的 57 个结果中的 35 个(61.4%)。MA 与随后的大型 RCT 相比,有显著结果的优势比为 3.6,P=0.033 经 Bonferroni 校正。MA 的阳性预测值为 22.7%;阴性预测值为 85.7%,Kappa 值为 0.094,表明存在轻微一致性。对于每个终点,大型 RCT 和 MA 的估计效能相似,但大型 RCT 的中位研究规模大于 MA,n=4482 与 n=1389,P<0.0001。
MA 存在强烈的阳性倾向,但未被随后的大型 RCT 证实,这并不是由于研究能力的差异造成的。这一发现表明,在基于荟萃分析结果进行麻醉和围手术期医学的临床决策时应保持谨慎。