Knowledge Translation Program, Li Ka Shing Knowledge Institute, St. Michael's Hospital, 209 Victoria Street, Toronto, Ontario, M5B 1W8, Canada.
Institute of Reproductive and Developmental Biology, Department of Surgery & Cancer, Faculty of Medicine, Imperial College, London, UK.
BMC Med Res Methodol. 2021 Oct 25;21(1):224. doi: 10.1186/s12874-021-01401-y.
Network meta-analysis (NMA) has attracted growing interest in evidence-based medicine. Consistency between different sources of evidence is fundamental to the reliability of the NMA results. The purpose of the present study was to estimate the prevalence of evidence of inconsistency and describe its association with different NMA characteristics.
We updated our collection of NMAs with articles published up to July 2018. We included networks with randomised clinical trials, at least four treatment nodes, at least one closed loop, a dichotomous primary outcome, and available arm-level data. We assessed consistency using the design-by-treatment interaction (DBT) model and testing all the inconsistency parameters globally through the Wald-type chi-squared test statistic. We estimated the prevalence of evidence of inconsistency and its association with different network characteristics (e.g., number of studies, interventions, intervention comparisons, loops). We evaluated the influence of the network characteristics on the DBT p-value via a multivariable regression analysis and the estimated Pearson correlation coefficients. We also evaluated heterogeneity in NMA (consistency) and DBT (inconsistency) random-effects models.
We included 201 published NMAs. The p-value of the design-by-treatment interaction (DBT) model was lower than 0.05 in 14% of the networks and lower than 0.10 in 20% of the networks. Networks including many studies and comparing few interventions were more likely to have small DBT p-values (less than 0.10), which is probably because they yielded more precise estimates and power to detect differences between designs was higher. In the presence of inconsistency (DBT p-value lower than 0.10), the consistency model displayed higher heterogeneity than the DBT model.
Our findings show that inconsistency was more frequent than what would be expected by chance, suggesting that researchers should devote more resources to exploring how to mitigate inconsistency. The results of this study highlight the need to develop strategies to detect inconsistency (because of the relatively high prevalence of evidence of inconsistency in published networks), and particularly in cases where the existing tests have low power.
网络荟萃分析(NMA)在循证医学中越来越受到关注。不同证据来源之间的一致性是 NMA 结果可靠性的基础。本研究旨在估计不一致证据的发生率,并描述其与不同 NMA 特征的关系。
我们更新了我们的 NMA 集合,其中包括截至 2018 年 7 月发表的文章。我们纳入了包含随机临床试验、至少 4 个治疗节点、至少 1 个闭环、二分类主要结局和可用手臂水平数据的网络。我们使用设计-治疗相互作用(DBT)模型评估一致性,并通过 Wald 型卡方检验统计量全局检验所有不一致参数。我们估计了不一致证据的发生率及其与不同网络特征(如研究数量、干预措施、干预比较、循环)的关系。我们通过多变量回归分析评估了网络特征对 DBT p 值的影响,并估计了 Pearson 相关系数。我们还评估了 NMA(一致性)和 DBT(不一致)随机效应模型中的异质性。
我们纳入了 201 项已发表的 NMA。在 14%的网络中,设计-治疗相互作用(DBT)模型的 p 值小于 0.05,在 20%的网络中,p 值小于 0.10。包含大量研究和比较较少干预措施的网络更有可能具有较小的 DBT p 值(小于 0.10),这可能是因为它们产生了更精确的估计,并且检测设计之间差异的能力更高。在存在不一致性(DBT p 值小于 0.10)的情况下,一致性模型显示出比 DBT 模型更高的异质性。
我们的研究结果表明,不一致性比预期更为常见,这表明研究人员应该投入更多资源来探索如何减轻不一致性。本研究的结果强调了需要制定策略来检测不一致性(因为发表网络中不一致证据的发生率相对较高),特别是在现有检验功效较低的情况下。