Key Laboratory of Population Health Across-life Cycle, Ministry of Education of the People's Republic of China, Anhui Medical University, Anhui, China.
Anhui Provincial Key Laboratory of Population Health and Aristogenics, Anhui Medical University, Anhui, China.
BMJ. 2022 May 10;377:e069155. doi: 10.1136/bmj-2021-069155.
To investigate the validity of data extraction in systematic reviews of adverse events, the effect of data extraction errors on the results, and to develop a classification framework for data extraction errors to support further methodological research.
Reproducibility study.
PubMed was searched for eligible systematic reviews published between 1 January 2015 and 1 January 2020. Metadata from the randomised controlled trials were extracted from the systematic reviews by four authors. The original data sources (eg, full text and ClinicalTrials.gov) were then referred to by the same authors to reproduce the data used in these meta-analyses.
Systematic reviews were included when based on randomised controlled trials for healthcare interventions that reported safety as the exclusive outcome, with at least one pair meta-analysis that included five or more randomised controlled trials and with a 2×2 table of data for event counts and sample sizes in intervention and control arms available for each trial in the meta-analysis.
The primary outcome was data extraction errors summarised at three levels: study level, meta-analysis level, and systematic review level. The potential effect of such errors on the results was further investigated.
201 systematic reviews and 829 pairwise meta-analyses involving 10 386 randomised controlled trials were included. Data extraction could not be reproduced in 1762 (17.0%) of 10 386 trials. In 554 (66.8%) of 829 meta-analyses, at least one randomised controlled trial had data extraction errors; 171 (85.1%) of 201 systematic reviews had at least one meta-analysis with data extraction errors. The most common types of data extraction errors were numerical errors (49.2%, 867/1762) and ambiguous errors (29.9%, 526/1762), mainly caused by ambiguous definitions of the outcomes. These categories were followed by three others: zero assumption errors, misidentification, and mismatching errors. The impact of these errors were analysed on 288 meta-analyses. Data extraction errors led to 10 (3.5%) of 288 meta-analyses changing the direction of the effect and 19 (6.6%) of 288 meta-analyses changing the significance of the P value. Meta-analyses that had two or more different types of errors were more susceptible to these changes than those with only one type of error (for moderate changes, 11 (28.2%) of 39 26 (10.4%) 249, P=0.002; for large changes, 5 (12.8%) of 39 8 (3.2%) of 249, P=0.01).
Systematic reviews of adverse events potentially have serious issues in terms of the reproducibility of the data extraction, and these errors can mislead the conclusions. Implementation guidelines are urgently required to help authors of future systematic reviews improve the validity of data extraction.
调查系统评价中不良事件数据提取的有效性,数据提取错误对结果的影响,并开发一个数据提取错误分类框架,以支持进一步的方法学研究。
可重复性研究。
在 2015 年 1 月 1 日至 2020 年 1 月 1 日期间,通过 PubMed 检索符合条件的系统评价。由四位作者从系统评价中提取随机对照试验的元数据。然后,同一作者参考原始数据源(例如全文和 ClinicalTrials.gov),以重现这些荟萃分析中使用的数据。
纳入基于随机对照试验的系统评价,这些试验报告了安全性作为唯一结局的医疗干预措施,至少有一个包含五个或更多随机对照试验的两因素荟萃分析,以及每个试验在荟萃分析中可用于干预组和对照组的事件计数和样本量的 2×2 表。
主要结局是总结为三个层次的提取数据错误:研究水平、荟萃分析水平和系统评价水平。进一步调查了这些错误对结果的潜在影响。
共纳入 201 篇系统评价和 829 对包含 10386 项随机对照试验的两因素荟萃分析。在 10386 项试验中,有 1762 项(17.0%)无法重现数据提取。在 829 项荟萃分析中,至少有一项随机对照试验存在数据提取错误;在 201 篇系统评价中,有 171 篇(85.1%)至少有一项荟萃分析存在数据提取错误。最常见的数据提取错误类型是数值错误(49.2%,867/1762)和模糊错误(29.9%,526/1762),主要是由于结局的定义不明确造成的。其次是其他三个类别:零假设错误、识别错误和不匹配错误。分析了这些错误对 288 项荟萃分析的影响。数据提取错误导致 288 项荟萃分析中有 10 项(3.5%)改变了效应方向,19 项(6.6%)改变了 P 值的显著性。有两种或两种以上不同类型错误的荟萃分析比只有一种类型错误的荟萃分析更容易受到这些变化的影响(对于中度变化,39 项中的 11 项(28.2%)与 249 项中的 26 项(10.4%)相比,P=0.002;对于较大变化,39 项中的 5 项(12.8%)与 249 项中的 8 项(3.2%)相比,P=0.01)。
不良事件的系统评价在数据提取的可重复性方面存在严重问题,这些错误可能会误导结论。迫切需要实施指南,以帮助未来系统评价的作者提高数据提取的有效性。