Test Evaluation Research Group, University of Birmingham, Birmingham, UK; NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham, UK.
Nottingham Clinical Trials Unit, Faculty of Medicine and Health Sciences, University of Nottingham, Nottingham, UK.
J Clin Epidemiol. 2020 May;121:1-14. doi: 10.1016/j.jclinepi.2019.12.007. Epub 2019 Dec 14.
The objective of this study was to examine methodological and reporting characteristics of systematic reviews and meta-analyses which compare diagnostic test accuracy (DTA) of multiple index tests, identify good practice, and develop guidance for better reporting.
Methodological survey of 127 comparative or multiple tests reviews published in 74 different general medical and specialist journals. We summarized methods and reporting characteristics that are likely to differ between reviews of a single test and comparative reviews. We then developed guidance to enhance reporting of test comparisons in DTA reviews.
Of 127 reviews, 16 (13%) reviews restricted study selection and test comparisons to comparative accuracy studies while the remaining 111 (87%) reviews included any study type. Fifty-three reviews (42%) statistically compared test accuracy with only 18 (34%) of these using recommended methods. Reporting of several items-in particular the role of the index tests, test comparison strategy, and limitations of indirect comparisons (i.e., comparisons involving any study type)-was deficient in many reviews. Five reviews with exemplary methods and reporting were identified.
Reporting quality of reviews which evaluate and compare multiple tests is poor. The guidance developed, complemented with the exemplars, can assist review authors in producing better quality comparative reviews.
本研究旨在检查比较多种诊断测试准确性(DTA)的系统评价和荟萃分析的方法学和报告特征,确定良好实践,并为更好的报告制定指导。
对发表在 74 种不同一般医学和专科期刊中的 127 篇比较或多重测试综述进行方法学调查。我们总结了可能在单个测试综述和比较综述之间存在差异的方法和报告特征。然后,我们制定了指导意见,以增强 DTA 综述中测试比较的报告。
在 127 篇综述中,16 篇(13%)综述将研究选择和测试比较限制在比较准确性研究中,而其余 111 篇(87%)综述包括任何研究类型。53 篇综述(42%)仅使用统计学方法比较了测试准确性,其中仅 18 篇(34%)使用了推荐的方法。许多综述在报告几个项目方面存在缺陷,特别是索引测试的作用、测试比较策略以及间接比较的局限性(即涉及任何研究类型的比较)。确定了五篇具有出色方法和报告的综述。
评估和比较多种测试的综述的报告质量很差。制定的指导意见,辅以范例,可以帮助综述作者制作出更好质量的比较综述。