Singh Sonal, Chang Stephanie M, Matchar David B, Bass Eric B
Johns Hopkins University School of Medicine and Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD
Center for Outcomes and Evidence, Agency for Healthcare Research and Quality, Rockville, MD
Grading the strength of a body of diagnostic test evidence involves challenges over and above those related to grading the evidence from health care intervention studies. This chapter identifies challenges and outlines principles for grading the body of evidence related to diagnostic test performance.
Diagnostic test evidence is challenging to grade because standard tools for grading evidence were designed for questions about treatment rather than diagnostic testing; and the clinical usefulness of a diagnostic test depends on multiple links in a chain of evidence connecting the performance of a test to changes in clinical outcomes.
Reviewers grading the strength of a body of evidence on diagnostic tests should consider the principle domains of as well as that would decrease an effect, and , similar to what is done to grade evidence on treatment interventions. Given that most evidence regarding the clinical value of diagnostic tests is indirect, an analytic framework must be developed to clarify the key questions, and strength of evidence for each link in that framework should be graded separately. However, if reviewers choose to combine domains into a single grade of evidence, they should explain their rationale for a particular summary grade and the relevant domains that were weighed in assigning the summary grade.
对诊断试验证据的力度进行分级所面临的挑战,超出了与医疗保健干预研究证据分级相关的挑战。本章确定了这些挑战,并概述了对与诊断试验性能相关的证据进行分级的原则。
诊断试验证据难以分级,原因在于证据分级的标准工具是为治疗问题而非诊断试验设计的;而且诊断试验的临床效用取决于一系列证据中的多个环节,这些环节将试验性能与临床结果的变化联系起来。
对诊断试验证据力度进行分级的评审人员,应考虑与治疗干预证据分级类似的、会降低效应的原则领域以及 , 。鉴于大多数关于诊断试验临床价值的证据都是间接的,必须建立一个分析框架来阐明关键问题,并且该框架中每个环节的证据力度应分别进行分级。然而,如果评审人员选择将各个领域合并为单一的证据等级,他们应解释给出特定汇总等级的理由以及在确定汇总等级时所权衡的相关领域。