Lijmer J G, Mol B W, Heisterkamp S, Bonsel G J, Prins M H, van der Meulen J H, Bossuyt P M
Department of Clinical Epidemiology and Biostatistics, Academic Medical Center, University of Amsterdam, The Netherlands.
JAMA. 1999 Sep 15;282(11):1061-6. doi: 10.1001/jama.282.11.1061.
The literature contains a large number of potential biases in the evaluation of diagnostic tests. Strict application of appropriate methodological criteria would invalidate the clinical application of most study results.
To empirically determine the quantitative effect of study design shortcomings on estimates of diagnostic accuracy.
Observational study of the methodological features of 184 original studies evaluating 218 diagnostic tests. Meta-analyses on diagnostic tests were identified through a systematic search of the literature using MEDLINE, EMBASE, and DARE databases and the Cochrane Library (1996-1997). Associations between study characteristics and estimates of diagnostic accuracy were evaluated with a regression model.
Relative diagnostic odds ratio (RDOR), which compared the diagnostic odds ratios of studies of a given test that lacked a particular methodological feature with those without the corresponding shortcomings in design.
Fifteen (6.8%) of 218 evaluations met all 8 criteria; 64 (30%) met 6 or more. Studies evaluating tests in a diseased population and a separate control group overestimated the diagnostic performance compared with studies that used a clinical population (RDOR, 3.0; 95% confidence interval [CI], 2.0-4.5). Studies in which different reference tests were used for positive and negative results of the test under study overestimated the diagnostic performance compared with studies using a single reference test for all patients (RDOR, 2.2; 95% CI, 1.5-3.3). Diagnostic performance was also overestimated when the reference test was interpreted with knowledge of the test result (RDOR, 1.3; 95% CI, 1.0-1.9), when no criteria for the test were described (RDOR, 1.7; 95% CI, 1.1-2.5), and when no description of the population under study was provided (RDOR, 1.4; 95% CI, 1.1-1.7).
These data provide empirical evidence that diagnostic studies with methodological shortcomings may overestimate the accuracy of a diagnostic test, particularly those including nonrepresentative patients or applying different reference standards.
在诊断试验的评估中,文献中存在大量潜在偏倚。严格应用适当的方法学标准会使大多数研究结果的临床应用无效。
通过实证确定研究设计缺陷对诊断准确性估计的定量影响。
对184项评估218种诊断试验的原始研究的方法学特征进行观察性研究。通过使用MEDLINE、EMBASE和DARE数据库以及Cochrane图书馆(1996 - 1997年)对文献进行系统检索,确定诊断试验的荟萃分析。使用回归模型评估研究特征与诊断准确性估计之间的关联。
相对诊断比值比(RDOR),用于比较给定试验的研究中缺乏特定方法学特征的研究与设计中无相应缺陷的研究的诊断比值比。
218项评估中有15项(6.8%)符合所有8项标准;64项(30%)符合6项或更多标准。与使用临床人群的研究相比,在患病群体和单独对照组中评估试验的研究高估了诊断性能(RDOR,3.0;95%置信区间[CI],2.0 - 4.5)。与对所有患者使用单一参考试验的研究相比,对所研究试验的阳性和阴性结果使用不同参考试验的研究高估了诊断性能(RDOR,2.2;95% CI,1.5 - 3.3)。当参考试验根据试验结果的知识进行解释时(RDOR,1.3;95% CI,1.0 - 1.9)、未描述试验标准时(RDOR,1.7;95% CI,1.1 - 2.5)以及未提供所研究人群的描述时(RDOR,1.4;95% CI,1.1 - 1.7),诊断性能也被高估。
这些数据提供了实证证据,表明存在方法学缺陷的诊断研究可能高估诊断试验的准确性,特别是那些包括非代表性患者或应用不同参考标准的数据。