Dreier Maren, Borutta Birgit, Stahmeyer Jona, Krauth Christian, Walter Ulla
Institute of Epidemiology, Social Medicine and Health System Research, Hannover Medical School, Hannover, Germany.
GMS Health Technol Assess. 2010 Jun 14;6:Doc07. doi: 10.3205/hta000085.
HEALTH CARE POLICY BACKGROUND: Findings from scientific studies form the basis for evidence-based health policy decisions.
Quality assessments to evaluate the credibility of study results are an essential part of health technology assessment reports and systematic reviews. Quality assessment tools (QAT) for assessing the study quality examine to what extent study results are systematically distorted by confounding or bias (internal validity). The tools can be divided into checklists, scales and component ratings.
What QAT are available to assess the quality of interventional studies or studies in the field of health economics, how do they differ from each other and what conclusions can be drawn from these results for quality assessments?
A systematic search of relevant databases from 1988 onwards is done, supplemented by screening of the references, of the HTA reports of the German Agency for Health Technology Assessment (DAHTA) and an internet search. The selection of relevant literature, the data extraction and the quality assessment are carried out by two independent reviewers. The substantive elements of the QAT are extracted using a modified criteria list consisting of items and domains specific to randomized trials, observational studies, diagnostic studies, systematic reviews and health economic studies. Based on the number of covered items and domains, more and less comprehensive QAT are distinguished. In order to exchange experiences regarding problems in the practical application of tools, a workshop is hosted.
A total of eight systematic methodological reviews is identified as well as 147 QAT: 15 for systematic reviews, 80 for randomized trials, 30 for observational studies, 17 for diagnostic studies and 22 for health economic studies. The tools vary considerably with regard to the content, the performance and quality of operationalisation. Some tools do not only include the items of internal validity but also the items of quality of reporting and external validity. No tool covers all elements or domains. Design-specific generic tools are presented, which cover most of the content criteria.
The evaluation of QAT by using content criteria is difficult, because there is no scientific consensus on the necessary elements of internal validity, and not all of the generally accepted elements are based on empirical evidence. Comparing QAT with regard to contents neglects the operationalisation of the respective parameters, for which the quality and precision are important for transparency, replicability, the correct assessment and interrater reliability. QAT, which mix items on the quality of reporting and internal validity, should be avoided.
There are different, design-specific tools available which can be preferred for quality assessment, because of its wider coverage of substantive elements of internal validity. To minimise the subjectivity of the assessment, tools with a detailed and precise operationalisation of the individual elements should be applied. For health economic studies, tools should be developed and complemented with instructions, which define the appropriateness of the criteria. Further research is needed to identify study characteristics that influence the internal validity of studies.
医疗保健政策背景:科学研究的结果构成了循证医疗保健政策决策的基础。
评估研究结果可信度的质量评估是卫生技术评估报告和系统评价的重要组成部分。用于评估研究质量的质量评估工具(QAT)可检验研究结果在多大程度上因混杂因素或偏倚而产生系统性扭曲(内部效度)。这些工具可分为清单、量表和成分评级。
有哪些QAT可用于评估干预性研究或卫生经济学领域的研究质量,它们彼此有何不同,以及从这些结果中可以得出哪些关于质量评估的结论?
对1988年起的相关数据库进行系统检索,并辅以参考文献筛选、德国卫生技术评估机构(DAHTA)的卫生技术评估报告以及互联网搜索。由两名独立评审员进行相关文献的选择、数据提取和质量评估。使用由随机对照试验、观察性研究、诊断性研究、系统评价和卫生经济学研究特有的项目和领域组成的修改后的标准清单提取QAT的实质性要素。根据涵盖的项目和领域数量,区分出较全面和不太全面的QAT。为了交流关于工具实际应用中问题方面的经验,举办了一次研讨会。
共识别出八项系统方法学评价以及147种QAT:15种用于系统评价,80种用于随机对照试验,30种用于观察性研究,17种用于诊断性研究,22种用于卫生经济学研究。这些工具在内容、操作性的表现和质量方面差异很大。一些工具不仅包括内部效度的项目,还包括报告质量和外部效度的项目。没有一种工具涵盖所有要素或领域。展示了针对特定设计的通用工具,其涵盖了大部分内容标准。
使用内容标准对QAT进行评估很困难,因为对于内部效度的必要要素没有科学共识,而且并非所有公认的要素都基于经验证据。就内容而言比较QAT忽略了各个参数在操作性方面的问题,而对于透明度、可重复性、正确评估和评分者间信度来说,操作性的质量和精度很重要。应避免将报告质量和内部效度项目混在一起的QAT。
有不同的、针对特定设计的工具可用于质量评估,因其对内部效度实质性要素的覆盖范围更广而更受青睐。为了尽量减少评估的主观性,应使用对各个要素进行详细且精确操作性定义的工具。对于卫生经济学研究,应开发工具并辅以说明,以界定标准的适用性。需要进一步开展研究以确定影响研究内部效度的研究特征。