Faculty of Pharmacy, Pharmacy and Bank Building (A15), The University of Sydney, New South Wales 2006, Australia.
Res Social Adm Pharm. 2013 May-Jun;9(3):330-8. doi: 10.1016/j.sapharm.2012.04.004. Epub 2012 Jun 12.
Evaluations of interrater agreement and interrater reliability can be applied to a number of different contexts and are frequently encountered in social and administrative pharmacy research. The objectives of this study were to highlight key differences between interrater agreement and interrater reliability; describe the key concepts and approaches to evaluating interrater agreement and interrater reliability; and provide examples of their applications to research in the field of social and administrative pharmacy. This is a descriptive review of interrater agreement and interrater reliability indices. It outlines the practical applications and interpretation of these indices in social and administrative pharmacy research. Interrater agreement indices assess the extent to which the responses of 2 or more independent raters are concordant. Interrater reliability indices assess the extent to which raters consistently distinguish between different responses. A number of indices exist, and some common examples include Kappa, the Kendall coefficient of concordance, Bland-Altman plots, and the intraclass correlation coefficient. Guidance on the selection of an appropriate index is provided. In conclusion, selection of an appropriate index to evaluate interrater agreement or interrater reliability is dependent on a number of factors including the context in which the study is being undertaken, the type of variable under consideration, and the number of raters making assessments.
评价者间一致性和评价者间可靠性可应用于许多不同的情境,并且在社会和管理药学研究中经常遇到。本研究的目的是突出评价者间一致性和评价者间可靠性之间的关键差异;描述评价者间一致性和评价者间可靠性的关键概念和方法;并提供其在社会和管理药学领域研究中的应用实例。这是对评价者间一致性和评价者间可靠性指标的描述性综述。它概述了这些指标在社会和管理药学研究中的实际应用和解释。评价者间一致性指标评估 2 个或更多独立评价者的反应是否一致。评价者间可靠性指标评估评价者在区分不同反应方面的一致性程度。有许多指标存在,一些常见的例子包括 Kappa、Kendall 一致性系数、Bland-Altman 图和组内相关系数。提供了选择适当指标的指导。总之,选择适当的指标来评估评价者间一致性或评价者间可靠性取决于许多因素,包括研究进行的背景、考虑的变量类型以及进行评估的评价者数量。