Perkins Susan M, Becker Mark P
Division of Biostatistics, Indiana University, 1050 Wishard Blvd., RG 4101, Indianapolis, IN 46202-2872, USA.
Stat Med. 2002 Jun 30;21(12):1743-60. doi: 10.1002/sim.1146.
New models that are useful in the assessment of rater agreement, particularly when the rating scale is ordered or partially ordered, are presented. The models are parameterized to address two important aspects of rater agreement: (i) agreement in terms of the overall frequency in which raters assign categories; and (ii) the extent to which raters agree on the category assigned to individual subjects or items. We present methodology for the simultaneous modelling of univariate marginal responses and bivariate marginal associations in the K-way contingency table representing the joint distribution of K rater responses. The univariate marginal responses provide information for evaluating agreement in terms of the overall frequency of responses, and the bivariate marginal associations provide information on category-wise agreement among pairs of raters. In addition, estimated scores within a generalized log non-linear model for bivariate associations facilitate the assessment of category distinguishability.
本文提出了一些新模型,这些模型在评估评分者一致性方面很有用,特别是当评分量表是有序或部分有序时。这些模型通过参数化来解决评分者一致性的两个重要方面:(i)评分者分配类别时的总体频率方面的一致性;(ii)评分者在分配给个体受试者或项目的类别上的一致程度。我们提出了一种方法,用于在表示K个评分者反应联合分布的K维列联表中同时对单变量边际反应和双变量边际关联进行建模。单变量边际反应提供了用于根据反应的总体频率评估一致性的信息,而双变量边际关联提供了关于评分者对之间按类别一致性的信息。此外,在双变量关联的广义对数非线性模型中的估计分数有助于评估类别可区分性。