Uebersax J S
J Psychiatr Res. 1982;17(4):335-42. doi: 10.1016/0022-3956(82)90039-5.
The several different versions of the kappa coefficient currently in use can be seen as variants of a single basic computational method. This paper present general formulas for calculating kappa, both to measure the overall reliability of a set of diagnostic categories and to measure the specific reliability of individual categories. The formulas extend the range of situations in which kappa can be calculated to include multiple-rater, partially crossed designs of the sort that researchers frequently encounter.