Banerjee M, Fielding J
Department of Pathology, Wayne State University, Detroit, MI 48201, USA.
Res Nurs Health. 1997 Oct;20(5):465-70. doi: 10.1002/(sici)1098-240x(199710)20:5<465::aid-nur10>3.0.co;2-8.
Reliability of nursing observations often is estimated using Cohen's kappa, a chance-adjusted measure of agreement between observer RNs. However, use of kappa as an omnibus measure sometimes can be misleading. In a study partly designed to describe the frequency and reliability of nursing diagnoses in long-term care facilities, 360 residents each were assessed independently by two registered nurses, and kappa and observed proportion of agreement were calculated as estimates of reliability. For some diagnoses we observed high proportions of agreement, yet paradoxically low kappa values. This article presents an in-depth statistical analysis to resolve this paradox. Results from our analysis also suggest means for planning improvements in the diagnostic performance of participating RNs. Consequently, our approach can be used in similar studies of diagnosis reliability to enhance nursing research, education, and practice.