Eliasziw M, Young S L, Woodbury M G, Fryday-Field K
Department of Epidemiology and Biostatistics, University of Western Ontario, London, Canada.
Phys Ther. 1994 Aug;74(8):777-88. doi: 10.1093/ptj/74.8.777.
Statistical methodology for the concurrent assessment of interrater and intrarater reliability is presented. Application of the methodology is illustrated with an example of one therapist using two goniometers repeatedly to measure knee joint angles. Methods for estimating the coefficients, testing hypotheses, constructing confidence intervals, and computing sample size requirements are provided. In addition, the calculation and clinical interpretation of the standard error of measurement (SEM) are discussed. It is recommended that (1) when both interrater and intrarater reliability are being assessed, a repeated-measures design be used to take advantage of the increased precision gained by using all observations in the statistical analysis, and (2) appropriate statistical tests, confidence intervals, and SEMs always be used in conjunction with the estimated reliability coefficients.
本文介绍了用于同时评估评分者间信度和评分者内信度的统计方法。通过一名治疗师使用两个测角仪反复测量膝关节角度的实例来说明该方法的应用。文中提供了估计系数、检验假设、构建置信区间以及计算样本量要求的方法。此外,还讨论了测量标准误(SEM)的计算和临床解释。建议:(1)在同时评估评分者间信度和评分者内信度时,采用重复测量设计,以便利用在统计分析中使用所有观测值所获得的更高精度;(2)适当的统计检验、置信区间和SEM应始终与估计的信度系数结合使用。