Weir Joseph P
Applied Physiology Laboratory, Division of Physical Therapy, Des Moines University-Osteopathic Medical Center, Des Moines, Iowa 50312, USA.
J Strength Cond Res. 2005 Feb;19(1):231-40. doi: 10.1519/15184.1.
Reliability, the consistency of a test or measurement, is frequently quantified in the movement sciences literature. A common metric is the intraclass correlation coefficient (ICC). The standard error of measurement (SEM), which can be calculated from the ICC, is also frequently reported in reliability studies. However, there are several versions of the ICC, and confusion exists in the movement sciences regarding which ICC to use. Further, the utility of the SEM is not fully appreciated. In this review, the basics of classic reliability theory are addressed in the context of choosing and interpreting an ICC. The primary distinction between ICC equations is argued to be the inclusion (equations 2,1 and 2,k) or exclusion (equations 3,1 and 3,k) of systematic error in the denominator of the ICC equation. Inferential tests of mean differences, which are performed in the process of deriving the variance components needed to calculate ICC values, are useful for determining whether systematic error is present. If it is, the measurement schedule should be modified (removing trials where learning and/or fatigue effects are present) to remove the systematic error, after which ICC equations that consider only random error may be safely used. The use of ICC values is discussed in the context of estimating the effects of measurement error on sample size, statistical power, and correlation attenuation. Finally, calculation and application of the SEM are discussed. It is shown how the SEM and its variants can be used to construct confidence intervals for individual scores and to determine the minimal difference an individual must exhibit before one can be confident that a true change in performance has occurred.
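As an illustration of the quantities named in the abstract, the following is a minimal Python sketch, not the author's own code. It assumes a complete subjects-by-trials table and uses the standard two-way ANOVA mean squares with the Shrout and Fleiss forms of ICC(2,1) and ICC(3,1). The SEM is taken as the square root of the error mean square, which is only one of the variants the abstract alludes to, and the minimal difference is taken as SEM × 1.96 × √2 (a 95% level is assumed). The function name `reliability_stats` and the example scores are hypothetical.

```python
import math


def reliability_stats(scores):
    """Reliability summary for a subjects x trials table (list of equal-length rows).

    Assumes no missing data, at least two subjects and two trials,
    and some between-subject variation.
    """
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of trials
    grand = sum(sum(row) for row in scores) / (n * k)
    subject_means = [sum(row) / k for row in scores]
    trial_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares (one observation per cell).
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_subjects = k * sum((m - grand) ** 2 for m in subject_means)
    ss_trials = n * sum((m - grand) ** 2 for m in trial_means)
    ss_error = ss_total - ss_subjects - ss_trials

    ms_s = ss_subjects / (n - 1)              # between-subjects mean square
    ms_t = ss_trials / (k - 1)                # between-trials mean square (systematic error)
    ms_e = ss_error / ((n - 1) * (k - 1))     # residual mean square (random error)

    # ICC(2,1): systematic (trial) error is included in the denominator.
    icc_2_1 = (ms_s - ms_e) / (ms_s + (k - 1) * ms_e + k * (ms_t - ms_e) / n)
    # ICC(3,1): systematic error is excluded; only random error is considered.
    icc_3_1 = (ms_s - ms_e) / (ms_s + (k - 1) * ms_e)

    # F ratio for the trials effect; a large value suggests systematic error.
    f_trials = ms_t / ms_e if ms_e > 0 else math.inf

    sem = math.sqrt(ms_e)                     # assumed SEM variant: sqrt(MS_error)
    md = sem * 1.96 * math.sqrt(2)            # minimal difference, 95% level assumed
    return {"icc_2_1": icc_2_1, "icc_3_1": icc_3_1,
            "f_trials": f_trials, "sem": sem, "md": md}


# Hypothetical data: five subjects, two trials, scores tend to rise on trial 2.
example = [[10, 12], [12, 13], [14, 16], [16, 16], [18, 21]]
for name, value in reliability_stats(example).items():
    print(f"{name}: {value:.3f}")
```

In this hypothetical data set the second trial is systematically higher, so ICC(2,1) comes out lower than ICC(3,1), which is the distinction between the two families of equations that the abstract emphasizes. A p-value for the trials effect would require an F distribution (for example from SciPy) and is deliberately left out of the sketch.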