Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM.

Author information

Weir Joseph P

Affiliation

Applied Physiology Laboratory, Division of Physical Therapy, Des Moines University-Osteopathic Medical Center, Des Moines, Iowa 50312, USA.

Publication information

J Strength Cond Res. 2005 Feb;19(1):231-40. doi: 10.1519/15184.1.

PMID:15705040

Abstract

Reliability, the consistency of a test or measurement, is frequently quantified in the movement sciences literature. A common metric is the intraclass correlation coefficient (ICC). In addition, the SEM, which can be calculated from the ICC, is also frequently reported in reliability studies. However, there are several versions of the ICC, and confusion exists in the movement sciences regarding which ICC to use. Further, the utility of the SEM is not fully appreciated. In this review, the basics of classic reliability theory are addressed in the context of choosing and interpreting an ICC. The primary distinction between ICC equations is argued to be one concerning the inclusion (equations 2,1 and 2,k) or exclusion (equations 3,1 and 3,k) of systematic error in the denominator of the ICC equation. Inferential tests of mean differences, which are performed in the process of deriving the necessary variance components for the calculation of ICC values, are useful to determine if systematic error is present. If so, the measurement schedule should be modified (removing trials where learning and/or fatigue effects are present) to remove systematic error, and ICC equations that only consider random error may be safely used. The use of ICC values is discussed in the context of estimating the effects of measurement error on sample size, statistical power, and correlation attenuation. Finally, calculation and application of the SEM are discussed. It is shown how the SEM and its variants can be used to construct confidence intervals for individual scores and to determine the minimal difference needed to be exhibited for one to be confident that a true change in performance of an individual has occurred.
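
The quantities named in the abstract can be illustrated with a minimal sketch. The abstract itself gives no formulas, so the code below assumes the commonly used Shrout and Fleiss forms of ICC(2,1) and ICC(3,1) computed from two-way ANOVA mean squares, the relation SEM = SD × sqrt(1 − ICC), and a 95% minimal difference of SEM × 1.96 × sqrt(2). All function names and the example scores are hypothetical and chosen only for illustration.

```python
import math
import numpy as np


def two_way_mean_squares(scores):
    """Mean squares from a subjects-by-trials table (rows = subjects)."""
    x = np.asarray(scores, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_subjects = k * np.sum((x.mean(axis=1) - grand) ** 2)
    ss_trials = n * np.sum((x.mean(axis=0) - grand) ** 2)
    ss_error = np.sum((x - grand) ** 2) - ss_subjects - ss_trials
    ms_s = ss_subjects / (n - 1)            # between-subjects mean square
    ms_t = ss_trials / (k - 1)              # between-trials (systematic error)
    ms_e = ss_error / ((n - 1) * (k - 1))   # residual (random error)
    return ms_s, ms_t, ms_e, n, k


def icc_2_1(scores):
    """ICC(2,1): systematic trial error is included in the denominator."""
    ms_s, ms_t, ms_e, n, k = two_way_mean_squares(scores)
    return (ms_s - ms_e) / (ms_s + (k - 1) * ms_e + k * (ms_t - ms_e) / n)


def icc_3_1(scores):
    """ICC(3,1): only random error appears in the denominator."""
    ms_s, _, ms_e, _, k = two_way_mean_squares(scores)
    return (ms_s - ms_e) / (ms_s + (k - 1) * ms_e)


def sem_from_icc(sd, icc):
    """Standard error of measurement from the score SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def minimal_difference(sem, z=1.96):
    """Smallest individual change exceeding measurement error (95% by default)."""
    return sem * z * math.sqrt(2.0)


if __name__ == "__main__":
    # Hypothetical scores: 5 subjects measured on 2 trials.
    data = [[146, 140], [148, 152], [170, 152], [90, 99], [157, 145]]
    icc = icc_3_1(data)
    sd = np.asarray(data, dtype=float).std(ddof=1)  # SD of all scores
    sem = sem_from_icc(sd, icc)
    print("ICC(2,1):", icc_2_1(data))
    print("ICC(3,1):", icc)
    print("SEM:", sem)
    print("Minimal difference:", minimal_difference(sem))
```

When the trial means differ, ICC(2,1) falls below ICC(3,1) because the between-trials mean square enters its denominator, which mirrors the inclusion-versus-exclusion distinction the abstract describes.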

Similar articles

Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM.

J Strength Cond Res. 2005 Feb;19(1):231-40. doi: 10.1519/15184.1.

Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

J Am Acad Audiol. 2018 Mar;29(3):223-232. doi: 10.3766/jaaa.16134.

Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine.

Sports Med. 1998 Oct;26(4):217-38. doi: 10.2165/00007256-199826040-00002.

The Spanish version of the Patient-Rated Wrist Evaluation outcome measure: cross-cultural adaptation process, reliability, measurement error and construct validity.

Health Qual Life Outcomes. 2017 Aug 24;15(1):169. doi: 10.1186/s12955-017-0745-2.

Test-retest reliability of the 5-minute psychomotor vigilance task in working-aged females.

J Neurosci Methods. 2022 Jan 1;365:109379. doi: 10.1016/j.jneumeth.2021.109379. Epub 2021 Oct 7.

Reliability of 3D upper limb motion analysis in children with obstetric brachial plexus palsy.

Physiol Meas. 2017 Mar;38(3):524-538. doi: 10.1088/1361-6579/aa5c13. Epub 2017 Jan 31.

Comparison of methods for estimating the intraclass correlation coefficient for binary responses in cancer prevention cluster randomized trials.

Contemp Clin Trials. 2012 Sep;33(5):869-80. doi: 10.1016/j.cct.2012.05.004. Epub 2012 May 22.

Validity and Reliability of a Digital Inclinometer to Assess Knee Joint Position Sense in an Open Kinetic Chain.

J Sport Rehabil. 2019 May 1;28(4):332-338. doi: 10.1123/jsr.2017-0221. Epub 2018 Dec 12.

Endurance test selection optimized via sample size predictions.

J Appl Physiol (1985). 2020 Sep 1;129(3):467-473. doi: 10.1152/japplphysiol.00408.2020. Epub 2020 Jul 30.

Grip strength in children: test-retest reliability using Grippit.

Acta Paediatr. 2008 Sep;97(9):1226-31. doi: 10.1111/j.1651-2227.2008.00895.x. Epub 2008 Jun 6.

Cited by

Predicting cognitive functioning in mood disorders through smartphone typing dynamics.

J Psychopathol Clin Sci. 2025 Sep 4. doi: 10.1037/abn0001052.

Effects of unsupervised walking on walk performance and functional mobility in individuals with chronic stroke: a blind randomized clinical trial.

Sao Paulo Med J. 2025 Aug 29;143(5):e2024190. doi: 10.1590/1516-3180.2024.0190.R2.26022025. eCollection 2025.

Comparison of intra-session reliability of force-velocity-power variables between a horizontal dynamic leg press device and vertical jump tests.

PLoS One. 2025 Sep 2;20(9):e0331671. doi: 10.1371/journal.pone.0331671. eCollection 2025.

Comparison of the inter-recti distance in nulliparous women measured in supine and standing positions using ultrasound imaging.

Sci Rep. 2025 Aug 24;15(1):31088. doi: 10.1038/s41598-025-16781-0.

Association Between Stiffness of the Deep Fibres of the Tibialis Anterior Muscle and Posture Performance After Ankle Fracture Surgery.

J Funct Morphol Kinesiol. 2025 Aug 1;10(3):300. doi: 10.3390/jfmk10030300.

Effectiveness of a Flossing Protocol and Manual Therapy in Improving the Clinical and Functional Status of Subjects with Recurrent Ankle Sprains; A Double-Blind Randomized Clinical Trial.

Med Sci (Basel). 2025 Aug 20;13(3):149. doi: 10.3390/medsci13030149.

Association between weight-bearing ankle dorsiflexion range of motion during deep squat sitting and quality of life after ankle fracture surgery: a cross-sectional study.

Front Rehabil Sci. 2025 Aug 4;6:1645621. doi: 10.3389/fresc.2025.1645621. eCollection 2025.

Reliability and reactivity of heart rate variability and pupillometry in response to controlled autonomic perturbations in university students.

Behav Res Methods. 2025 Aug 19;57(9):267. doi: 10.3758/s13428-025-02793-1.

Reproducibility and validity of adapted clinical tests for the assessment of muscle strength in community-dwelling older adults living with Alzheimer's disease.

BMC Geriatr. 2025 Aug 18;25(1):636. doi: 10.1186/s12877-025-06269-x.

Assessing Social Participation Among Kidney Transplant Recipients Using PROMIS Computer Adaptive Testing.

Kidney Int Rep. 2025 May 19;10(8):2708-2719. doi: 10.1016/j.ekir.2025.05.023. eCollection 2025 Aug.
