Suppr超能文献

使用等效性检验评估测量一致性的基础指南。

A Primer on the Use of Equivalence Testing for Evaluating Measurement Agreement.

机构信息

Department of Statistics, Iowa State University, Ames, IA.

出版信息

Med Sci Sports Exerc. 2018 Apr;50(4):837-845. doi: 10.1249/MSS.0000000000001481.

Abstract

PURPOSE

Statistical equivalence testing is more appropriate than conventional tests of difference to assess the validity of physical activity (PA) measures. This article presents the underlying principles of equivalence testing and gives three examples from PA and fitness assessment research.

METHODS

The three examples illustrate different uses of equivalence tests. Example 1 uses PA data to evaluate an activity monitor's equivalence to a known criterion. Example 2 illustrates the equivalence of two field-based measures of physical fitness with no known reference method. Example 3 uses regression to evaluate an activity monitor's equivalence across a suite of 23 activities.

RESULTS

The examples illustrate the appropriate reporting and interpretation of results from equivalence tests. In the first example, the mean criterion measure is significantly within ±15% of the mean PA monitor. The mean difference is 0.18 METs and the 90% confidence interval of -0.15 to 0.52 is inside the equivalence region of -0.65 to 0.65. In the second example, we chose to define equivalence for these two measures as a ratio of mean values between 0.98 and 1.02. The estimated ratio of mean V˙O2 values is 0.99, which is significantly (P = 0.007) inside the equivalence region. In the third example, the PA monitor is not equivalent to the criterion across the suite of activities. The estimated regression intercept and slope are -1.23 and 1.06. Neither confidence interval is within the suggested regression equivalence regions.

CONCLUSIONS

When the study goal is to show similarity between methods, equivalence testing is more appropriate than traditional statistical tests of differences (e.g., ANOVA and t-tests).

摘要

目的

与传统的差异检验相比,统计学等效性检验更适合评估体力活动(PA)测量的有效性。本文介绍了等效性检验的基本原理,并给出了来自 PA 和体能评估研究的三个示例。

方法

这三个示例说明了等效性检验的不同用途。示例 1 使用 PA 数据来评估活动监测器与已知标准的等效性。示例 2 说明了两种基于现场的体能测量方法与无已知参考方法的等效性。示例 3 使用回归来评估活动监测器在 23 项活动中的等效性。

结果

这些示例说明了从等效性检验中得出的结果的适当报告和解释。在第一个示例中,平均标准测量值显著在±15%的平均 PA 监测值内。平均差异为 0.18 METs,-0.15 至 0.52 的 90%置信区间在-0.65 至 0.65 的等效区间内。在第二个示例中,我们选择将这两种测量方法的等效性定义为平均值比值在 0.98 和 1.02 之间。估计的平均 V˙O2 值比值为 0.99,显著(P=0.007)在等效区间内。在第三个示例中,PA 监测器在整个活动套件中与标准不一致。估计的回归截距和斜率分别为-1.23 和 1.06。置信区间都不在建议的回归等效区间内。

结论

当研究目标是显示方法之间的相似性时,等效性检验比传统的差异检验(如 ANOVA 和 t 检验)更合适。

相似文献

3
Primer of statistics in dental research: part I.牙科研究中的统计学基础:第一部分。
J Prosthodont Res. 2014 Jan;58(1):11-6. doi: 10.1016/j.jpor.2013.12.006. Epub 2014 Jan 22.
8
Equivalence tests for comparing correlation and regression coefficients.用于比较相关系数和回归系数的等效性检验。
Br J Math Stat Psychol. 2015 May;68(2):292-309. doi: 10.1111/bmsp.12045. Epub 2014 Oct 27.

引用本文的文献

本文引用的文献

5
Validity and Calibration of the Youth Activity Profile.青少年活动概况的效度与校准
PLoS One. 2015 Dec 2;10(12):e0143949. doi: 10.1371/journal.pone.0143949. eCollection 2015.
9
Equating accelerometer estimates among youth: The Rosetta Stone 2.青少年计步器数据的等效性:罗塞塔石 2 号。
J Sci Med Sport. 2016 Mar;19(3):242-249. doi: 10.1016/j.jsams.2015.02.006. Epub 2015 Feb 23.
10

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验