Department of Zoology, University of Otago, 340 Great King Street, Dunedin, 9054, New Zealand.
Biol Rev Camb Philos Soc. 2010 Nov;85(4):935-56. doi: 10.1111/j.1469-185X.2010.00141.x.
Repeatability (more precisely the common measure of repeatability, the intra-class correlation coefficient, ICC) is an important index for quantifying the accuracy of measurements and the constancy of phenotypes. It is the proportion of phenotypic variation that can be attributed to between-subject (or between-group) variation. As a consequence, the non-repeatable fraction of phenotypic variation is the sum of measurement error and phenotypic flexibility. There are several ways to estimate repeatability for Gaussian data, but there are no formal agreements on how repeatability should be calculated for non-Gaussian data (e.g. binary, proportion and count data). In addition to point estimates, appropriate uncertainty estimates (standard errors and confidence intervals) and statistical significance for repeatability estimates are required regardless of the types of data. We review the methods for calculating repeatability and the associated statistics for Gaussian and non-Gaussian data. For Gaussian data, we present three common approaches for estimating repeatability: correlation-based, analysis of variance (ANOVA)-based and linear mixed-effects model (LMM)-based methods, while for non-Gaussian data, we focus on generalised linear mixed-effects models (GLMM) that allow the estimation of repeatability on the original and on the underlying latent scale. We also address a number of methods for calculating standard errors, confidence intervals and statistical significance; the most accurate and recommended methods are parametric bootstrapping, randomisation tests and Bayesian approaches. We advocate the use of LMM- and GLMM-based approaches mainly because of the ease with which confounding variables can be controlled for. Furthermore, we compare two types of repeatability (ordinary repeatability and extrapolated repeatability) in relation to narrow-sense heritability. This review serves as a collection of guidelines and recommendations for biologists to calculate repeatability and heritability from both Gaussian and non-Gaussian data.
可重复性(更准确地说是可重复性的常用度量,即组内相关系数 ICC)是量化测量精度和表型稳定性的重要指标。它是可以归因于受试者间(或组间)变异的表型变异比例。因此,表型变异的不可重复性部分是测量误差和表型灵活性的总和。有几种方法可以估计正态数据的可重复性,但对于非正态数据(例如二项式、比例和计数数据),如何计算可重复性尚无正式协议。除了点估计外,无论数据类型如何,都需要适当的不确定性估计(标准误差和置信区间)和可重复性估计的统计显着性。我们回顾了计算可重复性和相关统计数据的方法,包括正态和非正态数据。对于正态数据,我们提出了三种常用的可重复性估计方法:基于相关的方法、基于方差分析(ANOVA)的方法和基于线性混合效应模型(LMM)的方法,而对于非正态数据,我们专注于广义线性混合效应模型(GLMM),允许在原始和潜在潜在尺度上估计可重复性。我们还解决了计算标准误差、置信区间和统计显着性的多种方法;最准确和推荐的方法是参数引导、随机化检验和贝叶斯方法。我们主张主要使用基于 LMM 和 GLMM 的方法,因为可以更轻松地控制混杂变量。此外,我们还比较了两种类型的可重复性(普通可重复性和外推可重复性)与狭义遗传力的关系。本综述旨在为生物学家提供从正态和非正态数据计算可重复性和遗传力的指南和建议。