Département de Sciences Biologiques, Université de Montréal, C.P. 6128, succursale Centre-ville, Montréal, Québec, Canada H3C 3J7.
J Theor Biol. 2009 Aug 21;259(4):727-43. doi: 10.1016/j.jtbi.2009.04.022. Epub 2009 May 4.
Following the pioneering work of Felsenstein and Garland, phylogeneticists have been using regression through the origin to analyze comparative data using independent contrasts. The reason why regression through the origin must be used with such data was revisited. The demonstration led to the formulation of a permutation test for the coefficient of determination and the regression coefficient estimates in regression through the origin. Simulations were carried out to measure type I error and power of the parametric and permutation tests under two models of data generation: regression models I and II (correlation model). Although regression through the origin assumes model I data, in independent contrast data error is present in the explanatory as well as the response variables. Two forms of permutations were investigated to test the regression coefficients: permutation of the values of the response variable y, and permutation of the residuals of the regression model. The simulations showed that the parametric tests or any of the permutation tests can be used when the error is normal, which is the usual assumption in independent contrast studies; only the test by permutation of y should be used when the error is highly asymmetric; and the parametric tests should be used when extreme values are present in covariables. Two examples are presented. The first one concerns non-specificity in fish parasites of the genus Lamellodiscus, the second the richness in parasites in 78 species of mammals.
继 Felsenstein 和 Garland 的开创性工作之后,系统发育学家一直使用起源回归来分析使用独立对比的比较数据。重新探讨了使用这种数据必须使用起源回归的原因。该证明导致了起源回归中决定系数和回归系数估计的置换检验的制定。模拟在两种数据生成模型下测量了参数和置换检验的Ⅰ型错误和功效:回归模型 I 和 II(相关模型)。尽管起源回归假设模型 I 数据,但在独立对比数据中,解释变量和响应变量都存在误差。研究了两种形式的置换来检验回归系数:响应变量 y 值的置换和回归模型残差的置换。模拟表明,当误差为正态时,可以使用参数检验或任何置换检验,这是独立对比研究中的常见假设;仅当误差高度不对称时,才应使用 y 值的置换检验;并且当协变量中存在极值时,应使用参数检验。给出了两个例子。第一个例子涉及 Lamellodiscus 属鱼类寄生虫的非特异性,第二个例子涉及 78 种哺乳动物寄生虫的丰富度。