Jamrozik J, Fatehi J, Schaeffer L R
Centre for Genetic Improvement of Livestock, Department of Animal and Poultry Science, University of Guelph, Guelph, ON, Canada.
J Anim Breed Genet. 2007 Feb;124(1):3-11. doi: 10.1111/j.1439-0388.2007.00633.x.
Robust procedures for estimation of breeding values were applied to multiple-trait random regression test-day (TD) model to reduce the influence of outliers on inferences. Robust estimation methods consisted of correcting selected observations (defined as outliers) in the process of solving mixed-model equations in such a way that 'new' observations gave residuals (actual observation minus predicted) within k residual standard deviations for a given day in milk in 305-day lactation. Data were 980,503 TD records on 63,346 Canadian Jersey cows. Milk, fat, protein and somatic cell score in the first three lactations were analysed jointly in the model that included fixed herd-TD effect and regressions within region-age-season of calving, and regressions with random coefficients for animal genetic and permanent environmental effects. All regressions were orthogonal polynomials of order 4. Robust procedures for k = 1.5, 2.0, 2.5, 2.75 and 3.0 were contrasted with the regular best linear unbiased prediction (BLUP) method in terms of numbers and distributions of outliers, and estimated breeding values (EBV) of animals. Distributions of outliers were similar across traits and lactations. Early days in milk (from 5 to 15) were associated with larger frequency of outliers compared with the remaining part of lactation. Several, computationally simple, robust methods (for k > 2.0) reduced the influence of outlier observations in the model and improved the overall model performance. Differences in rankings of animals from robust evaluations were small compared with the regular BLUP method. No clear associations between changes in EBV (rankings) of top animals from different methods and the occurrence of outliers were detected.
稳健的育种值估计程序应用于多性状随机回归测定日(TD)模型,以减少异常值对推断的影响。稳健估计方法包括在求解混合模型方程的过程中校正选定的观测值(定义为异常值),使得“新”观测值在305天泌乳期内给定产奶日的k个残差标准差范围内产生残差(实际观测值减去预测值)。数据为63346头加拿大泽西奶牛的980503条TD记录。对前三个泌乳期的牛奶、脂肪、蛋白质和体细胞评分进行联合分析,模型中包括固定的牛群-测定日效应以及产犊地区-年龄-季节内的回归,以及动物遗传和永久环境效应的随机系数回归。所有回归均为4阶正交多项式。将k = 1.5、2.0、2.5、2.75和3.0的稳健程序与常规最佳线性无偏预测(BLUP)方法在异常值的数量和分布以及动物的估计育种值(EBV)方面进行了对比。异常值的分布在不同性状和泌乳期之间相似。与泌乳期的其余部分相比,产奶早期(第5至15天)的异常值频率更高。几种计算简单的稳健方法(k > 2.0)减少了模型中异常观测值的影响,提高了整体模型性能。与常规BLUP方法相比,稳健评估中动物排名的差异较小。未检测到不同方法中顶级动物的EBV(排名)变化与异常值出现之间的明确关联。