Meyer Karin
Animal Genetics and Breeding Unit, University of New England, Armidale NSW 2351, Australia.
Genet Sel Evol. 2005 Sep-Oct;37(5):473-500. doi: 10.1186/1297-9686-37-6-473.
Regression on B-spline basis functions has been advocated as an alternative to orthogonal polynomials in random regression analyses. Basic theory of splines in mixed model analyses is reviewed, and estimates from analyses of weights of Australian Angus cattle from birth to 820 days of age are presented. Data comprised 84533 records on 20731 animals in 43 herds, with a high proportion of animals having 4 or more weights recorded. Changes in weights with age were modelled through B-splines of age at recording. A total of thirteen analyses, considering different combinations of linear, quadratic and cubic B-splines and up to six knots, were carried out. Results showed good agreement for all ages with many records, but fluctuated where data were sparse. On the whole, analyses using B-splines appeared more robust against "end-of-range" problems and yielded more consistent and accurate estimates of the first eigenfunctions than previous polynomial analyses. A model fitting quadratic B-splines, with knots at 0, 200, 400, 600 and 821 days and a total of 91 covariance components, appeared to be a good compromise between the level of detail of the model, the number of parameters to be estimated, the plausibility of results, and fit, measured as residual mean square error.
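As an illustration of the covariables such a model regresses on, the following minimal sketch evaluates quadratic B-spline basis functions of age for the knots named in the abstract (0, 200, 400, 600 and 821 days). It uses the standard Cox-de Boor recursion with repeated boundary knots; the function name `bspline_basis` and the choice of clamped boundary knots are assumptions made for this example, not details taken from the paper.

```python
def bspline_basis(x, knots, degree):
    """Evaluate all B-spline basis functions of the given degree at x.

    `knots` are the distinct knots (boundary knots included). The boundary
    knots are repeated `degree` extra times (a clamped knot vector, assumed
    here), and the Cox-de Boor recursion is applied.
    """
    t = [knots[0]] * degree + list(knots) + [knots[-1]] * degree

    # Degree 0: indicator functions of the half-open knot intervals.
    b = [1.0 if t[i] <= x < t[i + 1] else 0.0 for i in range(len(t) - 1)]
    if x == t[-1]:
        # Close the last non-empty interval on the right so that the
        # upper end of the age range is covered.
        for i in range(len(t) - 2, -1, -1):
            if t[i] < t[i + 1]:
                b[i] = 1.0
                break

    # Raise the degree one step at a time; terms with a zero-width
    # denominator are treated as zero.
    for d in range(1, degree + 1):
        nxt = []
        for i in range(len(t) - 1 - d):
            left = right = 0.0
            if t[i + d] > t[i]:
                left = (x - t[i]) / (t[i + d] - t[i]) * b[i]
            if t[i + d + 1] > t[i + 1]:
                right = (t[i + d + 1] - x) / (t[i + d + 1] - t[i + 1]) * b[i + 1]
            nxt.append(left + right)
        b = nxt
    return b


knots = [0, 200, 400, 600, 821]   # knots in days, as given in the abstract
degree = 2                        # quadratic B-splines

basis_at_300 = bspline_basis(300, knots, degree)
n_coeff = len(basis_at_300)                 # regression coefficients per effect
n_cov_per_effect = n_coeff * (n_coeff + 1) // 2  # distinct elements of a full covariance matrix
```

With these knots and degree there are six basis functions, of which at most three are non-zero at any age, and they sum to one. A full covariance matrix among six random regression coefficients has 21 distinct elements; how these combine into the 91 covariance components reported depends on the number of random effects and residual variances fitted, which the abstract does not state.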