Faculty of Agricultural Sciences and Food, University Ss Cyril and Methodius, PO Box 297, 1000 Skopje, Macedonia.
J Dairy Sci. 2013 Mar;96(3):1834-43. doi: 10.3168/jds.2012-5910. Epub 2013 Jan 26.
One aim of the research was to challenge a previously selected repeatability model with 2 other repeatability models. The main aim, however, was to evaluate random regression models based on the repeatability model with lowest mean-squared error of prediction, using Legendre polynomials up to third order for both animal additive genetic and permanent environmental effects. The random regression and repeatability models were compared for model fit (using likelihood-ratio testing, Akaike information criterion, and the Bayesian information criterion) and the models' mean-squared errors of prediction, and by cross-validation. Cross-validation was carried out by correlating excluded observations in one data set with the animals' breeding values as predicted from the pedigree only in the remaining data, and vice versa (splitting proportion: 0.492). The data was from primiparous goats in 2 closely tied buck circles (17 flocks) in Norway, with 11,438 records for daily milk yield and 5,686 to 5,896 records for content traits (fat, protein, and lactose percentages). A simple pattern was revealed; for daily milk yield with about 5 records per animal in first lactation, a second-order random regression model should be chosen, whereas for content traits that had only about 3 observations per goat, a first-order polynomial was preferred. The likelihood-ratio test, Akaike information criterion, and mean-squared error of prediction favored more complex models, although the results from the latter and the Bayesian information criterion were in the direction of those obtained with cross-validation. As the correlation from cross-validation was largest with random regression, genetic merit was predicted more accurate with random regression models than with the repeatability model.
本研究的目的之一是用另外两个重复性模型来检验先前选定的重复性模型。然而,主要目的是评估基于预测均方误差最小的重复性模型的随机回归模型,同时对动物加性遗传和永久环境效应使用三阶勒让德多项式。通过似然比检验、赤池信息量准则和贝叶斯信息量准则对随机回归和重复性模型进行比较,比较内容包括模型拟合(拟合优度)和预测均方误差,以及交叉验证。交叉验证是通过将一个数据集的排除观测值与仅根据系谱预测的动物育种值相关联来完成的,反之亦然(分割比例:0.492)。该数据来自挪威两个紧密联系的公羊圈(17 个羊群)中的初产母羊,每日产奶量记录 11438 条,含量性状(脂肪、蛋白质和乳糖百分比)记录 5686 到 5896 条。结果揭示了一个简单的模式;对于初产母羊的每日产奶量,每只动物大约有 5 条记录,应选择二阶随机回归模型,而对于每只羊只有大约 3 个观测值的含量性状,首选一阶多项式。似然比检验、赤池信息量准则和预测均方误差倾向于更复杂的模型,尽管后者和贝叶斯信息量准则的结果与交叉验证的结果一致。由于交叉验证的相关性最大,因此随机回归模型比重复性模型更能准确地预测遗传优势。