Tusell L, Pérez-Rodríguez P, Forni S, Gianola D
Department of Animal Sciences, University of Wisconsin-Madison, Madison, WI, USA.
J Anim Breed Genet. 2014 Apr;131(2):105-15. doi: 10.1111/jbg.12070. Epub 2014 Jan 8.
Predictive ability of yet-to-be observed litter size (pig) grain yield (wheat) records of several reproducing kernel Hilbert spaces (RKHS) regression models combining different number of Gaussian or t kernels was evaluated. Predictive performance was assessed as the average (over 50 replicates) predictive correlation in the testing set. Predictions from these models were combined using three different types of model averaging: (i) mean of predicted phenotypes obtained in each model, (ii) weighted average using mean squared error as weight or (iii) using the marginal likelihood as weight. (ii) and (iii) were obtained in a validation set with 5% of the data. Phenotypes consisted of 2598, 1604 and 1879 average litter size records from three commercial pig lines and wheat grain yield of 599 lines evaluated in four macro-environments. SNPs from the PorcineSNP60 BeadChip and 1447 DArT markers were used as predictors for the pig and wheat data analyses, respectively. Gaussian and univariate t kernels led to same predictive performance. Multikernel RKHS regression models overcame shortcomings of single kernel models (increasing the predictive correlation of RKHS models by 0.05 where 3 Gaussian or t kernels were fitted in the RKHS models simultaneously). None of the proposed averaging strategies improved the predictive correlations attained with single models using multiple kernel fitting.
评估了几种组合不同数量高斯核或t核的再生核希尔伯特空间(RKHS)回归模型对尚未观测到的窝产仔数(猪)和谷物产量(小麦)记录的预测能力。预测性能通过测试集中的平均(50次重复)预测相关性来评估。使用三种不同类型的模型平均方法对这些模型的预测结果进行组合:(i)每个模型中获得的预测表型的均值;(ii)以均方误差为权重的加权平均;或(iii)以边际似然为权重。(ii)和(iii)是在包含5%数据的验证集中获得的。表型包括来自三个商业猪品系的2598、1604和1879条平均窝产仔数记录,以及在四个宏观环境中评估的599个品系的小麦谷物产量。分别使用PorcineSNP60 BeadChip的单核苷酸多态性(SNP)和1447个DArT标记作为猪和小麦数据分析的预测因子。高斯核和单变量t核导致相同的预测性能。多核RKHS回归模型克服了单核模型的缺点(在RKHS模型中同时拟合3个高斯核或t核时,将RKHS模型的预测相关性提高了0.05)。所提出的平均策略均未提高使用多核拟合的单模型所达到的预测相关性。