Kudinov Andrei A, Koivula Minna, Aamand Gert P, Strandén Ismo, Mäntysaari Esa A
Natural Resources Institute Finland (Luke), Jokioinen, Finland.
Nordic Cattle Genetic Evaluation, Aarhus, Denmark.
Front Genet. 2022 Nov 21;13:1012205. doi: 10.3389/fgene.2022.1012205. eCollection 2022.
Single-step genomic BLUP (ssGBLUP) model for routine genomic prediction of breeding values is developed intensively for many dairy cattle populations. Compatibility between the genomic () and the pedigree () relationship matrices remains an important challenge required in ssGBLUP. The compatibility relates to the amount of missing pedigree information. There are two prevailing approaches to account for the incomplete pedigree information: unknown parent groups (UPG) and metafounders (MF). unknown parent groups have been used routinely in pedigree-based evaluations to account for the differences in genetic level between groups of animals with missing parents. The MF approach is an extension of the UPG approach. The MF approach defines MF which are related pseudo-individuals. The MF approach needs a matrix of the size number of MF to describe relationships between MF. The UPG and MF can be the same. However, the challenge in the MF approach is the estimation of having many MF, typically needed in dairy cattle. In our study, we present an approach to fit the same amount of MF as UPG in ssGBLUP with Woodbury matrix identity (ssGTBLUP). We used 305-day milk, protein, and fat yield data from the DFS (Denmark, Finland, Sweden) Red Dairy cattle population. The pedigree had more than 6 million animals of which 207,475 were genotyped. We constructed the preliminary gamma matrix ( ) with 29 MF which was expanded to 148 MF by a covariance function ( ). The quality of the extrapolation of the matrix was studied by comparing average off-diagonal elements between breed groups. On average relationships among MF in were 1.8% higher than in . The use of increased the correlation between the and matrices by 0.13 and 0.11 for the diagonal and off-diagonal elements, respectively. [G]EBV were predicted using the ssGTBLUP and Pedigree-BLUP models with the MF and UPG. The prediction reliabilities were slightly higher for the ssGTBLUP model using MF than UPG. The ssGBLUP MF model showed less overprediction compared to other models.
单步基因组最佳线性无偏预测(ssGBLUP)模型被广泛应用于许多奶牛群体的育种值常规基因组预测。基因组()和系谱()关系矩阵之间的兼容性仍然是ssGBLUP中需要面对的一个重要挑战。这种兼容性与缺失系谱信息的数量有关。有两种常用方法来处理不完整的系谱信息:未知亲本组(UPG)和元祖(MF)。未知亲本组已常规用于基于系谱的评估中,以解释有缺失亲本的动物组之间遗传水平的差异。MF方法是UPG方法的扩展。MF方法定义了与伪个体相关的MF。MF方法需要一个大小为MF数量的矩阵来描述MF之间的关系。UPG和MF可以相同。然而,MF方法面临的挑战是在奶牛群体中通常需要估计大量的MF。在我们的研究中,我们提出了一种在具有伍德伯里矩阵恒等式的ssGBLUP(ssGTBLUP)中拟合与UPG数量相同的MF的方法。我们使用了来自DFS(丹麦、芬兰、瑞典)红奶牛群体的305天产奶量、蛋白质产量和脂肪产量数据。系谱中有超过600万头动物,其中207,475头进行了基因分型。我们构建了包含29个MF的初步伽马矩阵(),并通过协方差函数()将其扩展到148个MF。通过比较品种组之间的平均非对角元素,研究了矩阵外推的质量。平均而言,中的MF之间的关系比中的高1.8%。使用增加了矩阵与矩阵之间对角和非对角元素的相关性,分别提高了0.13和0.11。使用MF和UPG的ssGTBLUP和系谱BLUP模型预测了[G]EBV。使用MF的ssGTBLUP模型的预测可靠性略高于使用UPG的情况。与其他模型相比,ssGBLUP MF模型的过度预测较少。