Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
CRV BV, Wassenaarweg 20, 6843 NW, Arnhem, The Netherlands.
Genet Sel Evol. 2018 Nov 3;50(1):51. doi: 10.1186/s12711-018-0429-3.
The single-step single nucleotide polymorphism best linear unbiased prediction (ssSNPBLUP) method, such as single-step genomic BLUP (ssGBLUP), simultaneously analyses phenotypic, pedigree, and genomic information of genotyped and non-genotyped animals. In contrast to ssGBLUP, SNP effects are fitted explicitly as random effects in the ssSNPBLUP model. Similarly, principal components associated with the genomic information can be fitted explicitly as random effects in a single-step principal component BLUP (ssPCBLUP) model to remove noise in genomic information. Single-step genomic BLUP is solved efficiently by using the preconditioned conjugate gradient (PCG) method. Unfortunately, convergence issues have been reported when solving ssSNPBLUP by using PCG. Poor convergence may be linked with poor spectral condition numbers of the preconditioned coefficient matrices of ssSNPBLUP. These condition numbers, and thus convergence, could be improved through the deflated PCG (DPCG) method, which is a two-level PCG method for ill-conditioned linear systems. Therefore, the first aim of this study was to compare the properties of the preconditioned coefficient matrices of ssGBLUP and ssSNPBLUP, and to document convergence patterns that are obtained with the PCG method. The second aim was to implement and test the efficiency of a DPCG method for solving ssSNPBLUP and ssPCBLUP.
For two dairy cattle datasets, the smallest eigenvalues obtained for ssSNPBLUP (ssPCBLUP) and ssGBLUP, both solved with the PCG method, were similar. However, the largest eigenvalues obtained for ssSNPBLUP and ssPCBLUP were larger than those for ssGBLUP, which resulted in larger condition numbers and in slow convergence for both systems solved by the PCG method. Different implementations of the DPCG method led to smaller condition numbers, and faster convergence for ssSNPBLUP and for ssPCBLUP, by deflating the largest unfavourable eigenvalues.
Poor convergence of ssSNPBLUP and ssPCBLUP when solved by the PCG method are related to larger eigenvalues and larger condition numbers in comparison to ssGBLUP. These convergence issues were solved with a DPCG method that annihilates the effect of the largest unfavourable eigenvalues of the preconditioned coefficient matrix of ssSNPBLUP and of ssPCBLUP on the convergence of the PCG method. It resulted in a convergence pattern, at least, similar to that of ssGBLUP.
一步法单核苷酸多态性最佳线性无偏预测(ssSNPBLUP)方法,如一步法基因组 BLUP(ssGBLUP),同时分析了已基因型和未基因型动物的表型、系谱和基因组信息。与 ssGBLUP 不同,ssSNPBLUP 模型中 SNP 效应被明确拟合为随机效应。同样,可以将与基因组信息相关的主成分明确拟合为一步法主成分 BLUP(ssPCBLUP)模型中的随机效应,以去除基因组信息中的噪声。ssGBLUP 通过使用预处理共轭梯度(PCG)方法可以有效地解决。不幸的是,当使用 PCG 求解 ssSNPBLUP 时,已经报告了收敛问题。较差的收敛性可能与 ssSNPBLUP 的预处理系数矩阵的较差谱条件数有关。这些条件数,因此收敛性,可以通过膨胀 PCG(DPCG)方法来提高,这是一种用于病态线性系统的两级 PCG 方法。因此,本研究的第一个目的是比较 ssGBLUP 和 ssSNPBLUP 的预处理系数矩阵的特性,并记录使用 PCG 方法获得的收敛模式。第二个目的是实现和测试用于解决 ssSNPBLUP 和 ssPCBLUP 的 DPCG 方法的效率。
对于两个奶牛数据集,使用 PCG 方法求解的 ssSNPBLUP(ssPCBLUP)和 ssGBLUP 的最小特征值相似。然而,ssSNPBLUP 和 ssPCBLUP 的最大特征值大于 ssGBLUP 的最大特征值,这导致两个系统的条件数较大,PCG 方法的收敛速度较慢。不同的 DPCG 方法的实现导致 ssSNPBLUP 和 ssPCBLUP 的条件数较小,收敛速度较快,通过膨胀预处理系数矩阵的最大不利特征值来消除其对 PCG 方法收敛的影响。
当使用 PCG 方法求解时,ssSNPBLUP 和 ssPCBLUP 的较差收敛性与 ssGBLUP 相比,与较大的特征值和较大的条件数有关。这些收敛问题通过 DPCG 方法得到解决,该方法消除了预处理系数矩阵的 ssSNPBLUP 和 ssPCBLUP 的最大不利特征值对 PCG 方法收敛的影响。它产生了至少类似于 ssGBLUP 的收敛模式。