Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
Facultad de Agronomía, Universidad de Buenos Aires, C1417DSQ, Buenos Aires, Argentina.
Genet Sel Evol. 2022 Jul 16;54(1):52. doi: 10.1186/s12711-022-00741-7.
Single-step genomic predictions obtained from a breeding value model require calculating the inverse of the genomic relationship matrix [Formula: see text]. The Algorithm for Proven and Young (APY) creates a sparse representation of [Formula: see text] with a low computational cost. APY consists of selecting a group of core animals and expressing the breeding values of the remaining animals as a linear combination of those from the core animals plus an error term. The objectives of this study were to: (1) extend APY to marker effects models; (2) derive equations for marker effect estimates when APY is used for breeding value models, and (3) show the implication of selecting a specific group of core animals in terms of a marker effects model.
We derived a family of marker effects models called APY-SNP-BLUP. It differs from the classic marker effects model in that the row space of the genotype matrix is reduced and an error term is fitted for non-core animals. We derived formulas for marker effect estimates that take this error term in account. The prediction error variance (PEV) of the marker effect estimates depends on the PEV for core animals but not directly on the PEV of the non-core animals. We extended the APY-SNP-BLUP to include a residual polygenic effect and accommodate non-genotyped animals. We show that selecting a specific group of core animals is equivalent to select a subspace of the row space of the genotype matrix. As the number of core animals increases, subspaces corresponding to different sets of core animals tend to overlap, showing that random selection of core animals is algebraically justified.
The APY-(ss)GBLUP models can be expressed in terms of marker effect models. When the number of core animals is equal to the rank of the genotype matrix, APY-SNP-BLUP is identical to the classic marker effects model. If the number of core animals is less than the rank of the genotype matrix, genotypes for non-core animals are imputed as a linear combination of the genotypes of the core animals. For estimating SNP effects, only relationships and estimated breeding values for core animals are needed.
从育种值模型获得的一步法基因组预测需要计算基因组关系矩阵的逆矩阵[公式:见正文]。算法为证明和年轻(APY)用低计算成本创建基因组关系矩阵的稀疏表示[公式:见正文]。APY 由选择一组核心动物组成,并将其余动物的育种值表示为核心动物的线性组合加上误差项。本研究的目的是:(1)将 APY 扩展到标记效应模型;(2)当 APY 用于育种值模型时,推导出标记效应估计的方程,(3)根据标记效应模型显示选择特定核心动物组的影响。
我们推导出了一组称为 APY-SNP-BLUP 的标记效应模型。它与经典标记效应模型的不同之处在于,基因型矩阵的行空间减少,并且为非核心动物拟合误差项。我们推导出了考虑到该误差项的标记效应估计的公式。标记效应估计的预测误差方差(PEV)取决于核心动物的 PEV,但不直接取决于非核心动物的 PEV。我们将 APY-SNP-BLUP 扩展到包括剩余多基因效应,并适应非基因动物。我们表明,选择特定的核心动物组相当于选择基因型矩阵行空间的子空间。随着核心动物数量的增加,不同核心动物组对应的子空间倾向于重叠,这表明随机选择核心动物在代数上是合理的。
APY-(ss)GBLUP 模型可以用标记效应模型表示。当核心动物的数量等于基因型矩阵的秩时,APY-SNP-BLUP 与经典标记效应模型相同。如果核心动物的数量小于基因型矩阵的秩,则非核心动物的基因型被估计为核心动物的基因型的线性组合。对于估计 SNP 效应,只需要核心动物的关系和估计的育种值。