基于 Proven and Young 算法的标记效应模型与育种值模型和直接基因组值的等效性

On the equivalence between marker effect models and breeding value models and direct genomic values with the Algorithm for Proven and Young.

机构信息

Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.

Facultad de Agronomía, Universidad de Buenos Aires, C1417DSQ, Buenos Aires, Argentina.

出版信息

Genet Sel Evol. 2022 Jul 16;54(1):52. doi: 10.1186/s12711-022-00741-7.

DOI:10.1186/s12711-022-00741-7

PMID:35842585

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9288049/

Abstract

BACKGROUND

Single-step genomic predictions obtained from a breeding value model require calculating the inverse of the genomic relationship matrix [Formula: see text]. The Algorithm for Proven and Young (APY) creates a sparse representation of [Formula: see text] with a low computational cost. APY consists of selecting a group of core animals and expressing the breeding values of the remaining animals as a linear combination of those from the core animals plus an error term. The objectives of this study were to: (1) extend APY to marker effects models; (2) derive equations for marker effect estimates when APY is used for breeding value models, and (3) show the implication of selecting a specific group of core animals in terms of a marker effects model.

RESULTS

We derived a family of marker effects models called APY-SNP-BLUP. It differs from the classic marker effects model in that the row space of the genotype matrix is reduced and an error term is fitted for non-core animals. We derived formulas for marker effect estimates that take this error term in account. The prediction error variance (PEV) of the marker effect estimates depends on the PEV for core animals but not directly on the PEV of the non-core animals. We extended the APY-SNP-BLUP to include a residual polygenic effect and accommodate non-genotyped animals. We show that selecting a specific group of core animals is equivalent to select a subspace of the row space of the genotype matrix. As the number of core animals increases, subspaces corresponding to different sets of core animals tend to overlap, showing that random selection of core animals is algebraically justified.

CONCLUSIONS

The APY-(ss)GBLUP models can be expressed in terms of marker effect models. When the number of core animals is equal to the rank of the genotype matrix, APY-SNP-BLUP is identical to the classic marker effects model. If the number of core animals is less than the rank of the genotype matrix, genotypes for non-core animals are imputed as a linear combination of the genotypes of the core animals. For estimating SNP effects, only relationships and estimated breeding values for core animals are needed.

摘要

背景

从育种值模型获得的一步法基因组预测需要计算基因组关系矩阵的逆矩阵[公式：见正文]。算法为证明和年轻（APY）用低计算成本创建基因组关系矩阵的稀疏表示[公式：见正文]。APY 由选择一组核心动物组成，并将其余动物的育种值表示为核心动物的线性组合加上误差项。本研究的目的是：（1）将 APY 扩展到标记效应模型；（2）当 APY 用于育种值模型时，推导出标记效应估计的方程，（3）根据标记效应模型显示选择特定核心动物组的影响。

结果

我们推导出了一组称为 APY-SNP-BLUP 的标记效应模型。它与经典标记效应模型的不同之处在于，基因型矩阵的行空间减少，并且为非核心动物拟合误差项。我们推导出了考虑到该误差项的标记效应估计的公式。标记效应估计的预测误差方差（PEV）取决于核心动物的 PEV，但不直接取决于非核心动物的 PEV。我们将 APY-SNP-BLUP 扩展到包括剩余多基因效应，并适应非基因动物。我们表明，选择特定的核心动物组相当于选择基因型矩阵行空间的子空间。随着核心动物数量的增加，不同核心动物组对应的子空间倾向于重叠，这表明随机选择核心动物在代数上是合理的。

结论

APY-(ss)GBLUP 模型可以用标记效应模型表示。当核心动物的数量等于基因型矩阵的秩时，APY-SNP-BLUP 与经典标记效应模型相同。如果核心动物的数量小于基因型矩阵的秩，则非核心动物的基因型被估计为核心动物的基因型的线性组合。对于估计 SNP 效应，只需要核心动物的关系和估计的育种值。

相似文献

On the equivalence between marker effect models and breeding value models and direct genomic values with the Algorithm for Proven and Young.基于 Proven and Young 算法的标记效应模型与育种值模型和直接基因组值的等效性

Genet Sel Evol. 2022 Jul 16;54(1):52. doi: 10.1186/s12711-022-00741-7.

Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?当数据的世代数较少时，采用具有成熟和年轻算法的一步法基因组 REML 是否更具计算效率？

J Anim Sci. 2022 May 1;100(5). doi: 10.1093/jas/skac082.

Efficient approximation of reliabilities for single-step genomic best linear unbiased predictor models with the Algorithm for Proven and Young.利用 Proven 和 Young 算法对单步基因组最佳线性无偏预测模型进行可靠性的有效逼近。

J Anim Sci. 2022 Jan 1;100(1). doi: 10.1093/jas/skab353.

An efficient exact method to obtain GBLUP and single-step GBLUP when the genomic relationship matrix is singular.当基因组关系矩阵为奇异矩阵时，一种获取广义贝叶斯线性无偏预测（GBLUP）和单步GBLUP的高效精确方法。

Genet Sel Evol. 2016 Oct 27;48(1):80. doi: 10.1186/s12711-016-0260-7.

Leveraging low-density crossbred genotypes to offset crossbred phenotypes and their impact on purebred predictions.利用低密度杂交基因型来抵消杂交表型及其对纯种预测的影响。

J Anim Sci. 2022 Dec 1;100(12). doi: 10.1093/jas/skac359.

The quality of the algorithm for proven and young with various sets of core animals in a multibreed sheep population1.在一个多品种绵羊群体中，针对不同核心动物集合的成熟和年轻算法的质量。1

J Anim Sci. 2019 Mar 1;97(3):1090-1100. doi: 10.1093/jas/skz010.

Core-dependent changes in genomic predictions using the Algorithm for Proven and Young in single-step genomic best linear unbiased prediction.利用算法进行一步法基因组最佳线性无偏预测时基于核心的基因组预测变化。

J Anim Sci. 2020 Dec 1;98(12). doi: 10.1093/jas/skaa374.

Technical note: Equivalent genomic models with a residual polygenic effect.技术说明：具有残留多基因效应的等效基因组模型。

J Dairy Sci. 2016 Mar;99(3):2016-2025. doi: 10.3168/jds.2015-10394. Epub 2015 Dec 24.

Indirect predictions with a large number of genotyped animals using the algorithm for proven and young.使用经过验证和年轻的算法对大量基因分型动物进行间接预测。

J Anim Sci. 2020 Jun 1;98(6). doi: 10.1093/jas/skaa154.

Implementation of genomic recursions in single-step genomic best linear unbiased predictor for US Holsteins with a large number of genotyped animals.在具有大量基因分型动物的美国荷斯坦奶牛单步基因组最佳线性无偏预测器中实施基因组递归。

J Dairy Sci. 2016 Mar;99(3):1968-1974. doi: 10.3168/jds.2015-10540. Epub 2016 Jan 21.

引用本文的文献

Megavariate methods capture complex genotype-by-environment interactions.多变量方法能够捕捉复杂的基因与环境的相互作用。

Genetics. 2025 Apr 17;229(4). doi: 10.1093/genetics/iyae179.

Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations.在大型基因分型人群中，使用经过验证和新兴的算法进行单步 GWAS 的标记效应 p 值。

Genet Sel Evol. 2024 Aug 22;56(1):59. doi: 10.1186/s12711-024-00925-3.

Reviewing the definition of mortality in broiler chickens and its implications in genomic evaluations.审查肉鸡死亡率的定义及其在基因组评估中的意义。

J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae190.

Temporal dynamics of genetic parameters and SNP effects for performance and disorder traits in poultry undergoing genomic selection.家禽基因组选择过程中生产性能和疾病性状的遗传参数及单核苷酸多态性效应的时间动态变化

J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae097.

Short Communication: Reduced GBLUP equations to core animals in the algorithm for proven and young (APY).简短通讯：在经产和青年动物算法（APY）中简化针对核心动物的基因组最佳线性无偏预测（GBLUP）方程

Vet Anim Sci. 2024 Jan 4;23:100334. doi: 10.1016/j.vas.2024.100334. eCollection 2024 Mar.

Derivation of indirect predictions using genomic recursions across generations in a broiler population.利用肉鸡群体跨世代的基因组递归进行间接预测的推导。

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad355.

Efficient ways to combine data from broiler and layer chickens to account for sequential genomic selection.肉鸡和蛋鸡数据的有效组合方法，以实现序贯基因组选择。

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad177.

Extension of the reduced animal model to single-step methods.将简化的动物模型扩展到单步方法。

J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skac272.

本文引用的文献

Convergence behavior of single-step GBLUP and SNPBLUP for different termination criteria.不同终止准则下单步 GBLUP 和 SNPBLUP 的收敛行为。

Genet Sel Evol. 2021 Apr 9;53(1):34. doi: 10.1186/s12711-021-00626-1.

J Anim Sci. 2020 Dec 1;98(12). doi: 10.1093/jas/skaa374.

Approximate Genome-Based Kernel Models for Large Data Sets Including Main Effects and Interactions.适用于包含主效应和交互作用的大数据集的近似基于基因组的核模型。

Front Genet. 2020 Oct 15;11:567757. doi: 10.3389/fgene.2020.567757. eCollection 2020.

Indirect predictions with a large number of genotyped animals using the algorithm for proven and young.使用经过验证和年轻的算法对大量基因分型动物进行间接预测。

J Anim Sci. 2020 Jun 1;98(6). doi: 10.1093/jas/skaa154.

Current status of genomic evaluation.基因组评估的现状。

J Anim Sci. 2020 Apr 1;98(4). doi: 10.1093/jas/skaa101.

Using Monte Carlo method to include polygenic effects in calculation of SNP-BLUP model reliability.使用蒙特卡罗方法将多基因效应纳入 SNP-BLUP 模型可靠性计算中。

J Dairy Sci. 2020 Jun;103(6):5170-5182. doi: 10.3168/jds.2019-17255. Epub 2020 Apr 3.

A second-level diagonal preconditioner for single-step SNPBLUP.单步 SNPBLUP 的二级对角预处理子。

Genet Sel Evol. 2019 Jun 25;51(1):30. doi: 10.1186/s12711-019-0472-8.

Deflated preconditioned conjugate gradient method for solving single-step BLUP models efficiently.高效求解单步 BLUP 模型的瘪预处理共轭梯度法。

Genet Sel Evol. 2018 Nov 3;50(1):51. doi: 10.1186/s12711-018-0429-3.

Sparse single-step genomic BLUP in crossbreeding schemes.杂交方案中的稀疏单步基因组 BLUP。

J Anim Sci. 2018 Jun 4;96(6):2060-2073. doi: 10.1093/jas/sky136.

Large-scale genomic prediction using singular value decomposition of the genotype matrix.基于基因型矩阵奇异值分解的大规模基因组预测。

Genet Sel Evol. 2018 Feb 28;50(1):6. doi: 10.1186/s12711-018-0373-2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验