Prediction of Complex Traits: Robust Alternatives to Best Linear Unbiased Prediction

Authors

Gianola Daniel, Cecchinato Alessio, Naya Hugo, Schön Chris-Carolin

Affiliations

Department of Animal Sciences, University of Wisconsin-Madison, Madison, WI, United States.

Department of Dairy Science, University of Wisconsin-Madison, Madison, WI, United States.

Publication information

Front Genet. 2018 Jun 5;9:195. doi: 10.3389/fgene.2018.00195. eCollection 2018.

DOI:10.3389/fgene.2018.00195

PMID:29951082

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6008589/

Abstract

A widely used method for prediction of complex traits in animal and plant breeding is "genomic best linear unbiased prediction" (GBLUP). In a quantitative genetics setting, BLUP is a linear regression of phenotypes on a pedigree or on a genomic relationship matrix, depending on the type of input information available. Normality of the distributions of random effects and of model residuals is not required for BLUP, but a Gaussian assumption is made implicitly. A potential downside is that Gaussian linear regressions are sensitive to outliers, whether genetic or environmental in origin. We present robust alternatives to BLUP that are simple to implement (relative to a fully Bayesian analysis), using a linear model with residual t or Laplace distributions instead of a Gaussian one, and evaluate the methods with milk yield records on Italian Brown Swiss cattle, grain yield data in inbred wheat lines, and three traits measured on accessions of Arabidopsis thaliana. The methods do not use Markov chain Monte Carlo sampling, and model hyper-parameters, viewed here as regularization "knobs," are tuned via cross-validation. Uncertainty of predictions is evaluated by bootstrapping or by random reconstruction of training and testing sets. It was found (e.g., test-day milk yield in cows, flowering time and FRIGIDA expression in Arabidopsis) that the best predictions were often those obtained with the robust methods. The results are encouraging and motivate further investigation and generalization.
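To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' implementation, of a GBLUP-type fit next to a robust fit with Student-t residuals obtained by iteratively reweighted least squares. Everything named here is an assumption for illustration: the functions `genomic_relationship` and `fit_genomic_values`, the simulated genotypes and phenotypes, the values `lam=1.0` and `nu=4.0`, and the crude fixed residual scale taken from the median absolute deviation. In the paper, such hyper-parameters are tuned by cross-validation rather than fixed.

```python
import numpy as np

def genomic_relationship(X):
    """G = Z Z' / p, with Z the column-centered marker matrix (one common convention)."""
    Z = X - X.mean(axis=0)
    return Z @ Z.T / X.shape[1]

def fit_genomic_values(y, G, lam, nu=None, n_iter=100, tol=1e-8):
    """Penalized fit of y = mu + g + e with penalty lam * g' G^{-1} g.

    nu=None  -> Gaussian residuals (GBLUP-type solution, all weights equal to 1).
    nu=float -> Student-t residuals with nu degrees of freedom, fitted by
                iteratively reweighted least squares (a MAP-type estimate).
    """
    n = len(y)
    mu = np.median(y)
    # Crude fixed residual scale from the MAD of y (an assumption of this sketch).
    s2 = (1.4826 * np.median(np.abs(y - mu))) ** 2
    g = np.zeros(n)
    w = np.ones(n)
    for _ in range(n_iter):
        g_old = g
        # Solve (G W + lam I) g = G W (y - mu), which avoids inverting G.
        GW = G * w  # scales column j of G by w[j], i.e. G @ diag(w)
        g = np.linalg.solve(GW + lam * np.eye(n), GW @ (y - mu))
        mu = np.sum(w * (y - g)) / np.sum(w)
        if nu is not None:
            r2 = (y - mu - g) ** 2 / s2
            w = (nu + 1.0) / (nu + r2)  # large residuals receive small weights
        if np.max(np.abs(g - g_old)) < tol:
            break
    return mu, g, w

# Simulated example (hypothetical data): 60 individuals, 200 markers.
rng = np.random.default_rng(1)
n, p = 60, 200
X = rng.binomial(2, 0.5, size=(n, p)).astype(float)
G = genomic_relationship(X)
g_true = (X - X.mean(axis=0)) @ rng.normal(0.0, np.sqrt(1.0 / p), size=p)
y = 10.0 + g_true + rng.normal(0.0, 1.0, size=n)
outliers = np.arange(5)
y[outliers] += 15.0  # contaminate five records with gross errors

mu_gauss, g_gauss, w_gauss = fit_genomic_values(y, G, lam=1.0)
mu_t, g_t, w_t = fit_genomic_values(y, G, lam=1.0, nu=4.0)
print("weights of contaminated records:", np.round(w_t[outliers], 3))
```

In the Gaussian fit every record keeps weight 1, so the contaminated phenotypes pull the predicted genomic values toward them. In the t fit, the weight `(nu + 1) / (nu + r2)` shrinks as the standardized squared residual `r2` grows, which is the mechanism that limits the influence of outliers. A Laplace residual model could be handled in the same loop with a different weight function.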

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f14d/6008589/e6e9fad97b47/fgene-09-00195-g0001.jpg
