Fast genomic predictions via Bayesian G-BLUP and multilocus models of threshold traits including censored Gaussian data.

Affiliation

Department of Agricultural Sciences, University of Helsinki, Helsinki FIN-00014.

Publication information

G3 (Bethesda). 2013 Sep 4;3(9):1511-23. doi: 10.1534/g3.113.007096.

DOI:10.1534/g3.113.007096

PMID:23821618

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3755911/

Abstract

Because of the increased availability of genome-wide sets of molecular markers, along with the reduced cost of genotyping large samples of individuals, genomic estimated breeding values have become an essential resource in plant and animal breeding. Bayesian methods for breeding value estimation have proven to be accurate and efficient; however, ever-larger data sets are placing heavy demands on the parameter estimation algorithms. Although a commendable number of fast estimation algorithms are available for Bayesian models of continuous Gaussian traits, such algorithms are scarce for the corresponding models of discrete or censored phenotypes. In this work, we consider a threshold approach to binary, ordinal, and censored Gaussian observations for Bayesian multilocus association models and Bayesian genomic best linear unbiased prediction, and we present a high-speed generalized expectation maximization algorithm for parameter estimation under these models. We demonstrate our method with simulated and real data. Our example analyses suggest that using the extra information present in an ordered categorical or censored Gaussian data set, instead of dichotomizing the data into case-control observations, increases the accuracy of genomic breeding values predicted by Bayesian multilocus association models or by Bayesian genomic best linear unbiased prediction. Furthermore, the example analyses indicate that the correct threshold model is more accurate than a directly applied Gaussian model for censored Gaussian data, whereas for binary or ordinal data the superiority of the threshold model could not be confirmed.
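The threshold idea in the abstract can be illustrated with a minimal sketch. It assumes the standard liability formulation, in which a latent Gaussian variable is observed only through the interval it falls into, so an EM-type algorithm needs the expected latent value given the observed category or censoring point. The function names, the unit residual variance for categorical data, and the right-censoring convention are illustrative assumptions; this is not the authors' generalized EM algorithm, only the truncated-normal expectation that such threshold models commonly rely on.

```python
import math


def norm_pdf(x: float) -> float:
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def norm_cdf(x: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def truncated_normal_mean(mu, sigma, lower=-math.inf, upper=math.inf):
    """Mean of N(mu, sigma^2) restricted to the interval (lower, upper)."""
    a = (lower - mu) / sigma
    b = (upper - mu) / sigma
    mass = norm_cdf(b) - norm_cdf(a)  # probability of the interval
    return mu + sigma * (norm_pdf(a) - norm_pdf(b)) / mass


def expected_liability(y, mu, thresholds):
    """Expected latent liability for ordinal category y in 0..K-1.

    Assumes unit residual variance (hypothetical convention); a binary
    trait is the special case of a single threshold.
    """
    bounds = [-math.inf, *thresholds, math.inf]
    return truncated_normal_mean(mu, 1.0, bounds[y], bounds[y + 1])


def expected_censored(value, censored, mu, sigma):
    """Expected phenotype for a right-censored Gaussian observation.

    An uncensored value is used as observed; a censored one is replaced
    by the mean of the Gaussian truncated below at the censoring point.
    """
    if not censored:
        return value
    return truncated_normal_mean(mu, sigma, lower=value)


if __name__ == "__main__":
    # Binary trait, linear predictor 0, threshold 0.
    print(expected_liability(1, 0.0, [0.0]))
    # Observation right-censored at 2.0 under N(1.0, 1.5^2).
    print(expected_censored(2.0, True, 1.0, 1.5))
```

In this sketch an ordinal or censored record narrows the interval for the latent value more than a dichotomized record does, which is one way to read the abstract's point about the extra information in such data. The naive tail computation is not numerically robust when the interval has almost no probability mass.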

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/629f/3755911/05c4eaef3db1/1511f1.jpg
