• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于贝叶斯基因组的预测模型中对先验设定的敏感性。

Sensitivity to prior specification in Bayesian genome-based prediction models.

作者信息

Lehermeier Christina, Wimmer Valentin, Albrecht Theresa, Auinger Hans-Jürgen, Gianola Daniel, Schmid Volker J, Schön Chris-Carolin

机构信息

Plant Breeding, Technische Universität München, Emil-Ramann-Straße 4, 85354 Freising, Germany.

出版信息

Stat Appl Genet Mol Biol. 2013 Jun;12(3):375-91. doi: 10.1515/sagmb-2012-0042.

DOI:10.1515/sagmb-2012-0042
PMID:23629460
Abstract

Different statistical models have been proposed for maximizing prediction accuracy in genome-based prediction of breeding values in plant and animal breeding. However, little is known about the sensitivity of these models with respect to prior and hyperparameter specification, because comparisons of prediction performance are mainly based on a single set of hyperparameters. In this study, we focused on Bayesian prediction methods using a standard linear regression model with marker covariates coding additive effects at a large number of marker loci. By comparing different hyperparameter settings, we investigated the sensitivity of four methods frequently used in genome-based prediction (Bayesian Ridge, Bayesian Lasso, BayesA and BayesB) to specification of the prior distribution of marker effects. We used datasets simulated according to a typical maize breeding program differing in the number of markers and the number of simulated quantitative trait loci affecting the trait. Furthermore, we used an experimental maize dataset, comprising 698 doubled haploid lines, each genotyped with 56110 single nucleotide polymorphism markers and phenotyped as testcrosses for the two quantitative traits grain dry matter yield and grain dry matter content. The predictive ability of the different models was assessed by five-fold cross-validation. The extent of Bayesian learning was quantified by calculation of the Hellinger distance between the prior and posterior densities of marker effects. Our results indicate that similar predictive abilities can be achieved with all methods, but with BayesA and BayesB hyperparameter settings had a stronger effect on prediction performance than with the other two methods. Prediction performance of BayesA and BayesB suffered substantially from a non-optimal choice of hyperparameters.

摘要

在植物和动物育种中基于基因组的育种值预测方面,已经提出了不同的统计模型来最大化预测准确性。然而,对于这些模型相对于先验和超参数设定的敏感性知之甚少,因为预测性能的比较主要基于单一的超参数集。在本研究中,我们聚焦于使用标准线性回归模型的贝叶斯预测方法,该模型在大量标记位点对加性效应进行标记协变量编码。通过比较不同的超参数设置,我们研究了基于基因组预测中常用的四种方法(贝叶斯岭回归、贝叶斯套索回归、贝叶斯A和贝叶斯B)对标记效应先验分布设定的敏感性。我们使用了根据典型玉米育种计划模拟的数据集,这些数据集在标记数量和影响性状的模拟数量性状位点数量上有所不同。此外,我们使用了一个实验玉米数据集,该数据集包含698个双单倍体系,每个系用56110个单核苷酸多态性标记进行基因分型,并作为两个数量性状(籽粒干物质产量和籽粒干物质含量)的测交进行表型分析。通过五重交叉验证评估不同模型的预测能力。通过计算标记效应的先验密度和后验密度之间的海林格距离来量化贝叶斯学习的程度。我们的结果表明,所有方法都能实现相似的预测能力,但贝叶斯A和贝叶斯B的超参数设置对预测性能的影响比其他两种方法更强。贝叶斯A和贝叶斯B的预测性能因超参数选择不当而大幅受损。

相似文献

1
Sensitivity to prior specification in Bayesian genome-based prediction models.基于贝叶斯基因组的预测模型中对先验设定的敏感性。
Stat Appl Genet Mol Biol. 2013 Jun;12(3):375-91. doi: 10.1515/sagmb-2012-0042.
2
Improving the computational efficiency of fully Bayes inference and assessing the effect of misspecification of hyperparameters in whole-genome prediction models.提高全贝叶斯推断的计算效率并评估全基因组预测模型中超参数误设的影响。
Genet Sel Evol. 2015 Mar 7;47(1):13. doi: 10.1186/s12711-015-0092-x.
3
Comparison of Genomic Selection Models to Predict Flowering Time and Spike Grain Number in Two Hexaploid Wheat Doubled Haploid Populations.预测两个六倍体小麦双单倍体群体开花时间和穗粒数的基因组选择模型比较
G3 (Bethesda). 2015 Jul 22;5(10):1991-8. doi: 10.1534/g3.115.019745.
4
Using Bayesian Multilevel Whole Genome Regression Models for Partial Pooling of Training Sets in Genomic Prediction.使用贝叶斯多级全基因组回归模型对基因组预测中的训练集进行部分合并
G3 (Bethesda). 2015 May 29;5(8):1603-12. doi: 10.1534/g3.115.019299.
5
Improving resistance to the European corn borer: a comprehensive study in elite maize using QTL mapping and genome-wide prediction.提高玉米抗欧洲玉米螟能力:利用 QTL 作图和全基因组预测对优秀玉米进行的综合研究。
Theor Appl Genet. 2015 May;128(5):875-91. doi: 10.1007/s00122-015-2477-1. Epub 2015 Mar 11.
6
Resource allocation for maximizing prediction accuracy and genetic gain of genomic selection in plant breeding: a simulation experiment.最大化植物育种中基因组选择预测准确性和遗传增益的资源分配:模拟实验。
G3 (Bethesda). 2013 Mar;3(3):481-91. doi: 10.1534/g3.112.004911. Epub 2013 Mar 1.
7
Estimation of physiological genomic estimated breeding values (PGEBV) combining full hyperspectral and marker data across environments for grain yield under combined heat and drought stress in tropical maize (Zea mays L.).在热带玉米(Zea mays L.)中,结合全高光谱和标记数据,估算在热旱胁迫下的籽粒产量的生理基因组估计育种值(PGEBV)。
PLoS One. 2019 Mar 20;14(3):e0212200. doi: 10.1371/journal.pone.0212200. eCollection 2019.
8
Genome-wide prediction of maize single-cross performance, considering non-additive genetic effects.考虑非加性遗传效应的玉米单交种表现的全基因组预测
Genet Mol Res. 2015 Dec 28;14(4):18471-84. doi: 10.4238/2015.December.23.35.
9
Genome-based prediction of maize hybrid performance across genetic groups, testers, locations, and years.基于基因组的玉米杂种表现预测,跨越遗传群体、测验种、地点和年份。
Theor Appl Genet. 2014 Jun;127(6):1375-86. doi: 10.1007/s00122-014-2305-z. Epub 2014 Apr 11.
10
Comparison between genomic predictions using daughter yield deviation and conventional estimated breeding value as response variables.基于女儿产量偏差和常规估计育种值作为反应变量的基因组预测比较。
J Anim Breed Genet. 2010 Dec;127(6):423-32. doi: 10.1111/j.1439-0388.2010.00878.x.

引用本文的文献

1
Dissection of the practical soybean breeding pipeline by developing ZDX1, a high-throughput functional array.通过开发高通量功能阵列 ZDX1 对实用大豆育种管道进行剖析。
Theor Appl Genet. 2022 Apr;135(4):1413-1427. doi: 10.1007/s00122-022-04043-w. Epub 2022 Feb 21.
2
Validation of the Prediction Accuracy for 13 Traits in Chinese Simmental Beef Cattle Using a Preselected Low-Density SNP Panel.使用预先选择的低密度单核苷酸多态性(SNP)面板验证中国西门塔尔牛13个性状的预测准确性
Animals (Basel). 2021 Jun 25;11(7):1890. doi: 10.3390/ani11071890.
3
A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits.
用于基于基因组的复杂性状分析与预测的多性状贝叶斯套索法
Genetics. 2020 Feb;214(2):305-331. doi: 10.1534/genetics.119.302934. Epub 2019 Dec 26.
4
A robust Bayesian genome-based median regression model.一种稳健的基于基因组的贝叶斯中位数回归模型。
Theor Appl Genet. 2019 May;132(5):1587-1606. doi: 10.1007/s00122-019-03303-6. Epub 2019 Feb 12.
5
Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction.在基因组预测中不进行交叉验证的交叉验证
G3 (Bethesda). 2016 Oct 13;6(10):3107-3128. doi: 10.1534/g3.116.033381.
6
Walking through the statistical black boxes of plant breeding.穿越植物育种的统计黑箱。
Theor Appl Genet. 2016 Oct;129(10):1933-49. doi: 10.1007/s00122-016-2750-y. Epub 2016 Jul 19.
7
Assessing Predictive Properties of Genome-Wide Selection in Soybeans.评估大豆全基因组选择的预测特性。
G3 (Bethesda). 2016 Aug 9;6(8):2611-6. doi: 10.1534/g3.116.032268.
8
Inferential considerations for low-count RNA-seq transcripts: a case study on the dominant prairie grass Andropogon gerardii.低计数RNA测序转录本的推断考量:以优势草原草种糙毛须芒草为例的研究
BMC Genomics. 2016 Feb 27;17:140. doi: 10.1186/s12864-016-2442-7.
9
Impact of imputation methods on the amount of genetic variation captured by a single-nucleotide polymorphism panel in soybeans.插补方法对大豆单核苷酸多态性面板捕获的遗传变异量的影响。
BMC Bioinformatics. 2016 Feb 2;17:55. doi: 10.1186/s12859-016-0899-7.
10
Improving accuracy of genomic prediction by genetic architecture based priors in a Bayesian model.在贝叶斯模型中通过基于遗传结构的先验信息提高基因组预测的准确性。
BMC Genet. 2015 Oct 14;16:120. doi: 10.1186/s12863-015-0278-9.