• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对相邻 SNP 组的异质(共)方差进行建模可提高牛奶蛋白成分性状的基因组预测能力。

Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.

机构信息

Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Allé 20, P.O. Box 50, 8830, Tjele, Denmark.

Animal Breeding and Genomics Centre, Wageningen University, PO Box 338, 6700 AH, Wageningen, The Netherlands.

出版信息

Genet Sel Evol. 2017 Dec 5;49(1):89. doi: 10.1186/s12711-017-0364-8.

DOI:10.1186/s12711-017-0364-8
PMID:29207947
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5718071/
Abstract

BACKGROUND

Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula.

RESULTS

BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions.

CONCLUSIONS

Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.

摘要

背景

准确的基因组预测需要一个大型的参考群体,但对于那些测量成本较高的性状来说,这是一个问题。与牛奶蛋白组成相关的性状由于成本较高的程序而没有被常规记录,并且被认为是由少数几个具有较大效应的数量性状位点控制的。在不同区域,解释的变异量可能会有所不同,从而导致基因组中(协)方差的异质性模式。能够有效地考虑到(协)方差这种异质性的基因组预测模型可以提高预测的可靠性。在这项研究中,我们开发并实施了新的单变量和双变量贝叶斯预测模型,这些模型是基于对基因组片段(BayesAS)异质(协)方差的估计。可用数据包括对奶牛进行的牛奶蛋白组成性状的测量值和为公牛推导的去回归总蛋白产量的证明。单核苷酸多态性(SNP)来自 50K SNP 数组,被分组为非重叠的基因组片段。一个片段定义为一个 SNP,或者一组 50、100 或 200 个相邻的 SNP,或者一个染色体,或者整个基因组。还运行了传统的单变量和双变量基因组最佳线性无偏预测(GBLUP)模型进行比较。可靠性是通过重采样策略和使用确定性公式计算的。

结果

与 GBLUP 模型相比,BayesAS 模型提高了大多数性状的预测可靠性,而且这种增益取决于片段大小和性状的遗传结构。对于β-CN、κ-CN 和β-LG 等蛋白质组成性状,增益尤其显著,与使用 100-SNP 片段大小的 MT-BayesAS 模型相比,预测可靠性平均提高了 49 个百分点,而与双变量 GBLUP 相比。使用 100-SNP 片段大小的 BayesAS 模型可获得最高的预测可靠性。与单变量版本相比,我们的 BayesAS 模型的双变量版本在预测可靠性方面额外提高了高达 6%。

结论

使用我们的新 BayesAS 模型,与牛奶蛋白组成相关的大多数性状的预测可靠性都有了显著提高。将相邻的 SNP 分组到片段中可以提供增强的信息来估计参数,并且允许片段具有不同的(协)方差有助于解开基因组中的异质(协)方差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/7e529c137e17/12711_2017_364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/62bf3bbd9854/12711_2017_364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/b8e259955bc4/12711_2017_364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/3191f86c5af7/12711_2017_364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/7e529c137e17/12711_2017_364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/62bf3bbd9854/12711_2017_364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/b8e259955bc4/12711_2017_364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/3191f86c5af7/12711_2017_364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4880/5718071/7e529c137e17/12711_2017_364_Fig4_HTML.jpg

相似文献

1
Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.对相邻 SNP 组的异质(共)方差进行建模可提高牛奶蛋白成分性状的基因组预测能力。
Genet Sel Evol. 2017 Dec 5;49(1):89. doi: 10.1186/s12711-017-0364-8.
2
Genomic Prediction Using Multi-trait Weighted GBLUP Accounting for Heterogeneous Variances and Covariances Across the Genome.使用多性状加权基因组最佳线性无偏预测法进行基因组预测,该方法考虑了全基因组的异质方差和协方差。
G3 (Bethesda). 2018 Nov 6;8(11):3549-3558. doi: 10.1534/g3.118.200673.
3
Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances.使用基于不同加权因子构建的基因组关系矩阵来考虑位点特异性方差的基因组预测比较。
J Dairy Sci. 2014 Oct;97(10):6547-59. doi: 10.3168/jds.2014-8210. Epub 2014 Aug 14.
4
Genomic prediction of breeding values using previously estimated SNP variances.利用先前估计的单核苷酸多态性(SNP)方差进行育种值的基因组预测。
Genet Sel Evol. 2014 Sep 25;46(1):52. doi: 10.1186/s12711-014-0052-x.
5
Value of sharing cow reference population between countries on reliability of genomic prediction for milk yield traits.在牛奶产量性状的基因组预测可靠性方面,国家间奶牛参考群体共享的价值。
J Dairy Sci. 2020 Feb;103(2):1711-1728. doi: 10.3168/jds.2019-17170. Epub 2019 Dec 19.
6
Reliability of genomic prediction for milk fatty acid composition by using a multi-population reference and incorporating GWAS results.利用多群体参考和整合 GWAS 结果进行牛奶脂肪酸成分的基因组预测的可靠性。
Genet Sel Evol. 2019 Apr 27;51(1):16. doi: 10.1186/s12711-019-0460-z.
7
The patterns of genomic variances and covariances across genome for milk production traits between Chinese and Nordic Holstein populations.中国和北欧荷斯坦牛群体间产奶性状基因组变异和协方差在全基因组上的模式。
BMC Genet. 2017 Mar 15;18(1):26. doi: 10.1186/s12863-017-0491-9.
8
Use of a Bayesian model including QTL markers increases prediction reliability when test animals are distant from the reference population.当测验动物与参考群体相距较远时,使用包含 QTL 标记的贝叶斯模型可以提高预测的可靠性。
J Dairy Sci. 2019 Aug;102(8):7237-7247. doi: 10.3168/jds.2018-15815. Epub 2019 May 31.
9
Including overseas performance information in genomic evaluations of Australian dairy cattle.将海外生产性能信息纳入澳大利亚奶牛的基因组评估中。
J Dairy Sci. 2015 May;98(5):3443-59. doi: 10.3168/jds.2014-8785. Epub 2015 Mar 12.
10
Multi-trait single-step genomic prediction accounting for heterogeneous (co)variances over the genome.多性状一步基因组预测,同时考虑全基因组异质(协)方差。
Heredity (Edinb). 2020 Feb;124(2):274-287. doi: 10.1038/s41437-019-0273-4. Epub 2019 Oct 22.

引用本文的文献

1
Multitrait genome-wide association best linear unbiased prediction of genetic values.多性状全基因组关联遗传值的最佳线性无偏预测
Genet Sel Evol. 2025 Mar 21;57(1):15. doi: 10.1186/s12711-025-00964-4.
2
Comparative Study of Single-Trait and Multi-Trait Genomic Prediction Models.单性状与多性状基因组预测模型的比较研究
Animals (Basel). 2024 Oct 14;14(20):2961. doi: 10.3390/ani14202961.
3
A breed-of-origin of alleles model that includes crossbred data improves predictive ability for crossbred animals in a multi-breed population.

本文引用的文献

1
Estimation of genetic parameters and detection of chromosomal regions affecting the major milk proteins and their post translational modifications in Danish Holstein and Danish Jersey cattle.估计丹麦荷斯坦牛和丹麦泽西牛主要乳蛋白及其翻译后修饰的遗传参数,并检测影响这些蛋白的染色体区域。
BMC Genet. 2016 Aug 2;17:114. doi: 10.1186/s12863-016-0421-2.
2
Short communication: Multi-trait estimation of genetic parameters for milk protein composition in the Danish Holstein.简短通讯:丹麦荷斯坦奶牛乳蛋白组成遗传参数的多性状估计
J Dairy Sci. 2016 Apr;99(4):2863-2866. doi: 10.3168/jds.2015-10501. Epub 2016 Jan 21.
3
Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs.
包含杂交数据的等位基因起源模型可提高多品种群体中杂交动物的预测能力。
Genet Sel Evol. 2023 May 15;55(1):34. doi: 10.1186/s12711-023-00806-1.
4
Fitting Genomic Prediction Models with Different Marker Effects among Prefectures to Carcass Traits in Japanese Black Cattle.利用不同地区标记效应拟合基因组预测模型以预测日本黑牛胴体性状
Genes (Basel). 2022 Dec 22;14(1):24. doi: 10.3390/genes14010024.
5
Vitamin B and transcobalamin in bovine milk: Genetic variation and genome-wide association with loci along the genome.牛乳中的维生素B和转钴胺素:基因变异及全基因组范围内与基因组位点的关联
JDS Commun. 2021 Mar 12;2(3):127-131. doi: 10.3168/jdsc.2020-0048. eCollection 2021 May.
6
Genomic prediction using a reference population of multiple pure breeds and admixed individuals.使用多个纯品种和杂交个体的参考群体进行基因组预测。
Genet Sel Evol. 2021 May 31;53(1):46. doi: 10.1186/s12711-021-00637-y.
7
Quantitative LC-MS/MS analysis of high-value milk proteins in Danish Holstein cows.丹麦荷斯坦奶牛高价值乳蛋白的定量液相色谱-串联质谱分析
Heliyon. 2020 Sep 16;6(9):e04620. doi: 10.1016/j.heliyon.2020.e04620. eCollection 2020 Sep.
8
Genome-wide association study on Fourier transform infrared milk spectra for two Danish dairy cattle breeds.基于傅里叶变换红外牛奶光谱的两个丹麦奶牛品种全基因组关联研究。
BMC Genet. 2020 Jan 31;21(1):9. doi: 10.1186/s12863-020-0810-4.
9
Multi-trait single-step genomic prediction accounting for heterogeneous (co)variances over the genome.多性状一步基因组预测,同时考虑全基因组异质(协)方差。
Heredity (Edinb). 2020 Feb;124(2):274-287. doi: 10.1038/s41437-019-0273-4. Epub 2019 Oct 22.
10
Reliability of genomic prediction for milk fatty acid composition by using a multi-population reference and incorporating GWAS results.利用多群体参考和整合 GWAS 结果进行牛奶脂肪酸成分的基因组预测的可靠性。
Genet Sel Evol. 2019 Apr 27;51(1):16. doi: 10.1186/s12711-019-0460-z.
利用包含纯种丹麦杜洛克猪数量性状基因座区域先验信息的基因组特征模型提高预测准确性。
BMC Genet. 2016 Jan 5;17:11. doi: 10.1186/s12863-015-0322-9.
4
Performance of a blockwise approach in variable selection using linkage disequilibrium information.使用连锁不平衡信息进行变量选择时的分块方法性能。
BMC Bioinformatics. 2015 May 8;16:148. doi: 10.1186/s12859-015-0556-6.
5
The occurrence of noncoagulating milk and the association of bovine milk coagulation properties with genetic variants of the caseins in 3 Scandinavian dairy breeds.在 3 个斯堪的纳维亚奶牛品种中,不凝固牛奶的发生与乳清蛋白遗传变异的关系及牛奶奶凝特性。
J Dairy Sci. 2013 Aug;96(8):4830-42. doi: 10.3168/jds.2012-6422. Epub 2013 Jun 5.
6
Genome position specific priors for genomic prediction.基因组位置特异性先验信息在基因组预测中的应用。
BMC Genomics. 2012 Oct 10;13:543. doi: 10.1186/1471-2164-13-543.
7
Distinct composition of bovine milk from Jersey and Holstein-Friesian cows with good, poor, or noncoagulation properties as reflected in protein genetic variants and isoforms.具有良好、较差或非凝结特性的泽西牛和荷斯坦弗里生牛乳的蛋白质遗传变异体和同工型的组成明显不同。
J Dairy Sci. 2012 Dec;95(12):6905-17. doi: 10.3168/jds.2012-5675. Epub 2012 Oct 3.
8
Estimating additive and non-additive genetic variances and predicting genetic merits using genome-wide dense single nucleotide polymorphism markers.利用全基因组高密度单核苷酸多态性标记估计加性和非加性遗传方差及预测遗传优势。
PLoS One. 2012;7(9):e45293. doi: 10.1371/journal.pone.0045293. Epub 2012 Sep 13.
9
A two step Bayesian approach for genomic prediction of breeding values.一种用于育种值基因组预测的两步贝叶斯方法。
BMC Proc. 2012 May 21;6 Suppl 2(Suppl 2):S12. doi: 10.1186/1753-6561-6-S2-S12.
10
Accuracy of multi-trait genomic selection using different methods.基于不同方法的多性状基因组选择的准确性。
Genet Sel Evol. 2011 Jul 5;43(1):26. doi: 10.1186/1297-9686-43-26.