• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组序列数据在跨品种基因组预测中的应用。

Utility of whole-genome sequence data for across-breed genomic prediction.

机构信息

Animal Breeding and Genomics, Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.

Biometris, Wageningen University and Research, 6700 AA, Wageningen, The Netherlands.

出版信息

Genet Sel Evol. 2018 May 18;50(1):27. doi: 10.1186/s12711-018-0396-8.

DOI:10.1186/s12711-018-0396-8
PMID:29776327
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5960108/
Abstract

BACKGROUND

Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction.

METHODS

Estimated breeding values for stature and bovine high-density (HD) genotypes were available for 595 Jersey bulls from New Zealand, 957 Holstein bulls from New Zealand and 5553 Holstein bulls from the Netherlands. BovineHD genotypes for all bulls were imputed to WGS using Beagle4 and Minimac2. Genomic prediction across the three populations was performed with ASReml4, with each population used as single reference and as single validation sets. In addition to the 50k, HD and WGS, markers that were significantly associated with stature in a large meta-GWAS analysis were selected and used for prediction, resulting in 10 prediction scenarios. Furthermore, we estimated the proportion of genetic variance captured by markers in each scenario.

RESULTS

Across breeds, 50k, HD and WGS markers resulted in very low accuracies of prediction ranging from - 0.04 to 0.13. Accuracies were higher in scenarios with pre-selected markers from a meta-GWAS. For example, using only the 133 most significant markers in 133 QTL regions from the meta-GWAS yielded accuracies ranging from 0.08 to 0.23, while 23,125 markers with a - log10(p) higher than 7 resulted in accuracies of up 0.35. Using WGS data did not significantly improve the proportion of genetic variance captured across breeds compared to scenarios with few but pre-selected markers.

CONCLUSIONS

Our results demonstrated that the accuracy of across-breed GP can be improved by using markers that are pre-selected from WGS based on their potential causal effect. We also showed that simply increasing the number of markers up to the WGS level does not increase the accuracy of across-breed prediction, even when markers that are expected to have a causal effect are included.

摘要

背景

跨品种的基因组预测(GP)迄今为止导致预测基因组育种值的准确性较低。我们的目的是评估使用全基因组序列(WGS)代替低密度标记是否可以提高跨品种的 GP,特别是当标记是从全基因组关联研究(GWAS)中预先选择时,并检验我们的假设,即 WGS 数据中的许多非因果标记对跨品种预测的准确性有稀释作用。

方法

为来自新西兰的 595 头泽西公牛、来自新西兰的 957 头荷斯坦公牛和来自荷兰的 5553 头荷斯坦公牛提供了体型和牛高密度(HD)基因型的估计育种值。使用 Beagle4 和 Minimac2 将所有公牛的 BovineHD 基因型导入 WGS。使用 ASReml4 在三个群体中进行基因组预测,每个群体均用作单一参考和单一验证集。除了 50k、HD 和 WGS 外,还从大型元 GWAS 分析中选择与体型显著相关的标记并用于预测,共产生了 10 种预测方案。此外,我们还估计了每种方案中标记捕获的遗传方差比例。

结果

跨品种,50k、HD 和 WGS 标记的预测准确性非常低,范围从-0.04 到 0.13。在来自元 GWAS 的预先选择标记的方案中,准确性更高。例如,仅使用元 GWAS 中 133 个 QTL 区域的 133 个最显著标记,可产生 0.08 到 0.23 的准确性,而 -log10(p) 值高于 7 的 23,125 个标记则可产生高达 0.35 的准确性。与使用少量但预先选择的标记的方案相比,使用 WGS 数据并不能显著提高跨品种遗传方差的捕获比例。

结论

我们的结果表明,通过使用基于潜在因果效应从 WGS 中预先选择的标记,可以提高跨品种 GP 的准确性。我们还表明,仅增加标记数量到 WGS 水平并不会增加跨品种预测的准确性,即使包含了预期具有因果效应的标记。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb92/5960108/df0a940a87f7/12711_2018_396_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb92/5960108/df0a940a87f7/12711_2018_396_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb92/5960108/df0a940a87f7/12711_2018_396_Fig1_HTML.jpg

相似文献

1
Utility of whole-genome sequence data for across-breed genomic prediction.全基因组序列数据在跨品种基因组预测中的应用。
Genet Sel Evol. 2018 May 18;50(1):27. doi: 10.1186/s12711-018-0396-8.
2
Efficiency of multi-breed genomic selection for dairy cattle breeds with different sizes of reference population.不同参考群体规模的奶牛品种多品种基因组选择效率
J Dairy Sci. 2014;97(6):3918-29. doi: 10.3168/jds.2013-7761. Epub 2014 Apr 3.
3
Use of meta-analyses and joint analyses to select variants in whole genome sequences for genomic evaluation: An application in milk production of French dairy cattle breeds.使用荟萃分析和联合分析选择全基因组序列中的变异进行基因组评估:在法国奶牛品种的牛奶生产中的应用。
J Dairy Sci. 2018 Apr;101(4):3126-3139. doi: 10.3168/jds.2017-13587. Epub 2018 Feb 7.
4
Using imputed whole-genome sequence data to improve the accuracy of genomic prediction for parasite resistance in Australian sheep.利用估算的全基因组序列数据提高澳大利亚绵羊寄生虫抗性的基因组预测准确性。
Genet Sel Evol. 2019 Jun 26;51(1):32. doi: 10.1186/s12711-019-0476-4.
5
Genomic prediction for numerically small breeds, using models with pre-selected and differentially weighted markers.基于预选择和差异化加权标记模型的数值较小品种的基因组预测。
Genet Sel Evol. 2018 Oct 10;50(1):49. doi: 10.1186/s12711-018-0419-5.
6
Empirical and deterministic accuracies of across-population genomic prediction.跨群体基因组预测的经验性和确定性准确性。
Genet Sel Evol. 2015 Feb 6;47(1):5. doi: 10.1186/s12711-014-0086-0.
7
Use of a Bayesian model including QTL markers increases prediction reliability when test animals are distant from the reference population.当测验动物与参考群体相距较远时,使用包含 QTL 标记的贝叶斯模型可以提高预测的可靠性。
J Dairy Sci. 2019 Aug;102(8):7237-7247. doi: 10.3168/jds.2018-15815. Epub 2019 May 31.
8
Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations.基于澳大利亚绵羊群体中推断的全基因组序列数据中选择的变体的基因组预测。
Genet Sel Evol. 2019 Dec 5;51(1):72. doi: 10.1186/s12711-019-0514-2.
9
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle.利用推算的全基因组序列数据对荷斯坦奶牛进行基因组预测。
Genet Sel Evol. 2015 Sep 17;47(1):71. doi: 10.1186/s12711-015-0149-x.
10
Accuracy of genomic breeding values in multi-breed dairy cattle populations.多品种奶牛群体中基因组育种值的准确性。
Genet Sel Evol. 2009 Nov 24;41(1):51. doi: 10.1186/1297-9686-41-51.

引用本文的文献

1
Improved genomic prediction accuracy by genetic relatedness using a crossbred pig population.利用杂交猪群体的遗传相关性提高基因组预测准确性。
Transl Anim Sci. 2025 Jul 12;9:txaf095. doi: 10.1093/tas/txaf095. eCollection 2025.
2
Variable selection strategies for genomic prediction of growth and carcass related traits in experimental Nellore cattle herds under different selection criteria.不同选择标准下实验内洛尔牛群生长和胴体相关性状基因组预测的变量选择策略
Sci Rep. 2025 Jul 1;15(1):22266. doi: 10.1038/s41598-025-06949-z.
3
Integrating multi-omics and machine learning for disease resistance prediction in legumes.

本文引用的文献

1
Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals.牛体尺全基因组关联研究的荟萃分析鉴定出调控哺乳动物体型的常见基因。
Nat Genet. 2018 Mar;50(3):362-367. doi: 10.1038/s41588-018-0056-5. Epub 2018 Feb 19.
2
Genomic prediction using preselected DNA variants from a GWAS with whole-genome sequence data in Holstein-Friesian cattle.在荷斯坦-弗里生奶牛中,利用全基因组序列数据,从全基因组关联研究(GWAS)中预先选择的DNA变异进行基因组预测。
Genet Sel Evol. 2016 Dec 1;48(1):95. doi: 10.1186/s12711-016-0274-1.
3
Efficient genomic prediction based on whole-genome sequence data using split-and-merge Bayesian variable selection.
整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
4
Role of multi-omics in aquaculture genetics and breeding: current status and future perspective.多组学在水产养殖遗传与育种中的作用:现状与未来展望
Sci China Life Sci. 2025 May 28. doi: 10.1007/s11427-024-2828-8.
5
Genome-wide association studies of milk composition traits in indicine Badri cattle using ddRAD sequencing approach.利用ddRAD测序方法对印度瘤牛Badri牛的乳成分性状进行全基因组关联研究。
Trop Anim Health Prod. 2024 Dec 24;57(1):10. doi: 10.1007/s11250-024-04266-7.
6
Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets.用于基因组预测的单倍型模块:在多个作物数据集中的比较评估
Front Plant Sci. 2023 Sep 5;14:1217589. doi: 10.3389/fpls.2023.1217589. eCollection 2023.
7
Variance component estimates for growth traits in beef cattle using selected variants from imputed low-pass sequence data.利用低深度测序数据中已选变异体估计肉牛生长性状的方差组分。
J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad274.
8
Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs.利用猪全基因组序列数据中的预选标记进行多线ssGBLUP评估。
Front Genet. 2023 May 12;14:1163626. doi: 10.3389/fgene.2023.1163626. eCollection 2023.
9
Genomic evaluation with multibreed and crossbred data.利用多品种和杂交数据进行基因组评估。
JDS Commun. 2022 Jan 10;3(2):156-159. doi: 10.3168/jdsc.2021-0177. eCollection 2022 Mar.
10
A divide-and-conquer approach for genomic prediction in rubber tree using machine learning.基于机器学习的橡胶树基因组预测的分而治之方法。
Sci Rep. 2022 Oct 26;12(1):18023. doi: 10.1038/s41598-022-20416-z.
基于全基因组序列数据,使用拆分合并贝叶斯变量选择的高效基因组预测。
Genet Sel Evol. 2016 Jun 29;48(1):49. doi: 10.1186/s12711-016-0225-x.
4
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study.利用与致病突变处于连锁不平衡状态的序列变异来改进奶牛的跨品种预测:一项模拟研究。
G3 (Bethesda). 2016 Aug 9;6(8):2553-61. doi: 10.1534/g3.116.027730.
5
Validation of simultaneous deregression of cow and bull breeding values and derivation of appropriate weights.奶牛和公牛育种值同时去回归的验证及适当权重的推导
J Dairy Sci. 2016 Aug;99(8):6403-6419. doi: 10.3168/jds.2016-11028. Epub 2016 May 18.
6
An Equation to Predict the Accuracy of Genomic Values by Combining Data from Multiple Traits, Populations, or Environments.一种通过整合多个性状、群体或环境的数据来预测基因组值准确性的方程。
Genetics. 2016 Feb;202(2):799-823. doi: 10.1534/genetics.115.183269. Epub 2015 Dec 4.
7
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle.利用推算的全基因组序列数据对荷斯坦奶牛进行基因组预测。
Genet Sel Evol. 2015 Sep 17;47(1):71. doi: 10.1186/s12711-015-0149-x.
8
Using selection index theory to estimate consistency of multi-locus linkage disequilibrium across populations.利用选择指数理论估计多基因座连锁不平衡在不同群体间的一致性。
BMC Genet. 2015 Jul 19;16:87. doi: 10.1186/s12863-015-0252-6.
9
Genomic prediction based on data from three layer lines: a comparison between linear methods.基于三层品系数据的基因组预测:线性方法之间的比较
Genet Sel Evol. 2014 Oct 1;46(1):57. doi: 10.1186/s12711-014-0057-5.
10
Dominance genetic variation contributes little to the missing heritability for human complex traits.显性遗传变异对人类复杂性状的“缺失遗传力”贡献不大。
Am J Hum Genet. 2015 Mar 5;96(3):377-85. doi: 10.1016/j.ajhg.2015.01.001. Epub 2015 Feb 12.