• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BWGS:一个基因组选择的 R 包及其在小麦育种计划中的应用。

BWGS: A R package for genomic selection and its application to a wheat breeding programme.

机构信息

INRAE-UCA, UMR GDEC, Clermont-Ferrand, France.

Agri-Obtentions, Ferme de Gauvilliers, Orsonville, France.

出版信息

PLoS One. 2020 Apr 2;15(4):e0222733. doi: 10.1371/journal.pone.0222733. eCollection 2020.

DOI:10.1371/journal.pone.0222733
PMID:32240182
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7141418/
Abstract

We developed an integrated R library called BWGS to enable easy computation of Genomic Estimates of Breeding values (GEBV) for genomic selection. BWGS, for BreedWheat Genomic selection, was developed in the framework of a cooperative private-public partnership project called Breedwheat (https://breedwheat.fr) and relies on existing R-libraries, all freely available from CRAN servers. The two main functions enable to run 1) replicated random cross validations within a training set of genotyped and phenotyped lines and 2) GEBV prediction, for a set of genotyped-only lines. Options are available for 1) missing data imputation, 2) markers and training set selection and 3) genomic prediction with 15 different methods, either parametric or semi-parametric. The usefulness and efficiency of BWGS are illustrated using a population of wheat lines from a real breeding programme. Adjusted yield data from historical trials (highly unbalanced design) were used for testing the options of BWGS. On the whole, 760 candidate lines with adjusted phenotypes and genotypes for 47 839 robust SNP were used. With a simple desktop computer, we obtained results which compared with previously published results on wheat genomic selection. As predicted by the theory, factors that are most influencing predictive ability, for a given trait of moderate heritability, are the size of the training population and a minimum number of markers for capturing every QTL information. Missing data up to 40%, if randomly distributed, do not degrade predictive ability once imputed, and up to 80% randomly distributed missing data are still acceptable once imputed with Expectation-Maximization method of package rrBLUP. It is worth noticing that selecting markers that are most associated to the trait do improve predictive ability, compared with the whole set of markers, but only when marker selection is made on the whole population. When marker selection is made only on the sampled training set, this advantage nearly disappeared, since it was clearly due to overfitting. Few differences are observed between the 15 prediction models with this dataset. Although non-parametric methods that are supposed to capture non-additive effects have slightly better predictive accuracy, differences remain small. Finally, the GEBV from the 15 prediction models are all highly correlated to each other. These results are encouraging for an efficient use of genomic selection in applied breeding programmes and BWGS is a simple and powerful toolbox to apply in breeding programmes or training activities.

摘要

我们开发了一个名为 BWGS 的集成 R 库,用于方便计算基因组选择的育种值(GEBV)的基因组估计。BWGS 是为 BreedWheat 基因组选择开发的,它是一个名为 Breedwheat(https://breedwheat.fr)的合作公私伙伴关系项目的框架内开发的,并且依赖于现有的 R 库,这些库都可以从 CRAN 服务器免费获得。两个主要功能允许 1)在一组已分型和表型的系内进行复制随机交叉验证,2)对一组仅分型的系进行 GEBV 预测。选项可用于 1)缺失数据插补,2)标记和训练集选择以及 3)使用 15 种不同的方法进行基因组预测,包括参数和半参数方法。使用来自实际育种计划的小麦系群体来说明 BWGS 的有用性和效率。使用历史试验的调整产量数据(高度不平衡设计)来测试 BWGS 的选项。总的来说,使用了 760 条具有调整后表型和基因型的候选系,用于 47839 个稳健 SNP。使用简单的台式计算机,我们获得了与以前发表的小麦基因组选择结果相比较的结果。根据理论预测,对于中等遗传力的给定性状,影响预测能力的最重要因素是训练群体的大小和捕获每个 QTL 信息的最小标记数量。如果缺失数据随机分布且不超过 40%,则在插补后不会降低预测能力,并且在使用 rrBLUP 包的期望最大化方法插补多达 80%的随机分布缺失数据后仍可接受。值得注意的是,与整个标记集相比,选择与性状最相关的标记确实可以提高预测能力,但仅当在整个群体上进行标记选择时。当仅在抽样训练集上进行标记选择时,这种优势几乎消失,因为这显然是由于过度拟合所致。使用此数据集,在 15 种预测模型之间观察到很少的差异。尽管假定捕获非加性效应的非参数方法具有稍高的预测准确性,但差异仍然很小。最后,15 种预测模型的 GEBV 彼此高度相关。这些结果对于在应用育种计划中有效使用基因组选择是令人鼓舞的,并且 BWGS 是在育种计划或培训活动中应用的简单而强大的工具包。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/c987569d9cd6/pone.0222733.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/59c3f98f74ca/pone.0222733.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/163e143fd2c4/pone.0222733.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/4670f8f37501/pone.0222733.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/1496db566cc4/pone.0222733.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/e396c472cce0/pone.0222733.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/03f6d4d4848a/pone.0222733.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/850ecc64b2b7/pone.0222733.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/c987569d9cd6/pone.0222733.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/59c3f98f74ca/pone.0222733.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/163e143fd2c4/pone.0222733.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/4670f8f37501/pone.0222733.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/1496db566cc4/pone.0222733.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/e396c472cce0/pone.0222733.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/03f6d4d4848a/pone.0222733.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/850ecc64b2b7/pone.0222733.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d288/7141418/c987569d9cd6/pone.0222733.g008.jpg

相似文献

1
BWGS: A R package for genomic selection and its application to a wheat breeding programme.BWGS:一个基因组选择的 R 包及其在小麦育种计划中的应用。
PLoS One. 2020 Apr 2;15(4):e0222733. doi: 10.1371/journal.pone.0222733. eCollection 2020.
2
Genomic Selection in Winter Wheat Breeding Using a Recommender Approach.利用推荐方法进行冬小麦育种中的基因组选择。
Genes (Basel). 2020 Jul 11;11(7):779. doi: 10.3390/genes11070779.
3
Genotype Imputation in Winter Wheat Using First-Generation Haplotype Map SNPs Improves Genome-Wide Association Mapping and Genomic Prediction of Traits.利用第一代单倍型图谱 SNPs 对冬小麦进行基因型推断可提高全基因组关联作图和性状的基因组预测。
G3 (Bethesda). 2019 Jan 9;9(1):125-133. doi: 10.1534/g3.118.200664.
4
Canopy Temperature and Vegetation Indices from High-Throughput Phenotyping Improve Accuracy of Pedigree and Genomic Selection for Grain Yield in Wheat.高通量表型分析得出的冠层温度和植被指数提高了小麦籽粒产量系谱选择和基因组选择的准确性。
G3 (Bethesda). 2016 Sep 8;6(9):2799-808. doi: 10.1534/g3.116.032888.
5
Multi-Trait Genomic Prediction of Yield-Related Traits in US Soft Wheat under Variable Water Regimes.在不同水分条件下美国软小麦产量相关性状的多性状基因组预测。
Genes (Basel). 2020 Oct 28;11(11):1270. doi: 10.3390/genes11111270.
6
Potential and limits to unravel the genetic architecture and predict the variation of Fusarium head blight resistance in European winter wheat (Triticum aestivum L.).解析欧洲冬小麦(Triticum aestivum L.)赤霉病抗性遗传结构及预测其变异的潜力与局限
Heredity (Edinb). 2015 Mar;114(3):318-26. doi: 10.1038/hdy.2014.104. Epub 2014 Nov 12.
7
Genomic Prediction for Grain Yield and Yield-Related Traits in Chinese Winter Wheat.中国冬小麦产量及产量相关性状的基因组预测。
Int J Mol Sci. 2020 Feb 17;21(4):1342. doi: 10.3390/ijms21041342.
8
Genomic selection models double the accuracy of predicted breeding values for bacterial cold water disease resistance compared to a traditional pedigree-based model in rainbow trout aquaculture.与虹鳟鱼养殖中基于传统系谱的模型相比,基因组选择模型可将预测的细菌性冷水病抗性育种值的准确性提高一倍。
Genet Sel Evol. 2017 Feb 1;49(1):17. doi: 10.1186/s12711-017-0293-6.
9
Effects of number of training generations on genomic prediction for various traits in a layer chicken population.训练世代数对蛋鸡群体中各种性状基因组预测的影响。
Genet Sel Evol. 2016 Mar 19;48:22. doi: 10.1186/s12711-016-0198-9.
10
Accuracy of genomic selection for a sib-evaluated trait using identity-by-state and identity-by-descent relationships.利用状态一致性和系谱一致性关系对同胞评估性状进行基因组选择的准确性。
Genet Sel Evol. 2015 Feb 25;47(1):9. doi: 10.1186/s12711-014-0084-2.

引用本文的文献

1
Multifactor Analysis of a Genome-Wide Selection System in L.番茄全基因组选择系统的多因素分析
Plants (Basel). 2025 Jul 8;14(14):2095. doi: 10.3390/plants14142095.
2
Genomic selection: Essence, applications, and prospects.基因组选择:本质、应用与前景。
Plant Genome. 2025 Jun;18(2):e70053. doi: 10.1002/tpg2.70053.
3
Genomic and Transcriptomic Profiling of Amino Acid Compositions in Common Carp Fillets.鲤鱼鱼片氨基酸组成的基因组和转录组分析

本文引用的文献

1
A deep convolutional neural network approach for predicting phenotypes from genotypes.一种基于深度卷积神经网络的基因型到表型预测方法。
Planta. 2018 Nov;248(5):1307-1318. doi: 10.1007/s00425-018-2976-9. Epub 2018 Aug 12.
2
Genomic Prediction of Breeding Values Using a Subset of SNPs Identified by Three Machine Learning Methods.使用三种机器学习方法鉴定的单核苷酸多态性(SNP)子集对育种值进行基因组预测。
Front Genet. 2018 Jul 4;9:237. doi: 10.3389/fgene.2018.00237. eCollection 2018.
3
High density marker panels, SNPs prioritizing and accuracy of genomic selection.
Animals (Basel). 2025 May 6;15(9):1335. doi: 10.3390/ani15091335.
4
Using phenomic selection to predict hybrid values with NIR spectra measured on the parental lines: proof of concept on maize.利用表型组选择,通过对亲本系测量的近红外光谱预测杂种值:玉米的概念验证
Theor Appl Genet. 2025 Jan 11;138(1):28. doi: 10.1007/s00122-024-04809-4.
5
Improvement in genomic prediction of maize with prior gene ontology information depends on traits and environmental conditions.利用先前的基因本体信息改进玉米基因组预测取决于性状和环境条件。
Plant Genome. 2025 Mar;18(1):e20553. doi: 10.1002/tpg2.20553.
6
ShinyGS-a graphical toolkit with a serial of genetic and machine learning models for genomic selection: application, benchmarking, and recommendations.ShinyGS——一个带有一系列用于基因组选择的遗传和机器学习模型的图形工具包:应用、基准测试及建议
Front Plant Sci. 2024 Dec 24;15:1480902. doi: 10.3389/fpls.2024.1480902. eCollection 2024.
7
Performance of phenomic selection in rice: Effects of population size and genotype-environment interactions on predictive ability.水稻表型组选择的性能:群体大小和基因型-环境互作对预测能力的影响。
PLoS One. 2024 Dec 23;19(12):e0309502. doi: 10.1371/journal.pone.0309502. eCollection 2024.
8
Machine learning-enhanced multi-trait genomic prediction for optimizing cannabinoid profiles in cannabis.机器学习增强的多性状基因组预测,用于优化大麻中的大麻素谱。
Plant J. 2025 Jan;121(1):e17164. doi: 10.1111/tpj.17164. Epub 2024 Nov 27.
9
Global genotype by environment prediction competition reveals that diverse modeling strategies can deliver satisfactory maize yield estimates.全球基因型与环境互作预测竞赛表明,多种建模策略均可提供令人满意的玉米产量估计。
Genetics. 2025 Feb 5;229(2). doi: 10.1093/genetics/iyae195.
10
Global Genotype by Environment Prediction Competition Reveals That Diverse Modeling Strategies Can Deliver Satisfactory Maize Yield Estimates.全球基因型与环境预测竞赛表明,多种建模策略可提供令人满意的玉米产量估计。
bioRxiv. 2024 Sep 19:2024.09.13.612969. doi: 10.1101/2024.09.13.612969.
高密度标记面板、单核苷酸多态性(SNP)的优先级排序及基因组选择的准确性
BMC Genet. 2018 Jan 5;19(1):4. doi: 10.1186/s12863-017-0595-2.
4
High throughput SNP discovery and genotyping in hexaploid wheat.六倍体小麦中高通量单核苷酸多态性的发现与基因分型
PLoS One. 2018 Jan 2;13(1):e0186329. doi: 10.1371/journal.pone.0186329. eCollection 2018.
5
Improving the baking quality of bread wheat by genomic selection in early generations.通过早期世代的基因组选择提高面包小麦的烘焙品质。
Theor Appl Genet. 2018 Feb;131(2):477-493. doi: 10.1007/s00122-017-2998-x. Epub 2017 Oct 23.
6
Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.基因组选择在植物育种中的应用:方法、模型与展望。
Trends Plant Sci. 2017 Nov;22(11):961-975. doi: 10.1016/j.tplants.2017.08.011. Epub 2017 Sep 28.
7
Practical application of genomic selection in a doubled-haploid winter wheat breeding program.基因组选择在双单倍体冬小麦育种计划中的实际应用。
Mol Breed. 2017;37(10):117. doi: 10.1007/s11032-017-0715-8. Epub 2017 Sep 3.
8
Predicting genomic selection efficiency to optimize calibration set and to assess prediction accuracy in highly structured populations.预测基因组选择效率以优化校准集并评估高度结构化群体中的预测准确性。
Theor Appl Genet. 2017 Nov;130(11):2231-2247. doi: 10.1007/s00122-017-2956-7. Epub 2017 Aug 9.
9
Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines.利用关联研究结果和系谱信息优化基因组预测的训练群体规模和基因分型策略。以先进小麦育种系为例的研究。
PLoS One. 2017 Jan 12;12(1):e0169606. doi: 10.1371/journal.pone.0169606. eCollection 2017.
10
When more is better: how data sharing would accelerate genomic selection of crop plants.多多益善:数据共享如何加速作物的基因组选择
New Phytol. 2016 Dec;212(4):814-826. doi: 10.1111/nph.14174. Epub 2016 Sep 26.