优化小麦基因组选择：标记密度、群体大小和群体结构对预测准确性的影响

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy.

作者信息

Norman Adam, Taylor Julian, Edwards James, Kuchel Haydn

机构信息

School of Agriculture, Food & Wine, University of Adelaide

School of Agriculture, Food & Wine, University of Adelaide.

出版信息

G3 (Bethesda). 2018 Aug 30;8(9):2889-2899. doi: 10.1534/g3.118.200311.

DOI:10.1534/g3.118.200311

PMID:29970398

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6118301/

Abstract

Genomic selection applied to plant breeding enables earlier estimates of a line's performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel of 10,375 bread wheat () lines genotyped with 18,101 SNP markers to investigate the effect and interaction of training set size, population structure and marker density on genomic prediction accuracy. Through assessing the effect of training set size we showed the rate at which prediction accuracy increases is slower beyond approximately 2,000 lines. The structure of the panel was assessed via principal component analysis and K-means clustering, and its effect on prediction accuracy was examined through a novel cross-validation analysis according to the K-means clusters and breeding cohorts. Here we showed that accuracy can be improved by increasing the diversity within the training set, particularly when relatedness between training and validation sets is low. The breeding cohort analysis revealed that traits with higher selection pressure (lower allelic diversity) can be more accurately predicted by including several previous cohorts in the training set. The effect of marker density and its interaction with population structure was assessed for marker subsets containing between 100 and 17,181 markers. This analysis showed that response to increased marker density is largest when using a diverse training set to predict between poorly related material. These findings represent a significant resource for plant breeders and contribute to the collective knowledge on the optimal structure of calibration panels for genomic prediction.

摘要

将基因组选择应用于植物育种能够更早地估计品系的表现，并显著缩短世代间隔。如果育种者想要充分利用基因组选择的潜力，就应该充分了解影响预测准确性的几个因素。我们使用了一个由10375个面包小麦（）品系组成的群体，这些品系用18101个单核苷酸多态性（SNP）标记进行了基因分型，以研究训练集大小、群体结构和标记密度对基因组预测准确性的影响及相互作用。通过评估训练集大小的影响，我们发现，超过大约2000个品系后，预测准确性的提高速度会变慢。通过主成分分析和K均值聚类评估了群体结构，并根据K均值聚类和育种群体，通过一种新颖的交叉验证分析来检验其对预测准确性的影响。在这里，我们表明，通过增加训练集内的多样性可以提高准确性，特别是当训练集和验证集之间的亲缘关系较低时。育种群体分析表明，对于选择压力较高（等位基因多样性较低）的性状，通过在训练集中纳入几个先前的群体，可以更准确地进行预测。对于包含100至17181个标记的标记子集，评估了标记密度的影响及其与群体结构的相互作用。该分析表明，当使用多样化的训练集来预测亲缘关系较差的材料之间的情况时，对增加标记密度的反应最大。这些发现为植物育种者提供了重要资源，并有助于增进关于基因组预测校准群体最佳结构的集体知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6531/6118301/ebfe99e5e82b/2889f1.jpg

相似文献

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy.优化小麦基因组选择：标记密度、群体大小和群体结构对预测准确性的影响

G3 (Bethesda). 2018 Aug 30;8(9):2889-2899. doi: 10.1534/g3.118.200311.

Potential and limits to unravel the genetic architecture and predict the variation of Fusarium head blight resistance in European winter wheat (Triticum aestivum L.).解析欧洲冬小麦（Triticum aestivum L.）赤霉病抗性遗传结构及预测其变异的潜力与局限

Heredity (Edinb). 2015 Mar;114(3):318-26. doi: 10.1038/hdy.2014.104. Epub 2014 Nov 12.

Canopy Temperature and Vegetation Indices from High-Throughput Phenotyping Improve Accuracy of Pedigree and Genomic Selection for Grain Yield in Wheat.高通量表型分析得出的冠层温度和植被指数提高了小麦籽粒产量系谱选择和基因组选择的准确性。

G3 (Bethesda). 2016 Sep 8;6(9):2799-808. doi: 10.1534/g3.116.032888.

A classic approach for determining genomic prediction accuracy under terminal drought stress and well-watered conditions in wheat landraces and cultivars.一种经典的方法，用于确定小麦地方品种和栽培品种在终末干旱胁迫和充分供水条件下的基因组预测准确性。

PLoS One. 2021 Mar 5;16(3):e0247824. doi: 10.1371/journal.pone.0247824. eCollection 2021.

The effects of training population design on genomic prediction accuracy in wheat.训练群体设计对小麦基因组预测准确性的影响。

Theor Appl Genet. 2019 Jul;132(7):1943-1952. doi: 10.1007/s00122-019-03327-y. Epub 2019 Mar 19.

Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model.使用标记×环境互作基因组选择模型提高小麦育种试验中的预测准确性。

G3 (Bethesda). 2015 Feb 6;5(4):569-82. doi: 10.1534/g3.114.016097.

BWGS: A R package for genomic selection and its application to a wheat breeding programme.BWGS：一个基因组选择的 R 包及其在小麦育种计划中的应用。

PLoS One. 2020 Apr 2;15(4):e0222733. doi: 10.1371/journal.pone.0222733. eCollection 2020.

Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel.训练群体选择和固定效应的使用，以优化美国历史冬小麦面板的基因组预测。

Theor Appl Genet. 2019 Apr;132(4):1247-1261. doi: 10.1007/s00122-019-03276-6. Epub 2019 Jan 24.

Accuracy of genomic selection for grain yield and agronomic traits in soft red winter wheat.基因组选择对软红冬小麦粒产量和农艺性状的准确性。

BMC Genet. 2019 Nov 1;20(1):82. doi: 10.1186/s12863-019-0785-1.

Predicting Hybrid Performances for Quality Traits through Genomic-Assisted Approaches in Central European Wheat.通过基因组辅助方法预测中欧小麦品质性状的杂种表现

PLoS One. 2016 Jul 6;11(7):e0158635. doi: 10.1371/journal.pone.0158635. eCollection 2016.

引用本文的文献

Harnessing big data for enhanced genome-wide prediction in winter wheat breeding.利用大数据增强冬小麦育种中的全基因组预测

Theor Appl Genet. 2025 Aug 22;138(9):224. doi: 10.1007/s00122-025-05007-6.

Transferability of genomic prediction models across market segments in potato and the effect of selection.马铃薯基因组预测模型在不同市场细分中的可转移性及选择效应

Theor Appl Genet. 2025 Aug 20;138(9):219. doi: 10.1007/s00122-025-05004-9.

Genetic parameters and genomic prediction of egg production traits in ducks.鸭产蛋性状的遗传参数与基因组预测

Poult Sci. 2025 Jul 3;104(10):105510. doi: 10.1016/j.psj.2025.105510.

Integrating multi-omics and machine learning for disease resistance prediction in legumes.整合多组学和机器学习用于豆类抗病性预测

Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.

KBeagle: An Adaptive Strategy and Tool for Improving Imputation Accuracy and Computation Time.KBeagle：一种提高插补准确性和计算时间的自适应策略与工具。

Int J Mol Sci. 2025 Jun 18;26(12):5797. doi: 10.3390/ijms26125797.

Low density marker-based effectiveness and efficiency of early-generation genomic selection relative to phenotype-based selection in dolichos bean (Lablab purpureus L. Sweet).基于低密度标记的菜豆（Lablab purpureus L. Sweet）早期基因组选择相对于基于表型选择的有效性和效率

Plant Genome. 2025 Jun;18(2):e70039. doi: 10.1002/tpg2.70039.

Optimizing genomic prediction for complex traits via investigating multiple factors in switchgrass.通过研究柳枝稷中的多种因素优化复杂性状的基因组预测。

Plant Physiol. 2025 Jul 3;198(3). doi: 10.1093/plphys/kiaf188.

Genomic selection of maize test-cross hybrids leveraged by marker sampling.利用标记抽样对玉米测交杂种进行基因组选择。

Plant Genome. 2025 Jun;18(2):e70030. doi: 10.1002/tpg2.70030.

Breaking down data silos across companies to train genome-wide predictions: A feasibility study in wheat.打破公司间的数据孤岛以训练全基因组预测：小麦的可行性研究

Plant Biotechnol J. 2025 Jul;23(7):2704-2719. doi: 10.1111/pbi.70095. Epub 2025 Apr 20.

Rapid Identification of Alien Chromosome Fragments and Tracing of Bioactive Compound Genes in Intergeneric Hybrid Offspring Between and Based on AMAC Method.基于AMAC法快速鉴定属间杂交后代中外源染色体片段及追踪生物活性化合物基因

Int J Mol Sci. 2025 Feb 27;26(5):2091. doi: 10.3390/ijms26052091.

本文引用的文献

Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection.两阶段方案中利用快速轮回基因组选择进行长期遗传增益的最优杂交选择。

Theor Appl Genet. 2018 Sep;131(9):1953-1966. doi: 10.1007/s00122-018-3125-3. Epub 2018 Jun 6.

Rice diversity panel provides accurate genomic predictions for complex traits in the progenies of biparental crosses involving members of the panel.水稻多样性群体为涉及该群体成员的双亲杂交后代的复杂性状提供了准确的基因组预测。

Theor Appl Genet. 2018 Feb;131(2):417-435. doi: 10.1007/s00122-017-3011-4. Epub 2017 Nov 14.

Increased genomic prediction accuracy in wheat breeding using a large Australian panel.利用澳大利亚大型样本提高小麦育种中的基因组预测准确性。

Theor Appl Genet. 2017 Dec;130(12):2543-2555. doi: 10.1007/s00122-017-2975-4. Epub 2017 Sep 8.

Genome-wide mapping and prediction suggests presence of local epistasis in a vast elite winter wheat populations adapted to Central Europe.全基因组定位和预测表明，在适应中欧的大量优质冬小麦群体中存在局部上位性。

Theor Appl Genet. 2017 Apr;130(4):635-647. doi: 10.1007/s00122-016-2840-x. Epub 2016 Dec 19.

Genomic assisted selection for enhancing line breeding: merging genomic and phenotypic selection in winter wheat breeding programs with preliminary yield trials.通过基因组辅助选择提高品系选育：在冬小麦育种计划中结合基因组和表型选择并进行初步产量试验

Theor Appl Genet. 2017 Feb;130(2):363-376. doi: 10.1007/s00122-016-2818-8. Epub 2016 Nov 8.

Model training across multiple breeding cycles significantly improves genomic prediction accuracy in rye (Secale cereale L.).跨多个育种周期的模型训练显著提高了黑麦（Secale cereale L.）的基因组预测准确性。

Theor Appl Genet. 2016 Nov;129(11):2043-2053. doi: 10.1007/s00122-016-2756-5. Epub 2016 Aug 1.

Genomic Prediction of Gene Bank Wheat Landraces.基因库中小麦地方品种的基因组预测

G3 (Bethesda). 2016 Jul 7;6(7):1819-34. doi: 10.1534/g3.116.029637.

Review: How to improve genomic predictions in small dairy cattle populations.综述：如何提高小奶牛群体的基因组预测准确性。

Animal. 2016 Jun;10(6):1042-9. doi: 10.1017/S1751731115003031. Epub 2016 Jan 19.

Accuracy of whole-genome prediction using a genetic architecture-enhanced variance-covariance matrix.使用遗传结构增强的方差协方差矩阵进行全基因组预测的准确性。

G3 (Bethesda). 2015 Feb 9;5(4):615-27. doi: 10.1534/g3.114.016261.

Training set optimization under population structure in genomic selection.基因组选择中群体结构下的训练集优化

Theor Appl Genet. 2015 Jan;128(1):145-58. doi: 10.1007/s00122-014-2418-4. Epub 2014 Nov 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

优化小麦基因组选择：标记密度、群体大小和群体结构对预测准确性的影响

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献