Suppr超能文献

优化小麦基因组选择:标记密度、群体大小和群体结构对预测准确性的影响

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy.

作者信息

Norman Adam, Taylor Julian, Edwards James, Kuchel Haydn

机构信息

School of Agriculture, Food & Wine, University of Adelaide

School of Agriculture, Food & Wine, University of Adelaide.

出版信息

G3 (Bethesda). 2018 Aug 30;8(9):2889-2899. doi: 10.1534/g3.118.200311.

Abstract

Genomic selection applied to plant breeding enables earlier estimates of a line's performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel of 10,375 bread wheat () lines genotyped with 18,101 SNP markers to investigate the effect and interaction of training set size, population structure and marker density on genomic prediction accuracy. Through assessing the effect of training set size we showed the rate at which prediction accuracy increases is slower beyond approximately 2,000 lines. The structure of the panel was assessed via principal component analysis and K-means clustering, and its effect on prediction accuracy was examined through a novel cross-validation analysis according to the K-means clusters and breeding cohorts. Here we showed that accuracy can be improved by increasing the diversity within the training set, particularly when relatedness between training and validation sets is low. The breeding cohort analysis revealed that traits with higher selection pressure (lower allelic diversity) can be more accurately predicted by including several previous cohorts in the training set. The effect of marker density and its interaction with population structure was assessed for marker subsets containing between 100 and 17,181 markers. This analysis showed that response to increased marker density is largest when using a diverse training set to predict between poorly related material. These findings represent a significant resource for plant breeders and contribute to the collective knowledge on the optimal structure of calibration panels for genomic prediction.

摘要

将基因组选择应用于植物育种能够更早地估计品系的表现,并显著缩短世代间隔。如果育种者想要充分利用基因组选择的潜力,就应该充分了解影响预测准确性的几个因素。我们使用了一个由10375个面包小麦()品系组成的群体,这些品系用18101个单核苷酸多态性(SNP)标记进行了基因分型,以研究训练集大小、群体结构和标记密度对基因组预测准确性的影响及相互作用。通过评估训练集大小的影响,我们发现,超过大约2000个品系后,预测准确性的提高速度会变慢。通过主成分分析和K均值聚类评估了群体结构,并根据K均值聚类和育种群体,通过一种新颖的交叉验证分析来检验其对预测准确性的影响。在这里,我们表明,通过增加训练集内的多样性可以提高准确性,特别是当训练集和验证集之间的亲缘关系较低时。育种群体分析表明,对于选择压力较高(等位基因多样性较低)的性状,通过在训练集中纳入几个先前的群体,可以更准确地进行预测。对于包含100至17181个标记的标记子集,评估了标记密度的影响及其与群体结构的相互作用。该分析表明,当使用多样化的训练集来预测亲缘关系较差的材料之间的情况时,对增加标记密度的反应最大。这些发现为植物育种者提供了重要资源,并有助于增进关于基因组预测校准群体最佳结构的集体知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6531/6118301/ebfe99e5e82b/2889f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验