在结构化的地中海燕麦群体中进行基因组预测和训练集优化。

Genomic prediction and training set optimization in a structured Mediterranean oat population.

机构信息

Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain.

Institute for Sustainable Agriculture, Spanish Research Council (CSIC), Córdoba, Spain.

出版信息

Theor Appl Genet. 2021 Nov;134(11):3595-3609. doi: 10.1007/s00122-021-03916-w. Epub 2021 Aug 3.

DOI:10.1007/s00122-021-03916-w

PMID:34341832

Abstract

The strong genetic structure observed in Mediterranean oats affects the predictive ability of genomic prediction as well as the performance of training set optimization methods. In this study, we investigated the efficiency of genomic prediction and training set optimization in a highly structured population of cultivars and landraces of cultivated oat (Avena sativa) from the Mediterranean basin, including white (subsp. sativa) and red (subsp. byzantina) oats, genotyped using genotype-by-sequencing markers and evaluated for agronomic traits in Southern Spain. For most traits, the predictive abilities were moderate to high with little differences between models, except for biomass for which Bayes-B showed a substantial gain compared to other models. The consistency between the structure of the training population and the population to be predicted was key to the predictive ability of genomic predictions. The predictive ability of inter-subspecies predictions was indeed much lower than that of intra-subspecies predictions for all traits. Regarding training set optimization, the linear mixed model optimization criteria (prediction error variance (PEVmean) and coefficient of determination (CDmean)) performed better than the heuristic approach "partitioning around medoids," even under high population structure. The superiority of CDmean and PEVmean could be explained by their ability to adapt the representation of each genetic group according to those represented in the population to be predicted. These results represent an important step towards the implementation of genomic prediction in oat breeding programs and address important issues faced by the genomic prediction community regarding population structure and training set optimization.

摘要

在中观燕麦中观察到的强大遗传结构会影响基因组预测的预测能力以及训练集优化方法的性能。在这项研究中，我们研究了高度结构化的栽培燕麦品种和地方品种群体（包括白燕麦（亚种 sativa）和红燕麦（亚种 byzantina））的基因组预测和训练集优化效率，该群体来自地中海盆地，使用基于测序的基因型标记进行基因型分析，并在西班牙南部评估了农艺性状。对于大多数性状，预测能力为中等至高，不同模型之间的差异很小，除了生物质，贝叶斯-B 与其他模型相比有很大的提高。训练群体和要预测的群体之间的结构一致性是基因组预测预测能力的关键。事实上，与种内预测相比，种间预测的所有性状的预测能力都要低得多。关于训练集优化，线性混合模型优化标准（预测误差方差（PEVmean）和决定系数（CDmean））比启发式方法“围绕中位数分区”表现更好，即使在高度结构的情况下也是如此。CDmean 和 PEVmean 的优越性可以解释为它们能够根据预测群体中代表的遗传群体来适应每个遗传群体的表示。这些结果代表了在燕麦育种计划中实施基因组预测的重要一步，并解决了基因组预测社区在群体结构和训练集优化方面面临的重要问题。

相似文献

Genomic prediction and training set optimization in a structured Mediterranean oat population.在结构化的地中海燕麦群体中进行基因组预测和训练集优化。

Theor Appl Genet. 2021 Nov;134(11):3595-3609. doi: 10.1007/s00122-021-03916-w. Epub 2021 Aug 3.

Population genomics of Mediterranean oat (A. sativa) reveals high genetic diversity and three loci for heading date.地中海燕麦（A. sativa）的群体基因组学揭示了高度的遗传多样性和三个控制抽穗期的基因座。

Theor Appl Genet. 2021 Jul;134(7):2063-2077. doi: 10.1007/s00122-021-03805-2. Epub 2021 Mar 26.

Implementing multi-trait genomic selection to improve grain milling quality in oats (Avena sativa L.).实施多性状基因组选择以提高燕麦（Avena sativa L.）的制粉品质。

Plant Genome. 2024 Jun;17(2):e20457. doi: 10.1002/tpg2.20457. Epub 2024 May 19.

Training set optimization under population structure in genomic selection.基因组选择中群体结构下的训练集优化

Theor Appl Genet. 2015 Jan;128(1):145-58. doi: 10.1007/s00122-014-2418-4. Epub 2014 Nov 1.

Optimization of training sets for genomic prediction of early-stage single crosses in maize.优化训练集以进行玉米早期单交种的基因组预测。

Theor Appl Genet. 2021 Feb;134(2):687-699. doi: 10.1007/s00122-020-03722-w. Epub 2021 Jan 4.

Genetic diversity and genome-wide association analysis in Chinese hulless oat germplasm.中国裸燕麦种质资源的遗传多样性及全基因组关联分析

Theor Appl Genet. 2020 Dec;133(12):3365-3380. doi: 10.1007/s00122-020-03674-1. Epub 2020 Sep 4.

Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel.训练群体选择和固定效应的使用，以优化美国历史冬小麦面板的基因组预测。

Theor Appl Genet. 2019 Apr;132(4):1247-1261. doi: 10.1007/s00122-019-03276-6. Epub 2019 Jan 24.

The mosaic oat genome gives insights into a uniquely healthy cereal crop.镶嵌燕麦基因组揭示了一种独特的健康谷物作物。

Nature. 2022 Jun;606(7912):113-119. doi: 10.1038/s41586-022-04732-y. Epub 2022 May 18.

Training set optimization of genomic prediction by means of EthAcc.通过 EthAcc 对基因组预测进行训练集优化。

PLoS One. 2019 Feb 19;14(2):e0205629. doi: 10.1371/journal.pone.0205629. eCollection 2019.

Combining genetic resources and elite material populations to improve the accuracy of genomic prediction in apple.结合遗传资源和优良材料群体提高苹果基因组预测的准确性。

G3 (Bethesda). 2022 Mar 4;12(3). doi: 10.1093/g3journal/jkab420.

引用本文的文献

Transferability of genomic prediction models across market segments in potato and the effect of selection.马铃薯基因组预测模型在不同市场细分中的可转移性及选择效应

Theor Appl Genet. 2025 Aug 20;138(9):219. doi: 10.1007/s00122-025-05004-9.

Optimizing fully-efficient two-stage models for genomic selection using open-source software.使用开源软件优化用于基因组选择的全效两阶段模型。

Plant Methods. 2025 Feb 4;21(1):9. doi: 10.1186/s13007-024-01318-9.

Revisiting superiority and stability metrics of cultivar performances using genomic data: derivations of new estimators.利用基因组数据重新审视品种表现的优势和稳定性指标：新估计量的推导

Plant Methods. 2024 Jun 6;20(1):85. doi: 10.1186/s13007-024-01207-1.

Maximizing efficiency in sunflower breeding through historical data optimization.通过历史数据优化实现向日葵育种效率最大化。

Plant Methods. 2024 Mar 16;20(1):42. doi: 10.1186/s13007-024-01151-0.

Whole-genome resequencing of major populations revealed domestication-related genes in yaks.对主要人群的全基因组重测序揭示了牦牛驯化相关基因。

BMC Genomics. 2024 Jan 17;25(1):69. doi: 10.1186/s12864-024-09993-7.

Multi-Omics Pipeline and Omics-Integration Approach to Decipher Plant's Abiotic Stress Tolerance Responses.多组学分析管道和组学整合方法解析植物的非生物胁迫耐受反应。

Genes (Basel). 2023 Jun 16;14(6):1281. doi: 10.3390/genes14061281.

Utilizing Genomics to Characterize the Common Oat Gene Pool-The Story of More Than a Century of Polish Breeding.利用基因组学描述普通燕麦基因库——波兰一个多世纪的育种故事。

Int J Mol Sci. 2023 Mar 31;24(7):6547. doi: 10.3390/ijms24076547.

Multi-environment Genomic Selection in Rice Elite Breeding Lines.水稻优良品系的多环境基因组选择

Rice (N Y). 2023 Feb 8;16(1):7. doi: 10.1186/s12284-023-00623-6.

Breeding oat for resistance to the crown rust pathogen Puccinia coronata f. sp. avenae: achievements and prospects.培育燕麦抗冠锈病病原菌 Puccinia coronata f. sp. avenae：成就与展望。

Theor Appl Genet. 2022 Nov;135(11):3709-3734. doi: 10.1007/s00122-022-04121-z. Epub 2022 Jun 4.

Development of a Model for Genomic Prediction of Multiple Traits in Common Bean Germplasm, Based on Population Structure.基于群体结构的菜豆种质多性状基因组预测模型的开发

Plants (Basel). 2022 May 12;11(10):1298. doi: 10.3390/plants11101298.

本文引用的文献

Design of training populations for selective phenotyping in genomic prediction.用于基因组预测中选择性表型分析的训练群体设计。

Sci Rep. 2019 Feb 5;9(1):1446. doi: 10.1038/s41598-018-38081-6.

Optimal Designs for Genomic Selection in Hybrid Crops.杂种作物基因组选择的最优设计。

Mol Plant. 2019 Mar 4;12(3):390-401. doi: 10.1016/j.molp.2018.12.022. Epub 2019 Jan 6.

Haplotype-based genotyping-by-sequencing in oat genome research.基于单倍型的测序基因型分析在燕麦基因组研究中的应用。

Plant Biotechnol J. 2018 Aug;16(8):1452-1463. doi: 10.1111/pbi.12888. Epub 2018 Mar 25.

Population Structure and Genotype-Phenotype Associations in a Collection of Oat Landraces and Historic Cultivars.燕麦地方品种和历史栽培品种群体的结构及基因型-表型关联

Front Plant Sci. 2016 Jul 29;7:1077. doi: 10.3389/fpls.2016.01077. eCollection 2016.

Genomic Prediction in Pea: Effect of Marker Density and Training Population Size and Composition on Prediction Accuracy.豌豆中的基因组预测：标记密度、训练群体大小和组成对预测准确性的影响。

Front Plant Sci. 2015 Nov 17;6:941. doi: 10.3389/fpls.2015.00941. eCollection 2015.

Modeling Epistasis in Genomic Selection.遗传选择中的上位性建模。

Genetics. 2015 Oct;201(2):759-68. doi: 10.1534/genetics.115.177907. Epub 2015 Jul 27.

Multibreed genomic evaluations using purebred Holsteins, Jerseys, and Brown Swiss.使用纯种荷斯坦牛、娟姗牛和瑞士褐牛进行多品种基因组评估。

J Dairy Sci. 2012 Sep;95(9):5378-5383. doi: 10.3168/jds.2011-5006.

Random forests for genomic data analysis.随机森林在基因组数据分析中的应用。

Genomics. 2012 Jun;99(6):323-9. doi: 10.1016/j.ygeno.2012.04.003. Epub 2012 Apr 21.

Short communication: Genomic selection using a multi-breed, across-country reference population.简讯：利用多品种、跨国参考群体进行基因组选择。

J Dairy Sci. 2011 May;94(5):2625-30. doi: 10.3168/jds.2010-3719.

Reliability of genomic predictions across multiple populations.跨多个群体的基因组预测的可靠性。

Genetics. 2009 Dec;183(4):1545-53. doi: 10.1534/genetics.109.104935. Epub 2009 Oct 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在结构化的地中海燕麦群体中进行基因组预测和训练集优化。

Genomic prediction and training set optimization in a structured Mediterranean oat population.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献