Scotland's Rural College (SRUC), Kings Buildings, West Mains Road, Edinburgh, EH9 3JG, UK.
The John Bingham Laboratory, NIAB, 93 Lawrence Weaver Road, Cambridge, CB3 0LE, UK.
Theor Appl Genet. 2022 Feb;135(2):667-678. doi: 10.1007/s00122-021-03991-z. Epub 2021 Nov 15.
Variety age and population structure detect novel QTL for yield and adaptation in wheat and barley without the need to phenotype. The process of crop breeding over the last century has delivered new varieties with increased genetic gains, resulting in higher crop performance and yield. However, in many cases, the alleles and genomic regions underpinning this success remain unknown. This is partly due to the difficulty of generating sufficient phenotypic data on large numbers of historical varieties to enable such analyses. Here we demonstrate the ability to circumvent such bottlenecks by identifying genomic regions selected over 100 years of crop breeding using age of a variety as a surrogate for yield. Rather than collecting phenotype data, we deployed 'environmental genome-wide association scans' (EnvGWAS) based on variety age in two of the world's most important crops, wheat and barley, and detected strong signals of selection across both genomes. EnvGWAS identified 16 genomic regions in barley and 10 in wheat with contrasting patterns between spring and winter types of the two crops. To further examine changes in genome structure, we used the genomic relationship matrix of the genotypic data to derive eigenvectors for analysis in EigenGWAS. This detected seven major chromosomal introgressions that contributed to adaptation in wheat. EigenGWAS and EnvGWAS based on variety age avoid costly phenotyping and facilitate the identification of genomic tracts that have been under selection during breeding. Our results demonstrate the potential of using historical cultivar collections coupled with genomic data to identify chromosomal regions under selection and may help guide future plant breeding strategies to maximise the rate of genetic gain and adaptation.
品种的年龄和群体结构在无需表型分析的情况下,检测到小麦和大麦产量和适应性的新 QTL。在上个世纪的作物育种过程中,产生了具有更高遗传增益的新品种,从而提高了作物的表现和产量。然而,在许多情况下,支持这种成功的等位基因和基因组区域仍然未知。这在一定程度上是由于难以生成大量历史品种的足够表型数据,从而无法进行此类分析。在这里,我们通过使用品种的年龄作为产量的替代指标来识别 100 多年来作物育种过程中选择的基因组区域,从而证明了规避这种瓶颈的能力。我们没有收集表型数据,而是在世界上两种最重要的作物小麦和大麦中,基于品种的年龄部署了“环境全基因组关联扫描”(EnvGWAS),并在两个基因组中都检测到了强烈的选择信号。EnvGWAS 在大麦中鉴定出 16 个基因组区域,在小麦中鉴定出 10 个基因组区域,这两个作物的春播和冬播类型之间存在相反的模式。为了进一步研究基因组结构的变化,我们使用基因型数据的基因组关系矩阵来推导 EigenGWAS 分析的特征向量。这在小麦中检测到了七个主要的染色体渗入,它们对适应起到了作用。基于品种年龄的 EigenGWAS 和 EnvGWAS 避免了昂贵的表型分析,并有助于鉴定在育种过程中受到选择的基因组片段。我们的结果表明,使用历史品种收集和基因组数据相结合的方法来识别受选择的染色体区域具有潜力,并可能有助于指导未来的植物育种策略,以最大化遗传增益和适应性的速度。