Queensland Alliance for Agriculture and Food Innovation (QAAFI), Hermitage Research Facility, The University of Queensland, Warwick, Queensland, Australia.
Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, China.
Nat Plants. 2021 Jun;7(6):766-773. doi: 10.1038/s41477-021-00925-x. Epub 2021 May 20.
Sorghum is a drought-tolerant staple crop for half a billion people in Africa and Asia, an important source of animal feed throughout the world and a biofuel feedstock of growing importance. Cultivated sorghum and its inter-fertile wild relatives constitute the primary gene pool for sorghum. Understanding and characterizing the diversity within this valuable resource is fundamental for its effective utilization in crop improvement. Here, we report analysis of a sorghum pan-genome to explore genetic diversity within the sorghum primary gene pool. We assembled 13 genomes representing cultivated sorghum and its wild relatives, and integrated them with 3 other published genomes to generate a pan-genome of 44,079 gene families with 222.6 Mb of new sequence identified. The pan-genome displays substantial gene-content variation, with 64% of gene families showing presence/absence variation among genomes. Comparisons between core genes and dispensable genes suggest that dispensable genes are important for sorghum adaptation. Extensive genetic variation was uncovered within the pan-genome, and the distribution of these variations was influenced by variation of recombination rate and transposable element content across the genome. We identified presence/absence variants that were under selection during sorghum domestication and improvement, and demonstrated that such variation had important phenotypic outcomes that could contribute to crop improvement. The constructed sorghum pan-genome represents an important resource for sorghum improvement and gene discovery.
高粱是非洲和亚洲 5 亿人口的耐旱主食作物,是全世界重要的动物饲料来源,也是生物燃料的重要原料。栽培高粱及其可杂交的野生近缘种构成了高粱的主要基因库。了解和描述这一宝贵资源的多样性对于其在作物改良中的有效利用至关重要。在这里,我们报告了高粱泛基因组的分析,以探索高粱主要基因库内的遗传多样性。我们组装了 13 个代表栽培高粱及其野生近缘种的基因组,并将其与另外 3 个已发表的基因组整合,生成了一个包含 44079 个基因家族的泛基因组,其中有 222.6 Mb 的新序列被鉴定。泛基因组显示出大量的基因含量变异,其中 64%的基因家族在基因组之间存在存在/缺失变异。核心基因和可丢弃基因之间的比较表明,可丢弃基因对高粱的适应很重要。在泛基因组中发现了广泛的遗传变异,这些变异的分布受到基因组中重组率和转座元件含量变化的影响。我们鉴定了在高粱驯化和改良过程中受到选择的存在/缺失变异,并证明这种变异对表型有重要影响,可能有助于作物改良。构建的高粱泛基因组为高粱改良和基因发现提供了一个重要的资源。