Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, United States of America.
Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota, United States of America.
PLoS Genet. 2023 Dec 22;19(12):e1011086. doi: 10.1371/journal.pgen.1011086. eCollection 2023 Dec.
Structural differences between genomes are a major source of genetic variation that contributes to phenotypic differences. Transposable elements, mobile genetic sequences capable of increasing their copy number and propagating themselves within genomes, can generate structural variation. However, their repetitive nature makes it difficult to characterize fine-scale differences in their presence at specific positions, limiting our understanding of their impact on genome variation. Domesticated maize is a particularly good system for exploring the impact of transposable element proliferation as over 70% of the genome is annotated as transposable elements. High-quality transposable element annotations were recently generated for de novo genome assemblies of 26 diverse inbred maize lines. We generated base-pair resolved pairwise alignments between the B73 maize reference genome and the remaining 25 inbred maize line assemblies. From this data, we classified transposable elements as either shared or polymorphic in a given pairwise comparison. Our analysis uncovered substantial structural variation between lines, representing both simple and complex connections between TEs and structural variants. Putative insertions in SNP depleted regions, which represent recently diverged identity by state blocks, suggest some TE families may still be active. However, our analysis reveals that within these recently diverged genomic regions, deletions of transposable elements likely account for more structural variation events and base pairs than insertions. These deletions are often large structural variants containing multiple transposable elements. Combined, our results highlight how transposable elements contribute to structural variation and demonstrate that deletion events are a major contributor to genomic differences.
基因组之间的结构差异是遗传变异的主要来源,导致表型差异。转座元件是能够增加其拷贝数并在基因组内自身传播的移动遗传序列,可产生结构变异。然而,它们的重复性质使得难以在特定位置处精细地表征其存在的细微差异,限制了我们对其对基因组变异影响的理解。驯化的玉米是探索转座元件增殖影响的一个特别好的系统,因为超过 70%的基因组被注释为转座元件。最近为 26 个不同自交系玉米品系的从头基因组组装生成了高质量的转座元件注释。我们在 B73 玉米参考基因组和其余 25 个自交系组装之间生成了碱基分辨率的成对比对。根据这些数据,我们将转座元件分类为在给定的成对比较中是共享的还是多态的。我们的分析揭示了品系之间的大量结构变异,代表了 TEs 和结构变体之间的简单和复杂连接。在 SNP 耗尽区域中发现的假定插入,代表了最近分化的状态块,表明一些 TE 家族可能仍然活跃。然而,我们的分析表明,在这些最近分化的基因组区域内,转座元件的缺失可能比插入导致更多的结构变异事件和碱基对。这些缺失通常是包含多个转座元件的大型结构变体。总的来说,我们的结果强调了转座元件如何导致结构变异,并表明缺失事件是基因组差异的主要贡献者。