State Key Laboratory of Vegetable Biobreeding, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China.
Plant Biotechnol J. 2023 May;21(5):1022-1032. doi: 10.1111/pbi.14015. Epub 2023 Feb 2.
Brassica rapa comprises many important cultivated vegetables and oil crops. However, Chiifu v3.0, the current B. rapa reference genome, still contains hundreds of gaps. Here, we presented a near-complete genome assembly of B. rapa Chiifu v4.0, which was 424.59 Mb with only two gaps, using Oxford Nanopore Technology (ONT) ultralong-read sequencing and Hi-C technologies. The new assembly contains 12 contigs, with a contig N50 of 38.26 Mb. Eight of the ten chromosomes were entirely reconstructed in a single contig from telomere to telomere. We found that the centromeres were mainly invaded by ALE and CRM long terminal repeats (LTRs). Moreover, there is a high divergence of centromere length and sequence among B. rapa genomes. We further found that centromeres are enriched for Copia invaded at 0.14 MYA on average, while pericentromeres are enriched for Gypsy LTRs invaded at 0.51 MYA on average. These results indicated the different invasion mechanisms of LTRs between the two structures. In addition, a novel repetitive sequence PCR630 was identified in the pericentromeres of B. rapa. Overall, the near-complete genome assembly, B. rapa Chiifu v4.0, offers valuable tools for genomic and genetic studies of Brassica species and provides new insights into the evolution of centromeres.
白菜型油菜包含许多重要的栽培蔬菜和油料作物。然而,当前的白菜型油菜参考基因组 Chiifu v3.0 仍然包含数百个缺口。在这里,我们使用牛津纳米孔技术(ONT)超长读测序和 Hi-C 技术,呈现了白菜型油菜 Chiifu v4.0 的近乎完整的基因组组装。新的组装体大小为 424.59 Mb,只有两个缺口,包含 12 个 contigs,其中 contig N50 为 38.26 Mb。十个染色体中的八个从端粒到端粒完全重建在一个单独的 contig 中。我们发现着丝粒主要被 ALE 和 CRM 长末端重复序列(LTRs)侵入。此外,白菜型油菜基因组的着丝粒长度和序列存在高度差异。我们进一步发现着丝粒富含平均在 0.14 MYA 被 Copia 侵入的序列,而着丝粒周围富含平均在 0.51 MYA 被 Gypsy LTRs 侵入的序列。这些结果表明了这两种结构之间 LTRs 的不同侵入机制。此外,在白菜型油菜的着丝粒周围鉴定到了一种新的重复序列 PCR630。总体而言,近乎完整的基因组组装体 Chiifu v4.0 为芸薹属物种的基因组和遗传研究提供了有价值的工具,并为着丝粒的进化提供了新的见解。