Institute of Molecular Biology and Genetics, Seoul National University, Seoul, Korea 08826.
Department of Biological Sciences, Seoul National University, Seoul, Korea 08826.
Genome Res. 2019 Jun;29(6):1023-1035. doi: 10.1101/gr.246082.118. Epub 2019 May 23.
Long-read sequencing technologies have contributed greatly to comparative genomics among species and can also be applied to study genomics within a species. In this study, to determine how substantial genomic changes are generated and tolerated within a species, we sequenced a strain, CB4856, which is one of the most genetically divergent strains compared to the N2 reference strain. For this comparison, we used the Pacific Biosciences (PacBio) RSII platform (80×, N50 read length 11.8 kb) and generated de novo genome assembly to the level of pseudochromosomes containing 76 contigs (N50 contig = 2.8 Mb). We identified structural variations that affected as many as 2694 genes, most of which are at chromosome arms. Subtelomeric regions contained the most extensive genomic rearrangements, which even created new subtelomeres in some cases. The subtelomere structure of Chromosome VR implies that ancestral telomere damage was repaired by alternative lengthening of telomeres even in the presence of a functional telomerase gene and that a new subtelomere was formed by break-induced replication. Our study demonstrates that substantial genomic changes including structural variations and new subtelomeres can be tolerated within a species, and that these changes may accumulate genetic diversity within a species.
长读测序技术极大地促进了物种间的比较基因组学研究,也可应用于研究物种内的基因组学。在这项研究中,为了确定在一个物种内产生和容忍大量基因组变化的机制,我们对一个与 N2 参考菌株相比遗传差异最大的菌株 CB4856 进行了测序。为了进行比较,我们使用了 Pacific Biosciences (PacBio) RSII 平台(80×,N50 读长 11.8 kb),并生成了从头组装到假染色体水平的基因组,包含 76 个 contigs(N50 contig = 2.8 Mb)。我们鉴定了影响多达 2694 个基因的结构变异,其中大多数位于染色体臂上。端粒周围区域包含最广泛的基因组重排,在某些情况下甚至创建了新的端粒周围区域。染色体 VR 的端粒结构表明,即使存在功能正常的端粒酶基因,端粒损伤也可以通过端粒的非经典延长来修复,并且新的端粒周围区域是通过断裂诱导复制形成的。我们的研究表明,包括结构变异和新的端粒周围区域在内的大量基因组变化可以在一个物种内被容忍,并且这些变化可能在物种内积累遗传多样性。