Departments of Biochemistry, Molecular Biology and Medical Genetics, Cumming School of Medicine, University of Calgary, Calgary, AB, T2N 4N1, Canada.
Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, T2N 4N1, Canada.
Sci Rep. 2021 Sep 14;11(1):18258. doi: 10.1038/s41598-021-97764-9.
Genomic rearrangements cause congenital disorders, cancer, and complex diseases in human. Yet, they are still understudied in rare diseases because their detection is challenging, despite the advent of whole genome sequencing (WGS) technologies. Short-read (srWGS) and long-read WGS approaches are regularly compared, and the latter is commonly recommended in studies focusing on genomic rearrangements. However, srWGS is currently the most economical, accurate, and widely supported technology. In Caenorhabditis elegans (C. elegans), such variants, induced by various mutagenesis processes, have been used for decades to balance large genomic regions by preventing chromosomal crossover events and allowing the maintenance of lethal mutations. Interestingly, those chromosomal rearrangements have rarely been characterized on a molecular level. To evaluate the ability of srWGS to detect various types of complex genomic rearrangements, we sequenced three balancer strains using short-read Illumina technology. As we experimentally validated the breakpoints uncovered by srWGS, we showed that, by combining several types of analyses, srWGS enables the detection of a reciprocal translocation (eT1), a free duplication (sDp3), a large deletion (sC4), and chromoanagenesis events. Thus, applying srWGS to decipher real complex genomic rearrangements in model organisms may help designing efficient bioinformatics pipelines with systematic detection of complex rearrangements in human genomes.
基因组重排会导致人类先天疾病、癌症和复杂疾病。然而,由于其检测具有挑战性,尽管全基因组测序 (WGS) 技术已经问世,但它们在罕见疾病中的研究仍相对较少。短读长 (srWGS) 和长读长 WGS 方法经常被比较,而后者在关注基因组重排的研究中通常被推荐。然而,srWGS 目前是最经济、最准确和最广泛支持的技术。在秀丽隐杆线虫 (C. elegans) 中,通过各种诱变过程诱导的此类变体已被使用了几十年,通过防止染色体交叉事件并允许维持致死突变,来平衡大的基因组区域。有趣的是,这些染色体重排很少在分子水平上进行表征。为了评估 srWGS 检测各种类型复杂基因组重排的能力,我们使用短读长 Illumina 技术对三个平衡器菌株进行了测序。当我们通过实验验证了 srWGS 揭示的断点时,我们表明,通过结合多种类型的分析,srWGS 能够检测到相互易位 (eT1)、自由重复 (sDp3)、大片段缺失 (sC4) 和染色体形成事件。因此,在模式生物中应用 srWGS 来解析真实的复杂基因组重排可能有助于设计高效的生物信息学管道,系统地检测人类基因组中的复杂重排。