Biomedical Informatics Laboratory, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Kanagawa, 259-1193, Japan.
Department of Human Genetics, Yokohama City University Graduate School of Medicine, Yokohama, Kanagawa, 236-0004, Japan.
Sci Rep. 2017 Nov 1;7(1):14789. doi: 10.1038/s41598-017-13712-6.
Subtelomeric macrosatellite repeats are difficult to sequence using conventional sequencing methods owing to the high similarity among repeat units and high GC content. Sequencing these repetitive regions is challenging, even with recent improvements in sequencing technologies. Among these repeats, a haplotype carrying a particular sequence and shortening of the D4Z4 array on human chromosome 4q35 causes one of the most prevalent forms of muscular dystrophy with autosomal-dominant inheritance, facioscapulohumeral muscular dystrophy (FSHD). Here, we applied a nanopore-based ultra-long read sequencer to sequence a BAC clone containing 13 D4Z4 repeats and flanking regions. We successfully obtained the whole D4Z4 repeat sequence, including the pathogenic gene DUX4 in the last D4Z4 repeat. The estimated sequence accuracy of the total repeat region was 99.8% based on a comparison with the reference sequence. Errors were typically observed between purine or between pyrimidine bases. Further, we analyzed the D4Z4 sequence from publicly available ultra-long whole human genome sequencing data obtained by nanopore sequencing. This technology may be a new tool for studying D4Z4 repeats and pathomechanism of FSHD in the future and has the potential to widen our understanding of subtelomeric regions.
端粒上的大片段卫星重复序列由于重复单元之间的高度相似性和高 GC 含量,使用常规测序方法难以进行测序。即使测序技术最近有所改进,对这些重复区域进行测序也是具有挑战性的。在这些重复序列中,一个带有特定序列的单体型和人类 4 号染色体 4q35 上的 D4Z4 阵列缩短导致最常见的常染色体显性遗传的肌肉营养不良症之一,面肩肱型肌营养不良症(FSHD)。在这里,我们应用基于纳米孔的超长读测序仪对包含 13 个 D4Z4 重复和侧翼区域的 BAC 克隆进行测序。我们成功获得了整个 D4Z4 重复序列,包括最后一个 D4Z4 重复中的致病基因 DUX4。基于与参考序列的比较,整个重复区域的估计序列准确性为 99.8%。错误通常发生在嘌呤或嘧啶碱基之间。此外,我们分析了通过纳米孔测序获得的公开可用的超长全人类基因组测序数据中的 D4Z4 序列。这项技术可能成为未来研究 D4Z4 重复和 FSHD 发病机制的新工具,并有可能加深我们对端粒区域的理解。