Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
Am J Hum Genet. 2012 Apr 6;90(4):599-613. doi: 10.1016/j.ajhg.2012.02.013.
Recurrent deletions have been associated with numerous diseases and genomic disorders. Few, however, have been resolved at the molecular level because their breakpoints often occur in highly copy-number-polymorphic duplicated sequences. We present an approach that uses a combination of somatic cell hybrids, array comparative genomic hybridization, and the specificity of next-generation sequencing to determine breakpoints that occur within segmental duplications. Applying our technique to the 17q21.31 microdeletion syndrome, we used genome sequencing to determine copy-number-variant breakpoints in three deletion-bearing individuals with molecular resolution. For two cases, we observed breakpoints consistent with nonallelic homologous recombination involving only H2 chromosomal haplotypes, as expected. Molecular resolution revealed that the breakpoints occurred at different locations within a 145 kbp segment of >99% identity and disrupt KANSL1 (previously known as KANSL1). In the remaining case, we found that unequal crossover occurred interchromosomally between the H1 and H2 haplotypes and that this event was mediated by a homologous sequence that was once again missing from the human reference. Interestingly, the breakpoints mapped preferentially to gaps in the current reference genome assembly, which we resolved in this study. Our method provides a strategy for the identification of breakpoints within complex regions of the genome harboring high-identity and copy-number-polymorphic segmental duplication. The approach should become particularly useful as high-quality alternate reference sequences become available and genome sequencing of individuals' DNA becomes more routine.
反复出现的缺失与许多疾病和基因组紊乱有关。然而,由于其断点通常发生在高度拷贝数多态性重复序列中,因此很少有断点在分子水平上得到解决。我们提出了一种方法,该方法结合体细胞杂种、阵列比较基因组杂交和下一代测序的特异性来确定发生在片段重复内的断点。我们将该技术应用于 17q21.31 微缺失综合征,使用基因组测序对三个携带缺失的个体进行了分子分辨率的拷贝数变异断点分析。对于两个病例,我们观察到的断点与非等位同源重组一致,仅涉及 H2 染色体单倍型,这是预期的。分子分辨率显示,断点发生在 145 kbp 以上的同源性为 99%的片段内的不同位置,并破坏 KANSL1(以前称为 KANSL1)。在剩余的病例中,我们发现 H1 和 H2 单倍型之间发生了非均等交叉,并且该事件由一个同源序列介导,该同源序列再次从人类参考序列中缺失。有趣的是,断点优先映射到当前参考基因组组装中的间隙,我们在本研究中解决了这些间隙。我们的方法为鉴定基因组中含有高同源性和拷贝数多态性片段重复的复杂区域内的断点提供了一种策略。随着高质量的替代参考序列的出现以及个体 DNA 的基因组测序变得更加常规,该方法应该会变得特别有用。