Department of Medical Genetics, Zhongshan School of Medicine and Center for Genome Research, Sun Yat-Sen University, Guangzhou, 510080, China.
Hum Genet. 2015 Jun;134(6):589-603. doi: 10.1007/s00439-015-1539-4. Epub 2015 Mar 20.
Copy number variations (CNVs) have increasingly been reported to cause, or predispose to, human disease. However, a large fraction of these CNVs have not been accurately characterized at the single-base-pair level, thereby hampering a better understanding of the mutational mechanisms underlying CNV formation. Here, employing a composite pipeline method derived from various inference-based programs, we have characterized 26 deletion CNVs [including three novel pathogenic CNVs involving an autosomal gene (EXT2) causing hereditary osteochondromas and an X-linked gene (CLCN5) causing Dent disease, as well as 23 CNVs previously identified by inference from a cohort of Canadian autism spectrum disorder families] to the single-base-pair level of accuracy from whole-genome sequencing data. We found that breakpoint-flanking micro-mutations (within 22 bp of the breakpoint) are present in a significant fraction (5/26; 19%) of the deletion CNVs. This analysis also provided evidence that a recently described error-prone form of DNA repair (i.e., repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome) not only causes human genetic disease but also impacts on human genome evolution. Our findings illustrate the importance of precise CNV breakpoint delineation for understanding the underlying mutational mechanisms and have implications for primer design in relation to the detection of deletion CNVs in clinical diagnosis.
拷贝数变异 (CNVs) 越来越多地被报道导致或易患人类疾病。然而,这些 CNVs 的很大一部分尚未在单碱基对水平上进行准确描述,从而阻碍了对 CNV 形成的突变机制的更好理解。在这里,我们采用源自各种基于推理的程序的组合流水线方法,将 26 个缺失 CNV [包括三个新的致病性 CNV,涉及导致遗传性骨软骨瘤的常染色体基因 (EXT2) 和导致 Dent 病的 X 连锁基因 (CLCN5),以及先前从加拿大自闭症谱系障碍家系队列中推断出的 23 个 CNV] 准确地表征到单碱基对水平,使用全基因组测序数据。我们发现,在很大一部分(5/26;19%)缺失 CNV 中存在断裂点侧翼微突变(断裂点附近 22 个碱基内)。该分析还提供了证据,表明最近描述的一种易错 DNA 修复形式(即通过来自基因组远端区域的模板核苷酸序列插入修复 DNA 双链断裂)不仅导致人类遗传疾病,而且影响人类基因组进化。我们的发现说明了精确的 CNV 断点描绘对于理解潜在的突变机制的重要性,并对临床诊断中缺失 CNV 的检测的引物设计具有启示意义。