Ball Edward V, Stenson Peter D, Abeysinghe Shaun S, Krawczak Michael, Cooper David N, Chuzhanova Nadia A
Institute of Medical Genetics, Cardiff University, Cardiff, United Kingdom.
Hum Mutat. 2005 Sep;26(3):205-13. doi: 10.1002/humu.20212.
In the Human Gene Mutation Database (www.hgmd.org), microdeletions and microinsertions causing inherited disease (both defined as involving < or = 20 bp of DNA) account for 8,399 (17%) and 3,345 (7%) logged mutations, in 940 and 668 genes, respectively. A positive correlation was noted between the microdeletion and microinsertion frequencies for 564 genes for which both microdeletions and microinsertions are reported in HGMD, consistent with the view that the propensity of a given gene/sequence to undergo microdeletion is related to its propensity to undergo microinsertion. While microdeletions and microinsertions of 1 bp constitute respectively 48% and 66% of the corresponding totals, the relative frequency of the remaining lesions correlates negatively with the length of the DNA sequence deleted or inserted. Many of the microdeletions and microinsertions of more than 1 bp are potentially explicable in terms of slippage mutagenesis, involving the addition or removal of one copy of a mono-, di-, or trinucleotide tandem repeat. The frequency of in-frame 3-bp and 6-bp microinsertions and microdeletions was, however, found to be significantly lower than that of mutations of other lengths, suggesting that some of these in-frame lesions may not have come to clinical attention. Various sequence motifs were found to be over-represented in the vicinity of both microinsertions and microdeletions, including the heptanucleotide CCCCCTG that shares homology with the complement of the 8-bp human minisatellite conserved sequence/chi-like element (GCWGGWGG). The previously reported indel hotspot GTAAGT and its complement ACTTAC were also found to be overrepresented in the vicinity of both microinsertions and microdeletions, thereby providing a first example of a mutational hotspot that is common to different types of gene lesion. Other motifs overrepresented in the vicinity of microdeletions and microinsertions included DNA polymerase pause sites and topoisomerase cleavage sites. Several novel microdeletion/microinsertion hotspots were noted and some of these exhibited sufficient similarity to one another to justify terming them "super-hotspot" motifs. Analysis of sequence complexity also demonstrated that a combination of slipped mispairing mediated by direct repeats, and secondary structure formation promoted by symmetric elements, can account for the majority of microdeletions and microinsertions. Thus, microinsertions and microdeletions exhibit strong similarities in terms of the characteristics of their flanking DNA sequences, implying that they are generated by very similar underlying mechanisms.
在人类基因突变数据库(www.hgmd.org)中,导致遗传性疾病的微缺失和微插入(均定义为涉及≤20 bp的DNA)分别占940个和668个基因中已记录突变的8399个(17%)和3345个(7%)。对于在HGMD中同时报告了微缺失和微插入的564个基因,微缺失和微插入频率之间存在正相关,这与以下观点一致,即给定基因/序列发生微缺失的倾向与其发生微插入的倾向相关。虽然1 bp的微缺失和微插入分别占相应总数的48%和66%,但其余病变的相对频率与缺失或插入的DNA序列长度呈负相关。许多大于1 bp的微缺失和微插入可能可以用滑动错配诱变来解释,涉及单核苷酸、二核苷酸或三核苷酸串联重复的一个拷贝的添加或去除。然而,发现框内3 bp和6 bp的微插入和微缺失频率显著低于其他长度的突变频率,这表明其中一些框内病变可能尚未引起临床关注。在微插入和微缺失附近发现各种序列基序过度存在,包括与8 bp人类小卫星保守序列/chi样元件(GCWGGWGG)的互补序列具有同源性的七核苷酸CCCCCTG。先前报道的插入缺失热点GTAAGT及其互补序列ACTTAC在微插入和微缺失附近也过度存在,从而提供了一个不同类型基因病变共有的突变热点的首个例子。在微缺失和微插入附近过度存在的其他基序包括DNA聚合酶暂停位点和拓扑异构酶切割位点。注意到几个新的微缺失/微插入热点,其中一些彼此表现出足够的相似性,有理由将它们称为“超级热点”基序。对序列复杂性的分析还表明,由直接重复介导的滑动错配和对称元件促进的二级结构形成的组合,可以解释大多数微缺失和微插入。因此,微插入和微缺失在其侧翼DNA序列特征方面表现出很强的相似性,这意味着它们是由非常相似的潜在机制产生的。