Krawczak M, Cooper D N
Institut für Humangenetik der Universität, Göttingen, Federal Republic of Germany.
Hum Genet. 1991 Mar;86(5):425-41. doi: 10.1007/BF00194629.
Reports describing short (less than 20 bp) gene deletions causing human genetic disease were collated in order to study underlying causative mechanisms. Deletion breakpoint junction regions were found to be non-random both at the nucleotide and dinucleotide sequence levels, an observation consistent with an endogenous sequence-directed mechanism of mutagenesis. Direct repeats of between 2bp and 8bp were found in the immediate vicinity of all but one of the 60 deletions analysed. Direct repeats are a feature of a number of recombination, replication or repair-based models of deletion mutagenesis and the possible contribution of each to the spectrum of mutations examined was assessed. The influence of parameters such as repeat length and length of DNA between repeats was studied in relation to the frequency, location and extent of these deletions. Findings were broadly consistent with a slipped mispairing model but the predicted deletion of one whole repeat copy was found only rarely. A modified version of the slipped mispairing hypothesis was therefore proposed and was shown to possess considerable explanatory value for approximately 25% of deletions examined. Whereas the frequency of inverted repeats in the vicinity of gene deletions was not significantly elevated, these elements may nevertheless promote instability by facilitating the formation of secondary structure intermediates. A significant excess of symmetrical sequence elements was however found at sites of single base deletions. A new model to explain the involvement of symmetric elements in frameshift mutagenesis was devised, which successfully accounted for a majority of the single base deletions examined. In general, the loss of one or a few base pairs of DNA was found to be more compatible with a replication-based model of mutagenesis than with a recombination or repair hypothesis. Seven hitherto unrecognized hotspots for deletion were noted in five genes (AT3, F8, HBA, HBB and HPRT). Considerable sequence homology was found between these different sites, and a consensus sequence (TGA/GA/GG/TA/C) was drawn up. Sequences fitting this consensus (i) were noted in the immediate vicinity of 41% of the other (sporadic) gene deletions, (ii) were found frequently at sites of spontaneous deletion in the hamster APRT gene, (iii) were found to be associated with many larger human gene deletions/translocations, (iv) act as arrest sites for human polymerase alpha during DNA replication and (v) have been shown by in vitro studies of human polymerase alpha to be especially prone to frameshift mutation.(ABSTRACT TRUNCATED AT 400 WORDS)
为了研究潜在的致病机制,我们整理了描述导致人类遗传病的短(小于20个碱基对)基因缺失的报告。发现在核苷酸和二核苷酸序列水平上,缺失断点连接区域是非随机的,这一观察结果与诱变的内源性序列导向机制一致。在所分析的60个缺失中,除了一个之外,在所有缺失的紧邻区域都发现了2至8个碱基对的直接重复序列。直接重复序列是许多基于重组、复制或修复的缺失诱变模型的一个特征,并评估了每种模型对所检测的突变谱的可能贡献。研究了诸如重复长度和重复之间的DNA长度等参数对这些缺失的频率、位置和范围的影响。研究结果大致与滑动错配模型一致,但仅很少发现预测的一个完整重复拷贝的缺失。因此,提出了滑动错配假说的一个修改版本,并且该版本对于大约25%所检测的缺失具有相当大的解释价值。虽然基因缺失附近的反向重复序列频率没有显著升高,但这些元件可能通过促进二级结构中间体的形成而促进不稳定性。然而,在单碱基缺失位点发现了大量对称序列元件。设计了一个新模型来解释对称元件在移码诱变中的作用,该模型成功地解释了所检测的大多数单碱基缺失。一般来说,发现一个或几个DNA碱基对的缺失与基于复制的诱变模型比与重组或修复假说更相符。在五个基因(AT3、F8、HBA、HBB和HPRT)中发现了七个迄今未被识别的缺失热点。在这些不同位点之间发现了相当程度的序列同源性,并拟定了一个共有序列(TGA/GA/GG/TA/C)。符合该共有序列的序列:(i)在41%的其他(散发性)基因缺失紧邻区域被发现;(ii)在仓鼠APRT基因的自发缺失位点频繁出现;(iii)与许多更大的人类基因缺失/易位相关;(iv)在DNA复制过程中作为人类聚合酶α的终止位点;(v)通过对人类聚合酶α的体外研究表明特别容易发生移码突变。(摘要截断于400字)