Krawczak M, Ball E V, Cooper D N
Institute of Medical Genetics, University of Wales College of Medicine,Cardiff CF4 4XN, United Kingdom.
Am J Hum Genet. 1998 Aug;63(2):474-88. doi: 10.1086/301965.
The spectrum of single-base-pair substitutions logged in The Human Gene Mutation Database (HGMD), comprising 7,271 different lesions in the coding regions of 547 different human genes, was analyzed for nearest-neighbor effects on relative mutation rates. Owing to its retrospective nature, HGMD allows mutation rates to be estimated only in relative terms. Therefore, a novel methodology was devised in order to obtain these estimates in iterative fashion, correcting, at the same time, for the confounding effects of differential codon usage and for the fact that different types of amino acid replacement come to clinical attention with different probabilities. Over and above the hypermutability of CpG dinucleotides, reflected in transition rates five times the base mutation rate, only a subtle and locally confined influence of the surrounding DNA sequence on relative single-base-pair substitution rates was observed, which extended no farther than 2 bp from the substitution site. A disparity between the two DNA strands was evidenced by the fact that, when substitution rates were estimated conditional on the 5' and 3' flanking nucleotides, a significant rate difference emerged for 10 of 96 possible pairs of complementary substitutional events. Mutational bias, favoring substitutions toward flanking bases, a phenomenon reminiscent of misalignment mutagenesis, was apparent and exhibited both directionality and reading-frame sensitivity. No specific preponderance of repeat-sequence motifs was observed in the vicinity of nucleotide substitutions, but a moderate correlation between the relative mutability and thermodynamic stability of DNA triplets emerged, suggesting either inefficient DNA replication in regions of high stability or the transient stabilization of misaligned intermediates.
对登录在人类基因突变数据库(HGMD)中的单碱基对替换谱进行了分析,该数据库包含547个不同人类基因编码区的7271种不同损伤,以研究相邻碱基对相对突变率的影响。由于HGMD的回顾性特点,其只能以相对方式估计突变率。因此,设计了一种新方法来以迭代方式获得这些估计值,同时校正密码子使用差异的混杂效应以及不同类型氨基酸替换以不同概率引起临床关注这一事实。除了CpG二核苷酸的高突变性(其转换率是碱基突变率的五倍)外,仅观察到周围DNA序列对相对单碱基对替换率有细微且局部受限的影响,这种影响从替换位点延伸不超过2个碱基对。两条DNA链之间的差异通过以下事实得到证明:当根据5'和3'侧翼核苷酸估计替换率时,96对可能的互补替换事件中有10对出现了显著的率差异。倾向于向侧翼碱基替换的突变偏好很明显,这一现象让人联想到错配诱变,并且表现出方向性和阅读框敏感性。在核苷酸替换附近未观察到重复序列基序有特定优势,但DNA三联体的相对突变性与热力学稳定性之间出现了适度的相关性,这表明要么在高稳定性区域DNA复制效率低下,要么错配中间体存在短暂稳定化。