Bat O, Kimmel M, Axelrod D E
Molecular Biology Computational Resources, Baylor College of Medicine, Houston, TX 77030, USA.
J Theor Biol. 1997 Sep 7;188(1):53-67. doi: 10.1006/jtbi.1997.0451.
The expansion of DNA triplet repeats has been shown to be responsible for about a dozen hereditary diseases. In this paper we are concerned with a computer model of such expansion, applied to the fragile X syndrome and Huntington's disease, for which enough quantitative data have been collected. The nucleotide sequence associated with the fragile X consists of CGG repeats and is located inside the FMR1 gene. In normal individuals there is a variable number of triplet repeats less than 60; in asymptomatic carriers the number of repeats is 60-200 (premutation). From the premutation range, the number of triplet repeats can increase within one generation to more than 200 producing affected individuals. In Huntington's disease the CAG repeats are located inside the HD gene. In normal individuals the number of repeats varies from around 11, up to 34. In the intermediate range (34-37 repeats), the mutability is increased, frequently leading to alleles of more than 37 repeats, and the disease phenotype. The rapid increase of the number of triplet repeats in affected individuals has been proposed to be due to the formation of folded DNA structures (hairpins) and their repair or misrepair. In order to determine if this proposed mechanism is adequate to account for the rapid increase of repeats and the large number of repeats in affected individuals we developed a mathematical model that includes the known mechanisms of hairpin formation, and strand synthesis and repair. Simulations based on the model using realistic probabilities of hairpin formation produced results that corresponded with the observed range of repeats and transition probabilities from normal to affected individuals. Similar modelling has been published for the Huntington's disease data. However, in this paper we demonstrate that a uniform approach works for fragile X and Huntington's disease, although the detailed assumptions of the model have to be different. These difference provide insight into the mechanisms of expansion in both cases. Among these insights is that an apparent threshold in the number of repeats for rapid expansion, and the preference for expansion over contraction, may be accounted for by relative probabilities of hairpin formation, replication, slippage and repair.
DNA三联体重复序列的扩增已被证明是约十二种遗传性疾病的病因。在本文中,我们关注这样一种扩增的计算机模型,该模型应用于脆性X综合征和亨廷顿舞蹈病,针对这两种疾病已经收集了足够的定量数据。与脆性X相关的核苷酸序列由CGG重复序列组成,位于FMR1基因内部。在正常个体中,三联体重复序列的数量可变,少于60个;在无症状携带者中,重复序列的数量为60 - 200个(前突变)。从前突变范围开始,三联体重复序列的数量可在一代内增加到200多个,从而产生患病个体。在亨廷顿舞蹈病中,CAG重复序列位于HD基因内部。在正常个体中,重复序列的数量约从11个到34个不等。在中间范围(34 - 37个重复序列),突变性增加,经常导致超过37个重复序列的等位基因以及疾病表型。已提出患病个体中三联体重复序列数量的快速增加是由于折叠DNA结构(发夹)的形成及其修复或错误修复。为了确定这种提出的机制是否足以解释重复序列的快速增加以及患病个体中大量的重复序列,我们开发了一个数学模型,该模型包括发夹形成、链合成和修复的已知机制。基于该模型使用现实的发夹形成概率进行的模拟产生了与观察到的重复序列范围以及从正常个体到患病个体的转变概率相对应的结果。针对亨廷顿舞蹈病数据也发表了类似的模型。然而,在本文中我们证明,尽管模型的详细假设必须不同,但一种统一的方法适用于脆性X综合征和亨廷顿舞蹈病。这些差异为这两种情况下的扩增机制提供了见解。其中的见解包括,快速扩增的重复序列数量中明显的阈值以及扩增优于收缩的偏好,可能由发夹形成、复制、滑动和修复的相对概率来解释。