Eichler E E, Hammond H A, Macpherson J N, Ward P A, Nelson D L
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
Hum Mol Genet. 1995 Dec;4(12):2199-208. doi: 10.1093/hmg/4.12.2199.
Both sequence length and sequence content are important parameters in determining stability of the fragile X syndrome CGG repeat. In order to estimate the incidence of uninterrupted CGG repeats in the general population and to gain insight into mechanisms responsible for the loss and acquisition of AGG interruptions, 406 randomly selected FMR1 CGG repeat alleles from four broad ethnic groups were assayed for AGG punctuation. Among the 79 different classes of alleles detected, long uninterrupted tracts of pure repeats were rare in the general population, with only 1/406 or 0.25% found at the instability threshold (34-37 pure CGG repeats). There was no significant difference (P>0.05) in the distribution of alleles with long (>20) pure repeat tracts among the different ethnic groups, suggesting that different ethnic groups should be equally susceptible to the development of the disease. Analysis of an additional 43 alleles with total repeat lengths between 35 and 50 repeats, revealed that highly interrupted CGG repeats alleles (>2 AGG interruptions) occur preferentially at modal repeat lengths in the population, providing confirmatory evidence that the presence of AGG interruptions confers stability. A consideration of length variation of the most 3' tract of pure repeats revealed a bimodal distribution pattern with maxima at approximately 10 and 20 repeats. Only unimodal distributions with maxima 9 or 10 were observed for the 5' tract and middle CGG tract within the FMR1 CGG repeat substructure. These results suggest that the loss of the most 3' AGG interruption or its conversion to CGG is a common event in the human population, occurring by a mechanism which preserves overall repeat length. This bias for loss of the distal-most AGG interruption likely plays an important part in predisposing human alleles to the development of the X syndrome.
序列长度和序列内容都是决定脆性X综合征CGG重复序列稳定性的重要参数。为了估计普通人群中不间断CGG重复序列的发生率,并深入了解导致AGG中断缺失和获得的机制,对从四个主要种族群体中随机选择的406个FMR1 CGG重复等位基因进行了AGG标点分析。在检测到的79种不同类型的等位基因中,普通人群中长的不间断纯重复序列片段很少见,在不稳定阈值(34 - 37个纯CGG重复序列)时仅发现1/406或0.25%。不同种族群体中具有长(>20)纯重复序列片段的等位基因分布没有显著差异(P>0.05),这表明不同种族群体对该疾病的易感性应该是相同的。对另外43个总重复长度在35至50个重复序列之间的等位基因进行分析,发现高度中断的CGG重复等位基因(>2个AGG中断)在人群的模式重复长度处优先出现,这提供了确凿证据,证明AGG中断的存在赋予了稳定性。对最3'端纯重复序列片段长度变化的研究揭示了一种双峰分布模式,峰值分别在大约10和20个重复序列处。在FMR1 CGG重复子结构内,5'端片段和中间CGG片段仅观察到最大值为9或10的单峰分布。这些结果表明,最3'端AGG中断的缺失或其转化为CGG在人群中是常见事件,其发生机制保持了整体重复长度。最远端AGG中断缺失的这种偏向性可能在使人类等位基因易患X综合征方面起着重要作用。