Centre for Reproduction and Genomics, Department of Anatomy, and Allan Wilson Centre for Molecular Ecology and Evolution, University of Otago, Dunedin, New Zealand.
PLoS One. 2013;8(2):e54710. doi: 10.1371/journal.pone.0054710. Epub 2013 Feb 6.
Tandem repeats are genomic elements that are prone to changes in repeat number and are thus often polymorphic. These sequences are found at a high density at the start of human genes, in the gene's promoter. Increasing empirical evidence suggests that length variation in these tandem repeats can affect gene regulation. One class of tandem repeats, known as microsatellites, rapidly alter in repeat number. Some of the genetic variation induced by microsatellites is known to result in phenotypic variation. Recently, our group developed a novel method for measuring the evolutionary conservation of microsatellites, and with it we discovered that human microsatellites near transcription start sites are often highly conserved. In this study, we examined the properties of microsatellites found in promoters. We found a high density of microsatellites at the start of genes. We showed that microsatellites are statistically associated with promoters using a wavelet analysis, which allowed us to test for associations on multiple scales and to control for other promoter related elements. Because promoter microsatellites tend to be G/C rich, we hypothesized that G/C rich regulatory elements may drive the association between microsatellites and promoters. Our results indicate that CpG islands, G-quadruplexes (G4) and untranslated regulatory regions have highly significant associations with microsatellites, but controlling for these elements in the analysis does not remove the association between microsatellites and promoters. Due to their intrinsic lability and their overlap with predicted functional elements, these results suggest that many promoter microsatellites have the potential to affect human phenotypes by generating mutations in regulatory elements, which may ultimately result in disease. We discuss the potential functions of human promoter microsatellites in this context.
串联重复是容易发生重复数变化的基因组元件,因此通常是多态的。这些序列在人类基因的起始处,即在基因的启动子中高度密集地存在。越来越多的经验证据表明,这些串联重复的长度变化可能会影响基因调控。一类串联重复序列,称为微卫星,其重复数迅速变化。微卫星引起的一些遗传变异已知会导致表型变异。最近,我们小组开发了一种测量微卫星进化保守性的新方法,并用它发现了人类转录起始位点附近的微卫星通常高度保守。在这项研究中,我们研究了启动子中发现的微卫星的特性。我们在基因的起始处发现了高密度的微卫星。我们通过小波分析表明,微卫星与启动子在统计学上存在关联,这使我们能够在多个尺度上测试关联,并控制其他与启动子相关的元件。由于启动子微卫星往往富含 G/C,我们假设富含 G/C 的调控元件可能驱动微卫星与启动子之间的关联。我们的结果表明,CpG 岛、G-四联体 (G4) 和非翻译调控区与微卫星有高度显著的关联,但在分析中控制这些元件并不能消除微卫星与启动子之间的关联。由于其内在的不稳定性及其与预测功能元件的重叠,这些结果表明,许多启动子微卫星有可能通过在调控元件中产生突变来影响人类表型,这可能最终导致疾病。我们在这方面讨论了人类启动子微卫星的潜在功能。