Guo Xiangxue, Mrázek Jan
Department of Microbiology, University of Georgia, Athens, GA 30602-2605, USA.
J Mol Evol. 2008 Nov;67(5):497-509. doi: 10.1007/s00239-008-9166-5. Epub 2008 Oct 17.
Simple sequence repeats (SSRs) in DNA sequences are tandem iterations of a single nucleotide or a short oligonucleotide. SSRs are subject to slipped-strand mutations and a common source of phase variation in bacteria and antigenic variation in pathogens. Significantly long SSRs are generally rare in prokaryotic genomes, and long SSRs composed of iterations of mono-, di-, tri-, and tetranucleotides are mostly restricted to host-adapted pathogens. We present new results concerning associations between long SSRs and genes related to different cellular functions in genomes of host-adapted pathogens. We found that in the majority of the analyzed genomes, at least some of the genes associated with SSRs encode potential antigens, which is expected if the primary function of SSRs is their contribution to antigenic variation. However, we also found a number of long SSRs associated with housekeeping genes, including rRNA and tRNA genes, genes encoding ribosomal proteins, amino acyl-tRNA synthetases, chaperones, and important metabolic enzymes. Many of these genes are probably essential and it is unlikely that they are phase-variable. Few statistically significant associations between SSRs and gene functional classifications were detected, suggesting that most long SSRs are not related to a particular cellular function or process. Long SSRs in Mycobacterium leprae are mostly associated with pseudogenes and may be contributing to gene loss following the adaptation to an obligate pathogenic lifestyle. We speculate that LSSRs may have played a similar role in genome reduction of other host-adapted pathogens.
DNA序列中的简单序列重复(SSRs)是单个核苷酸或短寡核苷酸的串联重复。SSRs易发生滑链突变,是细菌中相位变异和病原体中抗原变异的常见来源。在原核生物基因组中,明显长的SSRs通常很少见,由单核苷酸、二核苷酸、三核苷酸和四核苷酸重复组成的长SSRs大多局限于宿主适应性病原体。我们展示了关于宿主适应性病原体基因组中长SSRs与不同细胞功能相关基因之间关联的新结果。我们发现,在大多数分析的基因组中,至少一些与SSRs相关的基因编码潜在抗原,如果SSRs的主要功能是其对抗原变异的贡献,那么这是可以预期的。然而,我们还发现了许多与管家基因相关的长SSRs,包括rRNA和tRNA基因、编码核糖体蛋白的基因、氨酰-tRNA合成酶、伴侣蛋白和重要的代谢酶。这些基因中的许多可能是必不可少的,而且它们不太可能是相位可变的。在SSRs与基因功能分类之间几乎没有检测到统计学上显著的关联,这表明大多数长SSRs与特定的细胞功能或过程无关。麻风分枝杆菌中的长SSRs大多与假基因相关,可能在适应专性病原体生活方式后导致基因丢失。我们推测,长SSRs可能在其他宿主适应性病原体的基因组缩减中发挥了类似的作用。