Mrázek Jan, Guo Xiangxue, Shah Apurva
Department of Microbiology, University of Georgia, Athens, GA 30602, USA.
Proc Natl Acad Sci U S A. 2007 May 15;104(20):8472-7. doi: 10.1073/pnas.0702412104. Epub 2007 May 7.
Simple sequence repeats (SSRs) in DNA sequences are composed of tandem iterations of short oligonucleotides and may have functional and/or structural properties that distinguish them from general DNA sequences. They are variable in length because of slip-strand mutations and may also affect local structure of the DNA molecule or the encoded proteins. Long SSRs (LSSRs) are common in eukaryotes but rare in most prokaryotes. In pathogens, SSRs can enhance antigenic variance of the pathogen population in a strategy that counteracts the host immune response. We analyze representations of SSRs in >300 prokaryotic genomes and report significant differences among different prokaryotes as well as among different types of SSRs. LSSRs composed of short oligonucleotides (1-4 bp length, designated LSSR(1-4)) are often found in host-adapted pathogens with reduced genomes that are not known to readily survive in a natural environment outside the host. In contrast, LSSRs composed of longer oligonucleotides (5-11 bp length, designated LSSR(5-11)) are found mostly in nonpathogens and opportunistic pathogens with large genomes. Comparisons among SSRs of different lengths suggest that LSSR(1-4) are likely maintained by selection. This is consistent with the established role of some LSSR(1-4) in enhancing antigenic variance. By contrast, abundance of LSSR(5-11) in some genomes may reflect the SSRs' general tendency to expand rather than their specific role in the organisms' physiology. Differences among genomes in terms of SSR representations and their possible interpretations are discussed.
DNA序列中的简单序列重复(SSR)由短寡核苷酸的串联重复组成,可能具有使其有别于一般DNA序列的功能和/或结构特性。由于滑链突变,它们的长度可变,还可能影响DNA分子或编码蛋白质的局部结构。长SSR(LSSR)在真核生物中很常见,但在大多数原核生物中很少见。在病原体中,SSR可通过一种对抗宿主免疫反应的策略增强病原体群体的抗原变异。我们分析了300多个原核生物基因组中SSR的表现形式,并报告了不同原核生物之间以及不同类型SSR之间的显著差异。由短寡核苷酸(长度为1 - 4个碱基对,称为LSSR(1 - 4))组成的LSSR通常存在于基因组缩小的宿主适应性病原体中,这些病原体在宿主之外的自然环境中难以存活。相比之下,由较长寡核苷酸(长度为5 - 11个碱基对,称为LSSR(5 - 11))组成的LSSR大多存在于具有大基因组的非病原体和机会性病原体中。不同长度SSR之间的比较表明,LSSR(1 - 4)可能是通过选择得以维持。这与一些LSSR(1 - 4)在增强抗原变异方面已确立的作用相一致。相比之下,某些基因组中LSSR(5 - 11)的丰度可能反映了SSR普遍的扩增趋势,而非它们在生物体生理过程中的特定作用。本文讨论了基因组在SSR表现形式方面的差异及其可能的解释。