Suppr超能文献

宿主适应性病原体中的长简单序列重复序列位于编码抗原的基因、管家基因和假基因附近。

Long simple sequence repeats in host-adapted pathogens localize near genes encoding antigens, housekeeping genes, and pseudogenes.

作者信息

Guo Xiangxue, Mrázek Jan

机构信息

Department of Microbiology, University of Georgia, Athens, GA 30602-2605, USA.

出版信息

J Mol Evol. 2008 Nov;67(5):497-509. doi: 10.1007/s00239-008-9166-5. Epub 2008 Oct 17.

Abstract

Simple sequence repeats (SSRs) in DNA sequences are tandem iterations of a single nucleotide or a short oligonucleotide. SSRs are subject to slipped-strand mutations and a common source of phase variation in bacteria and antigenic variation in pathogens. Significantly long SSRs are generally rare in prokaryotic genomes, and long SSRs composed of iterations of mono-, di-, tri-, and tetranucleotides are mostly restricted to host-adapted pathogens. We present new results concerning associations between long SSRs and genes related to different cellular functions in genomes of host-adapted pathogens. We found that in the majority of the analyzed genomes, at least some of the genes associated with SSRs encode potential antigens, which is expected if the primary function of SSRs is their contribution to antigenic variation. However, we also found a number of long SSRs associated with housekeeping genes, including rRNA and tRNA genes, genes encoding ribosomal proteins, amino acyl-tRNA synthetases, chaperones, and important metabolic enzymes. Many of these genes are probably essential and it is unlikely that they are phase-variable. Few statistically significant associations between SSRs and gene functional classifications were detected, suggesting that most long SSRs are not related to a particular cellular function or process. Long SSRs in Mycobacterium leprae are mostly associated with pseudogenes and may be contributing to gene loss following the adaptation to an obligate pathogenic lifestyle. We speculate that LSSRs may have played a similar role in genome reduction of other host-adapted pathogens.

摘要

DNA序列中的简单序列重复(SSRs)是单个核苷酸或短寡核苷酸的串联重复。SSRs易发生滑链突变,是细菌中相位变异和病原体中抗原变异的常见来源。在原核生物基因组中,明显长的SSRs通常很少见,由单核苷酸、二核苷酸、三核苷酸和四核苷酸重复组成的长SSRs大多局限于宿主适应性病原体。我们展示了关于宿主适应性病原体基因组中长SSRs与不同细胞功能相关基因之间关联的新结果。我们发现,在大多数分析的基因组中,至少一些与SSRs相关的基因编码潜在抗原,如果SSRs的主要功能是其对抗原变异的贡献,那么这是可以预期的。然而,我们还发现了许多与管家基因相关的长SSRs,包括rRNA和tRNA基因、编码核糖体蛋白的基因、氨酰-tRNA合成酶、伴侣蛋白和重要的代谢酶。这些基因中的许多可能是必不可少的,而且它们不太可能是相位可变的。在SSRs与基因功能分类之间几乎没有检测到统计学上显著的关联,这表明大多数长SSRs与特定的细胞功能或过程无关。麻风分枝杆菌中的长SSRs大多与假基因相关,可能在适应专性病原体生活方式后导致基因丢失。我们推测,长SSRs可能在其他宿主适应性病原体的基因组缩减中发挥了类似的作用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验