Rusinov Ivan, Ershova Anna, Karyagina Anna, Spirin Sergey, Alexeevski Andrei
Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119992, Russia.
Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119992, Russia.
BMC Genomics. 2015 Dec 21;16:1084. doi: 10.1186/s12864-015-2288-4.
Avoidance of palindromic recognition sites of Type II restriction-modification (R-M) systems was shown for many R-M systems in dozens of prokaryotic genomes. However the phenomenon has not been investigated systematically for all presently available genomes and annotated R-M systems. We have studied all known recognition sites in thousands of prokaryotic genomes and found factors that influence their avoidance.
Only Type II R-M systems consisting of independently acting endonuclease and methyltransferase (called 'orthodox' here) cause avoidance of their sites, both palindromic and asymmetric, in corresponding prokaryotic genomes; the avoidance takes place for ~ 50 % of 1774 studied cases. It is known that prokaryotes can acquire and lose R-M systems. Thus it is possible to talk about the lifespan of an R-M system in a genome. We have shown that the recognition site avoidance correlates with the lifespan of R-M systems. The sites of orthodox R-M systems that are encoded in host genomes for a long time are avoided more often (up to 100 % in certain cohorts) than the sites of recently acquired ones. We also found cases of site avoidance in absence of the corresponding R-M systems in the genome. An analysis of closely related bacteria shows that such avoidance can be a trace of lost R-M systems. Sites of Type I, IIС/G, IIM, III, and IV R-M systems are not avoided in vast majority of cases.
The avoidance of orthodox Type II R-M system recognition sites in prokaryotic genomes is a widespread phenomenon. Presence of an R-M system without an underrepresentation of its site may indicate that the R-M system was acquired recently. At the same time, a significant underrepresentation of a site may be a sign of presence of the corresponding R-M system in this organism or in its ancestors for a long time. The drastic difference between site avoidance for orthodox Type II R-M systems and R-M systems of other types can be explained by a higher rate of specificity changes or a less self-toxicity of the latter.
在数十个原核生物基因组中,许多II型限制-修饰(R-M)系统都表现出对回文识别位点的回避。然而,尚未针对所有现有基因组和已注释的R-M系统对这一现象进行系统研究。我们研究了数千个原核生物基因组中所有已知的识别位点,并发现了影响其回避的因素。
只有由独立作用的内切核酸酶和甲基转移酶组成的II型R-M系统(此处称为“正统”)会导致其在相应原核生物基因组中的位点(包括回文和不对称位点)被回避;在1774个研究案例中,约50%出现了这种回避情况。已知原核生物可以获得和失去R-M系统。因此,可以讨论基因组中R-M系统的寿命。我们已经表明,识别位点回避与R-M系统的寿命相关。在宿主基因组中编码时间较长的正统R-M系统的位点,比最近获得的系统的位点更常被回避(在某些群体中高达100%)。我们还发现了基因组中不存在相应R-M系统时的位点回避情况。对密切相关细菌的分析表明,这种回避可能是已丢失R-M系统的痕迹。在绝大多数情况下,I型、IIС/G、IIM、III型和IV型R-M系统的位点不会被回避。
原核生物基因组中对正统II型R-M系统识别位点的回避是一种普遍现象。存在一个其位点没有明显低丰度的R-M系统,可能表明该R-M系统是最近获得的。同时,一个位点的显著低丰度可能表明该生物体或其祖先在很长一段时间内存在相应的R-M系统。正统II型R-M系统与其他类型R-M系统在位点回避上的巨大差异,可以通过后者更高的特异性变化率或更低的自身毒性来解释。