Kojima Kenji K, Kanehisa Minoru
Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan.
Mol Biol Evol. 2008 Jul;25(7):1395-404. doi: 10.1093/molbev/msn081. Epub 2008 Apr 7.
Retroelements, elements encoding reverse transcriptase (RT), are ubiquitous in eukaryotes and have a great influence on the evolution of our genome. Detailed information is available on eukaryotic retroelements; however, prokaryotic retroelements are poorly understood. Recently, new types of eukaryotic retroelements were characterized on the basis of their gene composition and their phylogenetic positions. Here we performed a systematic survey to identify novel types of prokaryotic retroelements by analyzing gene neighborhood and protein architecture. We found novel types of gene combination and examined whether they represent actual retroelements. Five monophyletic groups were identified that were distinct from characterized prokaryotic retroelements, showed specific gene combination, were distributed patchily, and included at least 1 example of recent integration. These results strongly indicated the frequent horizontal transfer of these elements. One group encoded DNA polymerase A. A possible function of DNA polymerase A in the life cycle of retroelements is catalyzing second-strand cDNA synthesis, which is DNA polymerization performed using a DNA template not an RNA template. Another group encoded both bacterial primase and carbon-nitrogen hydrolase. Primase is likely to synthesize primers to initiate reverse transcription. Two other groups also encoded carbon-nitrogen hydrolase as a fusion protein with RT. It is difficult to speculate on the function of hydrolase in the life cycle of retroelements. The last group encoded dual RT proteins, which are likely to form heterodimers during replication. The protein sets of these 5 groups of prokaryotic retroelements were completely different from those of eukaryotic retroelements, indicating that the survival constraints of prokaryotic elements were distinct from those of eukaryotic elements. It is likely that these prokaryotic retroelements are maintained as extrachromosomal DNA or RNA or are accidentally integrated into genomes. Our findings presented the possibility that many types of extrachromosomal prokaryotic retroelements remain to be characterized. In addition, we found 8 RT genes were associated with clustered regularly interspaced short palindrome repeats (CRISPRs) of the CRISPR-Cas system. These RT genes are likely to work in immunity against RNA phages via cDNA synthesis.
逆转录元件,即编码逆转录酶(RT)的元件,在真核生物中普遍存在,对我们基因组的进化有重大影响。关于真核生物逆转录元件有详细信息;然而,原核生物逆转录元件却了解甚少。最近,基于其基因组成和系统发育位置鉴定出了新型真核生物逆转录元件。在此,我们通过分析基因邻域和蛋白质结构进行了一项系统调查,以鉴定新型原核生物逆转录元件。我们发现了新型基因组合,并研究它们是否代表实际的逆转录元件。鉴定出了五个单系类群,它们与已鉴定的原核生物逆转录元件不同,具有特定的基因组合,分布零散,且至少包括一个近期整合的实例。这些结果有力地表明了这些元件频繁的水平转移。其中一个类群编码DNA聚合酶A。DNA聚合酶A在逆转录元件生命周期中的一个可能功能是催化第二链cDNA合成,这是使用DNA模板而非RNA模板进行的DNA聚合反应。另一个类群同时编码细菌引发酶和碳氮水解酶。引发酶可能合成引物以启动逆转录。另外两个类群也将碳氮水解酶编码为与RT融合的蛋白。很难推测水解酶在逆转录元件生命周期中的功能。最后一个类群编码双RT蛋白,它们在复制过程中可能形成异二聚体。这5个原核生物逆转录元件类群的蛋白质组与真核生物逆转录元件的完全不同,表明原核生物元件的生存限制与真核生物元件不同。这些原核生物逆转录元件很可能作为染色体外DNA或RNA得以维持,或者偶然整合到基因组中。我们的发现表明,许多类型的染色体外原核生物逆转录元件仍有待鉴定。此外,我们发现8个RT基因与CRISPR-Cas系统的成簇规律间隔短回文重复序列(CRISPRs)相关。这些RT基因可能通过cDNA合成在抵抗RNA噬菌体的免疫中发挥作用。