Dassa Bareket, London Nir, Stoddard Barry L, Schueler-Furman Ora, Pietrokovski Shmuel
Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel.
Nucleic Acids Res. 2009 May;37(8):2560-73. doi: 10.1093/nar/gkp095. Epub 2009 Mar 5.
Inteins are genetic elements, inserted in-frame into protein-coding genes, whose products catalyze their removal from the protein precursor via a protein-splicing reaction. Intein domains can be split into two fragments and still ligate their flanks by a trans-protein-splicing reaction. A bioinformatic analysis of environmental metagenomic data revealed 26 different loci with a novel genomic arrangement. In each locus, a conserved enzyme coding region is broken in two by a split intein, with a free-standing endonuclease gene inserted in between. Eight types of DNA synthesis and repair enzymes have this 'fractured' organization. The new types of naturally split-inteins were analyzed in comparison to known split-inteins. Some loci include apparent gene control elements brought in with the endonuclease gene. A newly predicted homing endonuclease family, related to very-short patch repair (Vsr) endonucleases, was found in half of the loci. These putative homing endonucleases also appear in group-I introns, and as stand-alone inserts in the absence of surrounding intervening sequences. The new fractured genes organization appears to be present mainly in phage, shows how endonucleases can integrate into inteins, and may represent a missing link in the evolution of gene breaking in general, and in the creation of split-inteins in particular.
内含肽是插入蛋白质编码基因读码框内的遗传元件,其产物通过蛋白质剪接反应催化自身从蛋白质前体中去除。内含肽结构域可被分成两个片段,并且仍能通过反式蛋白质剪接反应连接其侧翼。对环境宏基因组数据的生物信息学分析揭示了26个具有新型基因组排列的不同位点。在每个位点,一个保守的酶编码区域被一个分裂内含肽分成两部分,中间插入一个独立的内切核酸酶基因。八种类型的DNA合成和修复酶具有这种“断裂”的组织形式。将新型天然分裂内含肽与已知的分裂内含肽进行了比较分析。一些位点包括与内切核酸酶基因一起引入的明显的基因控制元件。在一半的位点中发现了一个新预测的归巢内切核酸酶家族,它与极短补丁修复(Vsr)内切核酸酶相关。这些假定的归巢内切核酸酶也出现在I组内含子中,并且在没有周围间隔序列的情况下作为独立插入片段出现。这种新的断裂基因组织形式似乎主要存在于噬菌体中,显示了内切核酸酶如何整合到内含肽中,并且可能代表了一般基因断裂进化中,特别是分裂内含肽产生过程中缺失的环节。