Choi Sunju, Ohta Shinya, Ohtsubo Eiichi
Institute of Molecular and Cellular Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo 113-0032, Japan.
J Bacteriol. 2003 Aug;185(16):4891-900. doi: 10.1128/JB.185.16.4891-4900.2003.
An Escherichia coli strain, ECOR28, was found to have insertions of an identical sequence (1,279 bp in length) at 10 loci in its genome. This insertion sequence (named IS621) has one large open reading frame encoding a putative protein that is 326 amino acids in length. A computer-aided homology search using the DNA sequence as the query revealed that IS621 was homologous to the piv genes, encoding pilin gene invertase (PIV). A homology search using the amino acid sequence of the putative protein encoded by IS621 as the query revealed that the protein also has partial homology to transposases encoded by the IS110/IS492 family elements, which were known to have partial homology to PIV. This indicates that IS621 belongs to the IS110/IS492 family but is most closely related to the piv genes. In fact, a phylogenetic tree constructed on the basis of amino acid sequences of PIV proteins and transposases revealed that IS621 belongs to the piv gene group, which is distinct from the IS110/IS492 family elements, which form several groups. PIV proteins and transposases encoded by the IS110/IS492 family elements, including IS621, have four acidic amino acid residues, which are conserved at positions in their N-terminal regions. These residues may constitute a tetrad D-E(or D)-D-D motif as the catalytic center. Interestingly, IS621 was inserted at specific sites within repetitive extragenic palindromic (REP) sequences at 10 loci in the ECOR28 genome. IS621 may not recognize the entire REP sequence in transposition, but it recognizes a 15-bp sequence conserved in the REP sequences around the target site. There are several elements belonging to the IS110/IS492 family that also transpose to specific sites in the repeated sequences, as does IS621. IS621 does not have terminal inverted repeats like most of the IS110/IS492 family elements. The terminal sequences of IS621 have homology with the 26-bp inverted repeat sequences of pilin gene inversion sites that are recognized and used for inversion of pilin genes by PIV. This suggests that IS621 initiates transposition through recognition of their terminal regions and cleavage at the ends by a mechanism similar to that used for PIV to promote inversion at the pilin gene inversion sites.
发现一株大肠杆菌菌株ECOR28在其基因组的10个位点插入了相同序列(长度为1279 bp)。这个插入序列(命名为IS621)有一个大的开放阅读框,编码一个长度为326个氨基酸的假定蛋白。以DNA序列作为查询进行计算机辅助同源性搜索发现,IS621与编码菌毛蛋白基因转化酶(PIV)的piv基因同源。以IS621编码的假定蛋白的氨基酸序列作为查询进行同源性搜索发现,该蛋白也与IS110/IS492家族元件编码的转座酶有部分同源性,已知这些元件与PIV有部分同源性。这表明IS621属于IS110/IS492家族,但与piv基因关系最为密切。事实上,基于PIV蛋白和转座酶的氨基酸序列构建的系统发育树表明,IS621属于piv基因组,这与形成几个组的IS110/IS492家族元件不同。包括IS621在内的IS110/IS492家族元件编码的PIV蛋白和转座酶有四个酸性氨基酸残基,它们在其N端区域的位置是保守的。这些残基可能构成一个作为催化中心的四联体D-E(或D)-D-D基序。有趣的是,IS621插入到ECOR28基因组中10个位点的重复基因外回文(REP)序列内的特定位置。IS621在转座过程中可能不识别整个REP序列,但它识别目标位点周围REP序列中保守的一个15 bp序列。有几个属于IS110/IS492家族的元件也像IS621一样转座到重复序列中的特定位置。IS621不像大多数IS110/IS492家族元件那样有末端反向重复序列。IS621的末端序列与菌毛蛋白基因转化位点的26 bp反向重复序列有同源性,PIV识别并利用这些序列进行菌毛蛋白基因的转化。这表明IS621通过识别其末端区域并在末端进行切割来启动转座,其机制类似于PIV在菌毛蛋白基因转化位点促进转化所使用的机制。