Institute of Biomedical Informatics, National Yang-Ming University, 155, Sec. 2, Linong St., Taipei 11221, Taiwan.
J Virol. 2011 Jul;85(13):6567-78. doi: 10.1128/JVI.02378-10. Epub 2011 Apr 20.
We report the genome organization and analysis of the first completely sequenced T4-like phage, AR1, of Escherichia coli O157:H7. Unlike most of the other sequenced phages of O157:H7, which belong to the temperate Podoviridae and Siphoviridae families, AR1 is a T4-like phage known to efficiently infect this pathogenic bacterial strain. The 167,435-bp AR1 genome is currently the largest among all the sequenced E. coli O157:H7 phages. It carries a total of 281 potential open reading frames (ORFs) and 10 putative tRNA genes. Of the predicted proteins, 126 could be classified into six viral orthologous group categories, and at least 18 proteins of the structural protein category were detected by tandem mass spectrometry. Comparative genomic analysis of AR1 and four other completely sequenced T4-like genomes (RB32, RB69, T4, and JS98) indicated that they share a well-organized and highly conserved core genome, particularly in the regions encoding DNA replication and virion structural proteins. The main differences among these phages lie in the distal tail fiber modules and in the types and numbers of internal proteins, tRNA genes, and mobile elements. Codon usage analysis suggested that the presence of AR1-encoded tRNAs may be relevant to the codon usage of structural proteins. Furthermore, protein sequence analysis of AR1 gp37, a potential receptor binding protein, indicated that eight residues in the C terminus are unique to the O157:H7 T4-like phages AR1 and PP01. These residues are known to be located in the T4 receptor recognition domain, and they may contribute to specificity for adsorption to the O157:H7 strain.
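The codon usage comparison mentioned above can be illustrated with a minimal sketch. It is not the authors' pipeline: the helper names `codon_counts` and `pooled_frequencies` are hypothetical, and the two short sequences are toy data, not AR1 genes. The idea is only to show how codon frequencies pooled over one gene group (for example, structural proteins) could be compared with those of another group, so that codons read by phage-encoded tRNAs can be checked for enrichment.

```python
from collections import Counter


def codon_counts(cds: str) -> Counter:
    """Count in-frame codons of one coding sequence (hypothetical helper)."""
    seq = cds.upper().replace("U", "T")   # accept RNA or DNA alphabets
    usable = len(seq) - len(seq) % 3      # ignore a trailing partial codon
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))


def pooled_frequencies(sequences) -> dict:
    """Relative codon frequencies pooled over a group of coding sequences."""
    counts = Counter()
    for cds in sequences:
        counts.update(codon_counts(cds))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}


# Toy data (assumption): stand-ins for a structural and a non-structural gene set.
structural = ["ATGAGAAGAGGATAA"]
other = ["ATGCGTCGTGGTTAA"]

freq_structural = pooled_frequencies(structural)
freq_other = pooled_frequencies(other)

# Compare the usage of one codon (AGA, arginine) between the two groups.
print(freq_structural.get("AGA", 0.0), freq_other.get("AGA", 0.0))
```

In a real analysis the two lists would hold the annotated coding sequences of each functional category, and the codons of interest would be those matching the anticodons of the predicted tRNA genes.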