Comeau André M, Krisch Henry M
Laboratoire de Microbiologie et Génétique Moléculaires, Centre National de la Recherche Scientifique-Université Paul Sabatier-Toulouse III, Toulouse, France.
Mol Biol Evol. 2008 Jul;25(7):1321-32. doi: 10.1093/molbev/msn080. Epub 2008 Apr 7.
The Escherichia coli bacteriophage T4 has served as a classic system in phage biology for more than 60 years. Only recently have phylogenetic analyses and genomic comparisons demonstrated the existence of a large, diverse, and widespread superfamily of T4-like phages in the environment. We report here on the T4-like major capsid protein (MCP) sequences that were obtained by targeted polymerase chain reaction (PCR) of marine environmental samples. This analysis was then expanded to include 1,000 s of new sequences of T4-like capsid genes from the metagenomic data obtained during the Sorcerer II Global Ocean Sampling (GOS) expedition. This data compilation reveals that the diversity of the major and minor capsid proteins from the GOS metagenome follows the same general patterns as the sequences from cultured phage genomes. Interestingly, the new MCP sequences obtained by PCR targeted to MCP sequences in environmental samples are more divergent (deeper branching) than the vast majority of the MCP sequences coming from the other sources. The marine T4-like phage population appears to be largely dominated by the T4-like cyanophages. Using approximately 1,400 T4-like MCP sequences from various sources, we mapped the degree of sequence conservation on a structural model of the T4-like MCP. The results indicate that within the T4 superfamily there are some clear phylogenetic groups with regard to the more conserved and more variable domains of the MCP. Such differences can be correlated with variations in capsid morphology, the arrangement of the MCP lattice, and the presence of different capsid accessory proteins between the subgroups of the T4 superfamily.
六十多年来,大肠杆菌噬菌体T4一直是噬菌体生物学中的经典系统。直到最近,系统发育分析和基因组比较才证明环境中存在一个庞大、多样且广泛分布的T4样噬菌体超家族。我们在此报告通过对海洋环境样本进行靶向聚合酶链反应(PCR)获得的T4样主要衣壳蛋白(MCP)序列。然后,该分析扩展到包括来自“魔法师二号”全球海洋采样(GOS)探险期间获得的宏基因组数据中的数千条T4样衣壳基因新序列。这一数据汇编表明,GOS宏基因组中主要和次要衣壳蛋白的多样性与来自培养噬菌体基因组的序列遵循相同的一般模式。有趣的是,通过针对环境样本中MCP序列进行PCR获得的新MCP序列比绝大多数来自其他来源的MCP序列差异更大(分支更深)。海洋中T4样噬菌体群体似乎在很大程度上由T4样蓝藻噬菌体主导。我们使用来自各种来源的约1400条T4样MCP序列,在T4样MCP的结构模型上绘制了序列保守程度。结果表明,在T4超家族中,就MCP的保守结构域和可变结构域而言,存在一些明显的系统发育组。这种差异可能与衣壳形态的变化、MCP晶格的排列以及T4超家族各亚组之间不同衣壳辅助蛋白的存在有关。