Sullivan Matthew B, Coleman Maureen L, Quinlivan Vanessa, Rosenkrantz Jessica E, Defrancesco Alicia S, Tan G, Fu Ross, Lee Jessica A, Waterbury John B, Bielawski Joseph P, Chisholm Sallie W
Departments of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
Environ Microbiol. 2008 Oct;10(10):2810-23. doi: 10.1111/j.1462-2920.2008.01702.x. Epub 2008 Jul 31.
Oceanic phages are critical components of the global ecosystem, where they play a role in microbial mortality and evolution. Our understanding of phage diversity is greatly limited by the lack of useful genetic diversity measures. Previous studies, focusing on myophages that infect the marine cyanobacterium Synechococcus, have used the coliphage T4 portal-protein-encoding homologue, gene 20 (g20), as a diversity marker. These studies revealed 10 sequence clusters, 9 oceanic and 1 freshwater, where only 3 contained cultured representatives. We sequenced g20 from 38 marine myophages isolated using a diversity of Synechococcus and Prochlorococcus hosts to see if any would fall into the clusters that lacked cultured representatives. On the contrary, all fell into the three clusters that already contained sequences from cultured phages. Further, there was no obvious relationship between host of isolation, or host range, and g20 sequence similarity. We next expanded our analyses to all available g20 sequences (769 sequences), which include PCR amplicons from wild uncultured phages, non-PCR amplified sequences identified in the Global Ocean Survey (GOS) metagenomic database, as well as sequences from cultured phages, to evaluate the relationship between g20 sequence clusters and habitat features from which the phage sequences were isolated. Even in this meta-data set, very few sequences fell into the sequence clusters without cultured representatives, suggesting that the latter are very rare, or sequencing artefacts. In contrast, sequences most similar to the culture-containing clusters, the freshwater cluster and two novel clusters, were more highly represented, with one particular culture-containing cluster representing the dominant g20 genotype in the unamplified GOS sequence data. Finally, while some g20 sequences were non-randomly distributed with respect to habitat, there were always numerous exceptions to general patterns, indicating that phage portal proteins are not good predictors of a phage's host or the habitat in which a particular phage may thrive.
海洋噬菌体是全球生态系统的关键组成部分,它们在微生物的死亡和进化过程中发挥作用。由于缺乏有效的遗传多样性测量方法,我们对噬菌体多样性的了解极为有限。以往的研究聚焦于感染海洋蓝细菌聚球藻的肌尾噬菌体,这些研究使用了编码大肠杆菌噬菌体T4门户蛋白的同源基因20(g20)作为多样性标记。这些研究揭示了10个序列簇,其中9个来自海洋,1个来自淡水,只有3个包含可培养的代表菌株。我们对从多种聚球藻和原绿球藻宿主中分离出的38个海洋肌尾噬菌体的g20进行了测序,以查看是否有任何噬菌体属于缺乏可培养代表菌株的序列簇。相反,所有噬菌体都落入了已经包含来自可培养噬菌体序列的三个簇中。此外,分离宿主或宿主范围与g20序列相似性之间没有明显关系。接下来,我们将分析扩展到所有可用的g20序列(769个序列),这些序列包括来自未培养野生噬菌体的PCR扩增子、全球海洋调查(GOS)宏基因组数据库中鉴定的非PCR扩增序列以及来自可培养噬菌体的序列,以评估g20序列簇与分离噬菌体序列的栖息地特征之间的关系。即使在这个元数据集中,也只有极少数序列落入没有可培养代表菌株的序列簇中,这表明后者非常罕见,或者是测序假象。相比之下,与包含培养菌株的簇、淡水簇和两个新簇最相似的序列有更高的代表性,其中一个特定的包含培养菌株的簇代表了未扩增的GOS序列数据中占主导地位的g20基因型。最后,虽然一些g20序列在栖息地方面是非随机分布的,但总体模式总是有许多例外情况,这表明噬菌体门户蛋白不是噬菌体宿主或特定噬菌体可能繁盛的栖息地的良好预测指标。