Stewart Charles R, Casjens Sherwood R, Cresawn Steven G, Houtz Jennifer M, Smith Alexis L, Ford Michael E, Peebles Craig L, Hatfull Graham F, Hendrix Roger W, Huang Wai Mun, Pedulla Marisa L
Department of Biochemistry and Cell Biology, Rice University, Houston, TX 77251, USA.
J Mol Biol. 2009 Apr 24;388(1):48-70. doi: 10.1016/j.jmb.2009.03.009. Epub 2009 Mar 10.
We report the genome sequence of Bacillus subtilis phage SPO1. The unique genome sequence is 132,562 bp long, and DNA packaged in the virion (the chromosome) has a 13,185-bp terminal redundancy, giving a total of 145,747 bp. We predict 204 protein-coding genes and 5 tRNA genes, and we correlate these findings with the extensive body of investigations of SPO1, including studies of the functions of the 61 previously defined genes and studies of the virion structure. Sixty-nine percent of the encoded proteins show no similarity to any previously known protein. We identify 107 probable transcription promoters; most are members of the promoter classes identified in earlier studies, but we also see a new class that has the same sequence as the host sigma K promoters. We find three genes encoding potential new transcription factors, one of which is a distant homologue of the host sigma factor K. We also identify 75 probable transcription terminator structures. Promoters and terminators are generally located between genes and together with earlier data give what appears to be a rather complete picture of how phage transcription is regulated. There are complete genome sequences available for five additional phages of Gram-positive hosts that are similar to SPO1 in genome size and in composition and organization of genes. Comparative analysis of SPO1 in the context of these other phages yields insights about SPO1 and the other phages that would not be apparent from the analysis of any one phage alone. These include assigning identities as well as probable functions for several specific genes and inferring evolutionary events in the phages' histories. The comparative analysis also allows us to put SPO1 into a phylogenetic context. We see a pattern similar to what has been noted in phage T4 and its relatives, in which there is minimal successful horizontal exchange of genes among a "core" set of genes that includes most of the virion structural genes and some genes of DNA metabolism, but there is extensive horizontal transfer of genes over the remainder of the genome. There is a correlation between genes in rapid evolutionary flux through these genomes and genes that are small.
我们报道了枯草芽孢杆菌噬菌体SPO1的基因组序列。其独特的基因组序列长度为132,562 bp,包装在病毒粒子中的DNA(染色体)具有13,185 bp的末端冗余,总计145,747 bp。我们预测了204个蛋白质编码基因和5个tRNA基因,并将这些发现与对SPO1的大量研究相关联,包括对61个先前定义基因的功能研究以及对病毒粒子结构的研究。69%的编码蛋白质与任何先前已知的蛋白质均无相似性。我们鉴定出107个可能的转录启动子;大多数属于早期研究中确定的启动子类别,但我们也发现了一类新的启动子,其序列与宿主σK启动子相同。我们发现三个编码潜在新转录因子的基因,其中一个是宿主σ因子K的远亲同源物。我们还鉴定出75个可能的转录终止子结构。启动子和终止子通常位于基因之间,结合早期数据,这似乎给出了噬菌体转录调控方式的相当完整的图景。另外还有五个革兰氏阳性宿主噬菌体的完整基因组序列,它们在基因组大小以及基因组成和组织方面与SPO1相似。在这些其他噬菌体的背景下对SPO1进行比较分析,能够深入了解SPO1和其他噬菌体,而这些是单独分析任何一个噬菌体时所不明显的。这些包括为几个特定基因确定身份以及可能的功能,并推断噬菌体历史中的进化事件。比较分析还使我们能够将SPO1置于系统发育背景中。我们看到一种与噬菌体T4及其亲属中所观察到的模式相似的情况,即在一组“核心”基因中,包括大多数病毒粒子结构基因和一些DNA代谢基因,基因的水平交换很少成功,但在基因组的其余部分存在广泛的基因水平转移。在这些基因组中快速进化的基因与小基因之间存在相关性。