van Wezenbeek P, Schoenmakers J G
Nucleic Acids Res. 1979 Jun 25;6(8):2799-818. doi: 10.1093/nar/6.8.2799.
A DNA region of 2750 base pairs encompassing the genes III, VI and I of bacteriophage M13 has been sequenced by the Maxam-Gilbert procedure. By establishing the nucleotide changes introduced by several amber mutations, the coding region and the regulatory signals of each gene have been deduced. The genes appear to span 1275 base pairs (gene III; mol.wt. 44,748) 339 base pairs (gene VI; mol.wt. 12,264) and 1047 base pairs (gene I; mol.wt. 39,500). Their separating non-codogenic regions are extremely short, namely two and one base pair, respectively. The C-terminal end of gene I, however, intrudes 23 nucleotides into gene IV. From the nucleotide sequence it appears that the minor capsid protein of the phage, which is encoded by gene III, is synthesized in a precursor form containing 18 extra amino acids at its N-terminal end. Furthermore, in this capsid protein two clusters of a fourfold repeat of the sequence Glu-Gly-Gly-Gly-Ser are apparent. Gene VI appears to code for a small, extremely hydrophobic polypeptide. Its total hydrophobic amino acids content of 51% suggests that this protein can only function in the host cell membrane.
利用马克萨姆-吉尔伯特法对噬菌体M13包含基因III、VI和I的2750个碱基对的DNA区域进行了测序。通过确定由几个琥珀突变引入的核苷酸变化,推导了每个基因的编码区和调控信号。这些基因似乎分别跨越1275个碱基对(基因III;分子量44748)、339个碱基对(基因VI;分子量12264)和1047个碱基对(基因I;分子量39500)。它们之间的非编码区非常短,分别为2个和1个碱基对。然而,基因I的C末端有23个核苷酸插入到基因IV中。从核苷酸序列来看,由基因III编码的噬菌体小衣壳蛋白以前体形式合成,其N末端含有18个额外的氨基酸。此外,在这种衣壳蛋白中,明显存在由谷氨酸-甘氨酸-甘氨酸-甘氨酸-丝氨酸序列四倍重复组成的两个簇。基因VI似乎编码一种小的、极度疏水的多肽。其总疏水氨基酸含量为51%,表明该蛋白只能在宿主细胞膜中发挥作用。