Hidaka Y, Kanda T, Iwasaki K, Nomoto A, Shioda T, Shibuta H
Nucleic Acids Res. 1984 Nov 12;12(21):7965-73. doi: 10.1093/nar/12.21.7965.
We determined the sequence of the 2,138 nucleotides in the Sendai virus genome just following the 3' proximal 3,686 nucleotides which we had previously reported (Nucleic Acids Res. 11, 7317-7330, 1983). This covers the entire third gene of 1,173 nucleotides and the 3' proximal 1,013 nucleotides of the fourth gene. Like the NP and P+C genes, both the third and fourth genes start from consensus sequence R1 (3'-UCCCAC(or UA)UUUC) at the 3' end and the third gene terminates with consensus sequence R2 (3'-AUUCUUUUU) at the 5' end. The third gene was identified as M, and the deduced 348 amino acids indicated that the M protein is rich in basic residues and has hydrophobic domains near the C-terminal. The fourth gene, although sequencing is not complete yet, was identified as F, since a large open reading frame found in the gene contains the characteristic sequence of 20 amino acids located at the N-terminal of the F1 protein. Analyses of the amino acid sequence suggested that the structure of the F gene product is NH2-signal peptide-F2-F1-COOH.
我们测定了仙台病毒基因组中紧接着我们先前报道的3'近端3686个核苷酸之后的2138个核苷酸的序列(《核酸研究》,第11卷,7317 - 7330页,1983年)。这涵盖了整个1173个核苷酸的第三个基因以及第四个基因的3'近端1013个核苷酸。与NP和P + C基因一样,第三个和第四个基因均从3'端的共有序列R1(3'-UCCCAC(或UA)UUUC)起始,第三个基因在5'端以共有序列R2(3'-AUUCUUUUU)终止。第三个基因被鉴定为M基因,推导的348个氨基酸表明M蛋白富含碱性残基且在C端附近有疏水区。第四个基因虽然测序尚未完成,但被鉴定为F基因,因为在该基因中发现的一个大的开放阅读框包含位于F1蛋白N端的20个氨基酸的特征序列。氨基酸序列分析表明F基因产物的结构为NH2 - 信号肽 - F2 - F1 - COOH。