Suppr超能文献

风疹病毒病毒粒子蛋白C和E2以及非结构蛋白羧基末端编码区的序列:与甲病毒的比较。

Sequence of the region coding for virion proteins C and E2 and the carboxy terminus of the nonstructural proteins of rubella virus: comparison with alphaviruses.

作者信息

Frey T K, Marr L D

机构信息

Department of Biology, Georgia State University, Atlanta 30303.

出版信息

Gene. 1988;62(1):85-99. doi: 10.1016/0378-1119(88)90582-3.

Abstract

The sequence of the 3' 4508 nucleotides (nt) of the genomic RNA of the Therien strain of rubella virus (RV) was determined for cDNA clones. The sequence contains a 3189-nt open reading frame (ORF) which codes for the structural proteins C, E2 and E1. C is predicted to have a length of 300 amino acids (aa). The N-terminal half of the C protein is highly basic and hydrophilic in nature, and is putatively the region of the protein which interacts with the virion RNA. At the C terminus of the C protein is a stretch of 20 hydrophobic aa which also serves as the signal sequence for E2, indicating that the cleavage of C from the polyprotein precursor may be catalyzed by signalase in the lumen of the endoplasmic reticulum. E2 is 282 aa in length and contains four potential N-linked glycosylation sites and a putative transmembrane domain near its C terminus. The sequence of E1 has been previously described [Frey et al., Virology 154 (1986) 228-232]. No homology could be detected between the amino acid sequence of the RV structural proteins and the amino acid sequence of the alphavirus structural proteins. From the position of a region of 30 nt in the RV genomic sequence which exhibited significant homology with the sequence in the alphavirus genome at which subgenomic RNA synthesis is initiated, the RV subgenomic RNA is predicted to be 3346 nt in length and the nontranslated region from the 5' end of the subgenomic RNA to the structural protein ORF is predicted to be 98 nt. In a different translation frame beginning at the 5' end of the RV nt sequence reported here is a 1407 nt ORF which is the C terminal region of the nonstructural protein ORF. This ORF overlaps the structural protein ORF by 149 nt. A low level of homology could be detected between the predicted amino acid sequence of the C-terminus of the RV nonstructural protein ORF and the replicase proteins of several positive RNA viruses of animals and plants, including nsp4 of the alphaviruses, the protein encoded by the C-terminal region of the alphavirus nonstructural ORF. However, the overall homology between RV and the alphaviruses in this region of the genome was only 18%, indicating that these two genera of the Togavirus family are only distantly related. Intriguingly, there is a 2844-nt ORF present in the negative polarity orientation of the RV sequence which could encode a 928-aa polyprotein.

摘要

测定了风疹病毒(RV)泰里恩株基因组RNA 3'端4508个核苷酸(nt)的序列,用于cDNA克隆。该序列包含一个3189 nt的开放阅读框(ORF),编码结构蛋白C、E2和E1。预测C蛋白长度为300个氨基酸(aa)。C蛋白的N端一半在性质上高度碱性且亲水,推测是该蛋白与病毒粒子RNA相互作用的区域。在C蛋白的C端是一段20个疏水氨基酸的序列,它也是E2的信号序列,这表明从多蛋白前体中切割C蛋白可能由内质网腔中的信号肽酶催化。E2长度为282个氨基酸,含有四个潜在的N - 糖基化位点,并且在其C端附近有一个推定的跨膜结构域。E1的序列先前已有描述[弗雷等人,《病毒学》154(1986)228 - 232]。在风疹病毒结构蛋白的氨基酸序列与甲病毒结构蛋白的氨基酸序列之间未检测到同源性。从风疹病毒基因组序列中一个30 nt区域的位置来看,该区域与甲病毒基因组中启动亚基因组RNA合成的序列具有显著同源性,预测风疹病毒亚基因组RNA长度为3346 nt,并且从亚基因组RNA 5'端到结构蛋白ORF的非翻译区预测为98 nt。在此处报告的风疹病毒nt序列5'端起始的一个不同翻译框架中,有一个1407 nt的ORF,它是非结构蛋白ORF的C端区域。这个ORF与结构蛋白ORF重叠149 nt。在风疹病毒非结构蛋白ORF C端的预测氨基酸序列与几种动植物正链RNA病毒的复制酶蛋白之间可检测到低水平的同源性,包括甲病毒的nsp4,即甲病毒非结构ORF C端区域编码的蛋白。然而,风疹病毒与甲病毒在基因组的这个区域的总体同源性仅为18%,表明披膜病毒科的这两个属仅具有远缘关系。有趣的是,在风疹病毒序列的负链方向存在一个2844 nt的ORF,它可能编码一个928个氨基酸的多蛋白。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验