Beekwilder M J, Nieuwenhuizen R, van Duin J
Department of Biochemistry, Leiden University, The Netherlands.
J Mol Biol. 1995 Apr 14;247(5):903-17. doi: 10.1006/jmbi.1995.0189.
We have determined the nucleotide sequence of three positive single-stranded RNA coliphages and have used this information, together with the known sequences of the related phages Q beta and SP, to construct a secondary structure model for the two distal domains of Q beta RNA. The 3' terminal domain, which is about 100 nucleotides long, contains most of the 3' untranslated region and folds into four short, regular hairpins. The adjacent 3' replicase domain contains about 1100 nucleotides. Hairpins in this protein-coding domain are much longer and more irregular than in the 3' untranslated region. Both domains are defined by long-distance interactions. The secondary structure is not a collection of hairpin structures connected by single-stranded regions. Rather, the RNA stretches between the stem-loop structures are all involved in an extensive array of long-distance interactions that contract the molecule to a rigid structure in which all hairpins are predicted to have a fixed position with respect to each other. A general feature of the model is that helices tend to be organized in four-way junctions with little or no unpaired nucleotides between them. As a result, there is a potential for coaxial stacking of adjacent stems. The essential features of the model are supported by the S1 nuclease cleavage pattern. Viral RNA sequences are strongly constrained by their coding function. As a result, structural evolution by simple base-pair substitution is not always possible, as this usually requires the juxtaposition of the codon wobble positions in stems. Rather, we often observe co-ordinate base substitutions that maintain the stem, but tend to change the position at which bulges or internal loops are found. Structures that differ in this way are apparently equally fit. Also, the relative position of hairpin loops can shift several nucleotides through an alignment based on maximal sequence identity i.e. amino acid homology. The fact that these structural irregularities do not occur at the 3' untranslated region suggests indeed that the coding function of the RNA constrains the secondary structure. Hairpins with the stable tetraloop motif GNRA and UNCG or their complement are over-represented. This suggests their involvement in segregation of plus and minus strand. The genome of the coliphages contains a well-defined high affinity binding site for the coat protein, which serves to suppress replicase translation and also acts as a nucleation point in capsid formation. Close to the 3' end we find an additional conserved helix that meets the described consensus criteria for coat-protein binding.
我们已确定了三种正链单链RNA大肠杆菌噬菌体的核苷酸序列,并利用这些信息,结合相关噬菌体Qβ和SP的已知序列,构建了Qβ RNA两个远端结构域的二级结构模型。3'末端结构域长约100个核苷酸,包含大部分3'非翻译区,折叠成四个短的规则发夹结构。相邻的3'复制酶结构域包含约1100个核苷酸。这个蛋白质编码结构域中的发夹结构比3'非翻译区的长得多且更不规则。两个结构域均由长距离相互作用定义。二级结构不是由单链区域连接的发夹结构的集合。相反,茎环结构之间的RNA片段都参与了广泛的长距离相互作用,这些相互作用将分子收缩成一个刚性结构,其中所有发夹结构预计彼此具有固定位置。该模型的一个普遍特征是螺旋倾向于以四通连接的方式组织,它们之间几乎没有或没有未配对的核苷酸。因此,相邻茎可能存在同轴堆积。该模型的基本特征得到了S1核酸酶切割模式的支持。病毒RNA序列受到其编码功能的强烈限制。因此,通过简单的碱基对替换进行结构进化并不总是可行的,因为这通常需要茎中密码子摆动位置的并列。相反,我们经常观察到协同碱基替换,这些替换维持了茎,但倾向于改变凸起或内环出现的位置。以这种方式不同的结构显然同样合适。此外,基于最大序列同一性(即氨基酸同源性)的比对,发夹环的相对位置可以移动几个核苷酸。这些结构不规则性不在3'非翻译区出现这一事实确实表明RNA的编码功能限制了二级结构。具有稳定四环基序GNRA和UNCG或其互补序列的发夹结构过度存在。这表明它们参与了正负链的分离。大肠杆菌噬菌体的基因组包含一个明确的衣壳蛋白高亲和力结合位点,该位点用于抑制复制酶翻译,并且在衣壳形成中也充当成核点。靠近3'末端,我们发现了一个额外的保守螺旋,它符合所描述的衣壳蛋白结合的共识标准。