Leontis Neocles B, Westhof Eric
Chemistry Department and Center for Biomolecular Sciences, Overman Hall, Bowling Green State University, Bowling Green, OH 43403, USA.
Comp Funct Genomics. 2002;3(6):518-24. doi: 10.1002/cfg.213.
The recent deluge of new RNA structures, including complete atomic-resolution views of both subunits of the ribosome, has on the one hand literally overwhelmed our individual abilities to comprehend the diversity of RNA structure, and on the other hand presented us with new opportunities for comprehensive use of RNA sequences for comparative genetic, evolutionary and phylogenetic studies. Two concepts are key to understanding RNA structure: hierarchical organization of global structure and isostericity of local interactions. Global structure changes extremely slowly, as it relies on conserved long-range tertiary interactions. Tertiary RNA-RNA and quaternary RNA-protein interactions are mediated by RNA motifs, defined as recurrent and ordered arrays of non-Watson-Crick base-pairs. A single RNA motif comprises a family of sequences, all of which can fold into the same three-dimensional structure and can mediate the same interaction(s). The chemistry and geometry of base pairing constrain the evolution of motifs in such a way that random mutations that occur within motifs are accepted or rejected insofar as they can mediate a similar ordered array of interactions. The steps involved in the analysis and annotation of RNA motifs in 3D structures are: (a) decomposition of each motif into non-Watson-Crick base-pairs; (b) geometric classification of each basepair; (c) identification of isosteric substitutions for each basepair by comparison to isostericity matrices; (d) alignment of homologous sequences using the isostericity matrices to identify corresponding positions in the crystal structure; (e) acceptance or rejection of the null hypothesis that the motif is conserved.
最近大量新的RNA结构不断涌现,包括核糖体两个亚基的完整原子分辨率视图,一方面着实超出了我们个人理解RNA结构多样性的能力,另一方面也为我们利用RNA序列进行比较遗传学、进化和系统发育研究提供了新机会。理解RNA结构的两个关键概念是:全局结构的层次组织和局部相互作用的等排性。全局结构变化极其缓慢,因为它依赖于保守的长程三级相互作用。三级RNA-RNA和四级RNA-蛋白质相互作用由RNA基序介导,RNA基序被定义为非沃森-克里克碱基对的重复且有序排列。单个RNA基序包含一个序列家族,所有这些序列都能折叠成相同的三维结构并介导相同的相互作用。碱基配对的化学性质和几何形状以这样一种方式限制了基序的进化,即基序内发生的随机突变根据它们能否介导类似的有序相互作用阵列而被接受或拒绝。分析和注释三维结构中RNA基序所涉及的步骤如下:(a) 将每个基序分解为非沃森-克里克碱基对;(b) 对每个碱基对进行几何分类;(c) 通过与等排性矩阵比较确定每个碱基对的等排替代;(d) 使用等排性矩阵比对同源序列以确定晶体结构中的相应位置;(e) 接受或拒绝该基序保守的原假设。