Department of Chemistry, Saint Louis University, St Louis, MO 63103, USA.
Nucleic Acids Res. 2011 Feb;39(3):1081-94. doi: 10.1093/nar/gkq793. Epub 2010 Sep 28.
RNA is known to be involved in several cellular processes; however, it is only active when it is folded into its correct 3D conformation. The folding, bending and twisting of an RNA molecule is dependent upon the multitude of canonical and non-canonical secondary structure motifs. These motifs contribute to the structural complexity of RNA but also serve important integral biological functions, such as serving as recognition and binding sites for other biomolecules or small ligands. One of the most prevalent types of RNA secondary structure motifs are single mismatches, which occur when two canonical pairs are separated by a single non-canonical pair. To determine sequence-structure relationships and to identify structural patterns, we have systematically located, annotated and compared all available occurrences of the 30 most frequently occurring single mismatch-nearest neighbor sequence combinations found in experimentally determined 3D structures of RNA-containing molecules deposited into the Protein Data Bank. Hydrogen bonding, stacking and interaction of nucleotide edges for the mismatched and nearest neighbor base pairs are described and compared, allowing for the identification of several structural patterns. Such a database and comparison will allow researchers to gain insight into the structural features of unstudied sequences and to quickly look-up studied sequences.
RNA 参与了许多细胞过程;然而,只有当它折叠成正确的 3D 构象时,它才具有活性。RNA 分子的折叠、弯曲和扭曲取决于众多的规范和非规范二级结构基序。这些基序有助于 RNA 的结构复杂性,但也具有重要的整体生物学功能,例如作为其他生物分子或小分子配体的识别和结合位点。最常见的 RNA 二级结构基序之一是单错配,当两个规范对被一个非规范对隔开时就会发生单错配。为了确定序列-结构关系并识别结构模式,我们系统地定位、注释并比较了在蛋白质数据库中存储的 RNA 分子的实验确定的 3D 结构中所有最常出现的 30 个单错配-最近邻序列组合的所有出现情况。描述并比较了错配和最近邻碱基对的氢键、堆积和核苷酸边缘相互作用,从而确定了几种结构模式。这样的数据库和比较将使研究人员能够深入了解未研究序列的结构特征,并快速查找已研究序列。