Jain Swati, Bayrak Cigdem S, Petingi Louis, Schlick Tamar
Department of Chemistry, New York University, New York, NY 10003, USA.
Computer Science Department, College of Staten Island, City University of New York, Staten Island, New York, NY 10314, USA.
Genes (Basel). 2018 Jul 25;9(8):371. doi: 10.3390/genes9080371.
RNA molecules are composed of modular architectural units that define their unique structural and functional properties. Characterization of these building blocks can help interpret RNA structure/function relationships. We present an RNA secondary structure motif and submotif library using dual graph representation and partitioning. Dual graphs represent RNA helices as vertices and loops as edges. Unlike tree graphs, dual graphs can represent RNA pseudoknots (intertwined base pairs). For a representative set of RNA structures, we construct dual graphs from their secondary structures, and apply our partitioning algorithm to identify non-separable subgraphs (or blocks) without breaking pseudoknots. We report 56 subgraph blocks with up to nine vertices; among them, 22 are frequently occurring, 15 of which contain pseudoknots. We then catalog atomic fragments corresponding to the subgraph blocks to define a library of building blocks that can be used for RNA design, as we have done for tree graphs. As an application, we analyze the distribution of these subgraph blocks within ribosomal RNAs of various prokaryotic and eukaryotic species to identify common subgraphs and possible ancestry relationships. Other applications of dual graph partitioning and the motif library can be envisioned for RNA structure analysis and design.
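The partitioning step described above, splitting a dual graph into non-separable subgraphs (blocks), can be illustrated with the textbook depth-first search for biconnected components. The sketch below is a minimal illustration and not the authors' implementation: the function name `blocks`, the edge-list input format, and the toy graph are assumptions made for this example. Vertices stand for helices and edges for loops; parallel edges are allowed because two helices may be joined by more than one strand, and self-loops are skipped for simplicity.

```python
from collections import defaultdict


def blocks(edges):
    """Return the non-separable subgraphs (blocks) of an undirected multigraph.

    edges: list of (u, v) pairs. Parallel edges are allowed; self-loops are
    skipped (a simplifying assumption of this sketch).
    Returns a list of blocks, each given as a sorted list of its vertices.
    """
    adj = defaultdict(list)
    for eid, (u, v) in enumerate(edges):
        if u == v:
            continue
        adj[u].append((v, eid))
        adj[v].append((u, eid))

    disc, low = {}, {}   # discovery time and low-link value per vertex
    stack = []           # edges visited since the last block was emitted
    result = []
    counter = 0

    def dfs(u, parent_eid):
        nonlocal counter
        disc[u] = low[u] = counter
        counter += 1
        for v, eid in adj[u]:
            if eid == parent_eid:
                continue  # do not walk back along the same edge instance
            if v not in disc:
                stack.append((u, v))
                dfs(v, eid)
                low[u] = min(low[u], low[v])
                if low[v] >= disc[u]:
                    # u separates the subtree under v: pop one complete block
                    block = set()
                    while True:
                        a, b = stack.pop()
                        block.update((a, b))
                        if (a, b) == (u, v):
                            break
                    result.append(sorted(block))
            elif disc[v] < disc[u]:
                # back edge to an ancestor (includes parallel edges)
                stack.append((u, v))
                low[u] = min(low[u], disc[v])

    for start in list(adj):
        if start not in disc:
            dfs(start, None)
    return result


# Hypothetical toy dual graph: a 3-cycle of helices (0, 1, 2), a single
# connection 2-3, and two helices (3, 4) joined by two strands.
toy_edges = [(0, 1), (1, 2), (2, 0), (2, 3), (3, 4), (3, 4)]
toy_blocks = sorted(blocks(toy_edges))
print(toy_blocks)
```

In this toy graph, vertices 2 and 3 are articulation points, so the graph separates into three blocks that overlap only at those vertices. The recursive search is adequate for graphs of the size discussed here (blocks of up to nine vertices); a very large graph would call for an iterative version.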