Department of Computer Science, University of Central Florida, Orlando, FL 32816, United States.
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i337-i346. doi: 10.1093/bioinformatics/btad223.
The 3D structures of RNA play a critical role in understanding their functionalities. There exist several computational methods to study RNA 3D structures by identifying structural motifs and categorizing them into several motif families based on their structures. Although the number of such motif families is not limited, a few of them are well-studied. Out of these structural motif families, there exist several families that are visually similar or very close in structure, even with different base interactions. Alternatively, some motif families share a set of base interactions but maintain variation in their 3D formations. These similarities among different motif families, if known, can provide a better insight into the RNA 3D structural motifs as well as their characteristic functions in cell biology.
In this work, we proposed a method, RNAMotifComp, that analyzes the instances of well-known structural motif families and establishes a relational graph among them. We also have designed a method to visualize the relational graph where the families are shown as nodes and their similarity information is represented as edges. We validated our discovered correlations of the motif families using RNAMotifContrast. Additionally, we used a basic Naïve Bayes classifier to show the importance of RNAMotifComp. The relational analysis explains the functional analogies of divergent motif families and illustrates the situations where the motifs of disparate families are predicted to be of the same family.
Source code publicly available at https://github.com/ucfcbb/RNAMotifFamilySimilarity.
RNA 的 3D 结构在理解其功能方面起着至关重要的作用。存在几种计算方法可以通过识别结构基序并根据其结构将它们分类为几个基序家族来研究 RNA 3D 结构。尽管这些基序家族的数量没有限制,但其中有一些是研究得很好的。在这些结构基序家族中,存在一些在结构上视觉上相似或非常接近的家族,即使它们的碱基相互作用不同。或者,一些基序家族共享一组碱基相互作用,但在其 3D 形成上保持变化。如果知道这些不同基序家族之间的相似性,则可以更好地了解 RNA 3D 结构基序及其在细胞生物学中的特征功能。
在这项工作中,我们提出了一种方法 RNAMotifComp,该方法分析了已知结构基序家族的实例,并在它们之间建立了关系图。我们还设计了一种方法来可视化关系图,其中家族显示为节点,它们的相似性信息表示为边。我们使用 RNAMotifContrast 验证了我们发现的基序家族相关性。此外,我们还使用了基本的朴素贝叶斯分类器来展示 RNAMotifComp 的重要性。关系分析解释了不同基序家族的功能类似性,并说明了预测不同家族的基序属于同一家族的情况。
源代码可在 https://github.com/ucfcbb/RNAMotifFamilySimilarity 上公开获取。