Benedetti G, Morosetti S
Dipartimento di Chimica, Università di Roma La Sapienza, Italy.
Biophys Chem. 1996 Mar 7;59(1-2):179-84. doi: 10.1016/0301-4622(95)00119-0.
Secondary and tertiary RNA structures play an important role in many biological processes. Therefore the necessity arises to find similar higher-order structures for different but functionally homologous RNA sequences. We propose here a graph-topological approach to the problem, which shows two main features: simplified graph representation which allows the recognition of similarity of RNA secondary structures with the same branching look despite minor differences. This allows comparison among foldings from different sequences, and "pruning" of the secondary structures not shared by all the sequences since the early stages of the search. (b) The graph representation is encoded by the Randić topological index, and the search for the folding similarity is reduced to checking the identity of single numbers. These characteristics make this approach significantly different, less depending on empirical criteria, and less computationally heavy then previous methods, where the folding consensus has been measured by an alignment procedure or correlation of strings representing the secondary structures. Some U2 snRNA and viroid sequences are studied by this approach, which is imbedded in our previous search method based on genetic algorithms.
二级和三级RNA结构在许多生物过程中发挥着重要作用。因此,有必要为不同但功能同源的RNA序列找到相似的高阶结构。我们在此提出一种针对该问题的图拓扑方法,该方法具有两个主要特点:(a)简化的图表示,它允许识别具有相同分支外观的RNA二级结构的相似性,尽管存在细微差异。这使得能够比较不同序列的折叠方式,并在搜索的早期阶段“修剪”所有序列不共享的二级结构。(b)图表示由兰迪奇拓扑指数编码,并且对折叠相似性的搜索简化为检查单个数字的一致性。这些特性使得该方法有显著不同,较少依赖经验标准,并且比以前的方法计算量更小,以前的方法中折叠一致性是通过比对程序或表示二级结构的字符串的相关性来衡量的。通过这种方法研究了一些U2 snRNA和类病毒序列,该方法嵌入在我们之前基于遗传算法的搜索方法中。