Liu Qi, Zhang Yin, Xu Ying, Ye Xiuzi
Zhejiang California International Nanosystems Institute, Zhejiang University, Hangzhou, 310029, China.
J Biomol Struct Dyn. 2008 Jun;25(6):685-96. doi: 10.1080/07391102.2008.10507214.
Measuring the (dis)similarity between RNA secondary structures is critical for the study of RNA secondary structures and has implications to RNA functional characterization. Although a number of methods have been developed for comparing RNA structural similarities, their applications have been limited by the complexity of the required computation. In this paper, we present a novel method for comparing the similarity of RNA secondary structures generated from the same RNA sequence, i.e., a secondary structure ensemble, using a matrix representation of the RNA structures. Relevant features of the RNA secondary structures can be easily extracted through singular value decomposition (SVD) of the representing matrices. We have mapped the feature vectors of the singular values to a kernel space, where (dis)similarities among the mapped feature vectors become more evident, making clustering of RNA secondary structures easier to handle. The pair-wise comparison of RNA structures is achieved through computing the distance between the singular value vectors in the kernel space. We have applied a fuzzy kernel clustering method, using this similarity metric, to cluster the RNA secondary structure ensembles. Our application results suggest that our fuzzy kernel clustering method is highly promising for classifications of RNA structure ensembles, because of its low computational complexity and high clustering accuracy.
测量RNA二级结构之间的(不)相似性对于RNA二级结构的研究至关重要,并且对RNA功能表征具有重要意义。尽管已经开发了许多用于比较RNA结构相似性的方法,但它们的应用受到所需计算复杂性的限制。在本文中,我们提出了一种新颖的方法,用于比较由相同RNA序列生成的RNA二级结构(即二级结构集合)的相似性,该方法使用RNA结构的矩阵表示。通过对表示矩阵进行奇异值分解(SVD),可以轻松提取RNA二级结构的相关特征。我们将奇异值的特征向量映射到核空间,在该空间中,映射后的特征向量之间的(不)相似性变得更加明显,从而使RNA二级结构的聚类更容易处理。RNA结构的成对比较是通过计算核空间中奇异值向量之间的距离来实现的。我们应用了一种模糊核聚类方法,使用这种相似性度量来对RNA二级结构集合进行聚类。我们的应用结果表明,我们的模糊核聚类方法因其低计算复杂性和高聚类准确性,在RNA结构集合分类方面具有很大的前景。