Mori Ryota, Hamada Michiaki, Asai Kiyoshi
BMC Genomics. 2014;15 Suppl 10(Suppl 10):S6. doi: 10.1186/1471-2164-15-S10-S6. Epub 2014 Dec 12.
Although the need for analyses of RNA secondary structures is increasing, predictions of RNA secondary structures are not always reliable. Because an RNA may have a complicated energy landscape, comprehensive representations of the whole ensemble of secondary structures, such as the probability distributions of various features of RNA secondary structures, are required.
A general method to efficiently compute the distribution of any integer scalar/vector function on the secondary structure is proposed. We also show two concrete algorithms, one for the Hamming distance from a reference structure and one for the 5'-3' distance, which can be constructed by following our general method. These practical applications demonstrate the effectiveness of the proposed method.
The proposed method provides a clear and comprehensive procedure to construct algorithms for distributions of various integer features. In addition, distributions of integer vectors, that is, combinations of different integer scores, can also be described by applying our 2D expanding technique.
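To illustrate what a distribution of an integer feature over the structure ensemble looks like, the following is a minimal sketch for the Hamming (base-pair) distance from a reference structure. It is not the authors' algorithm: it assumes a simplified Nussinov-style model in which every structure has weight `pair_weight` raised to its number of base pairs, rather than a thermodynamic energy model, and it carries the distribution through the dynamic programming as a list of polynomial coefficients indexed by distance. The names `hamming_distribution`, `PAIRS`, `min_loop` and `pair_weight` are hypothetical and chosen only for this example.

```python
# Allowed base pairs (Watson-Crick plus G-U wobble); an assumption of this sketch.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def hamming_distribution(seq, ref_pairs, min_loop=3, pair_weight=1.0):
    """Return dist, where dist[d] is the total weight of all structures whose
    base-pair distance from the reference is d (0-based pairs (a, b), a < b)."""
    n = len(seq)
    ref = set(ref_pairs)

    def ref_count(i, j):
        # Number of reference pairs lying entirely inside [i, j].
        return sum(1 for a, b in ref if i <= a and b <= j)

    def shift_scale(poly, delta, w):
        # Multiply a distribution by w * x**delta (adds delta to every distance).
        return [0.0] * delta + [w * c for c in poly]

    def add(p, q):
        if len(p) < len(q):
            p, q = q, p
        return [c + (q[t] if t < len(q) else 0.0) for t, c in enumerate(p)]

    def conv(p, q):
        # Distances of independent sub-intervals add up: polynomial product.
        out = [0.0] * (len(p) + len(q) - 1)
        for a, x in enumerate(p):
            for b, y in enumerate(q):
                out[a + b] += x * y
        return out

    Z = {}

    def get(i, j):
        # An empty interval holds only the empty structure, at distance 0.
        return Z[(i, j)] if i <= j else [1.0]

    for span in range(n):
        for i in range(n - span):
            j = i + span
            # Case 1: position j is unpaired; a reference pair ending at j
            # (inside the interval) is then missing and costs 1.
            total = shift_scale(get(i, j - 1),
                                ref_count(i, j) - ref_count(i, j - 1), 1.0)
            # Case 2: position j pairs with k, splitting [i, j] in two.
            for k in range(i, j - min_loop):
                if (seq[k], seq[j]) not in PAIRS:
                    continue
                in_ref = 1 if (k, j) in ref else 0
                # Reference pairs broken by this split, plus 1 if (k, j)
                # itself is not a reference pair.
                delta = (ref_count(i, j) - ref_count(i, k - 1)
                         - ref_count(k + 1, j - 1) - in_ref + (1 - in_ref))
                term = conv(get(i, k - 1), get(k + 1, j - 1))
                total = add(total, shift_scale(term, delta, pair_weight))
            Z[(i, j)] = total
    return get(0, n - 1)


# Usage: normalise the weights to obtain a probability distribution.
dist = hamming_distribution("GGGAAAUCCC", [(0, 9), (1, 8), (2, 7)])
probs = [c / sum(dist) for c in dist]
```

With `pair_weight=1.0` every structure counts equally, so `dist[d]` is simply the number of structures at distance `d`. Replacing the uniform weight with Boltzmann factors from an energy model would require a more detailed decomposition than this sketch provides.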