Dykeman Eric C
York Centre for Complex Systems Analysis, Department of Mathematics and Biology University of York, Deramore Lane, York, YO10 5GE, UK
Nucleic Acids Res. 2015 Jul 13;43(12):5708-15. doi: 10.1093/nar/gkv480. Epub 2015 May 18.
In this paper I outline a fast method called KFOLD for implementing the Gillepie algorithm to stochastically sample the folding kinetics of an RNA molecule at single base-pair resolution. In the same fashion as the KINFOLD algorithm, which also uses the Gillespie algorithm to predict folding kinetics, KFOLD stochastically chooses a new RNA secondary structure state that is accessible from the current state by a single base-pair addition/deletion following the Gillespie procedure. However, unlike KINFOLD, the KFOLD algorithm utilizes the fact that many of the base-pair addition/deletion reactions and their corresponding rates do not change between each step in the algorithm. This allows KFOLD to achieve a substantial speed-up in the time required to compute a prediction of the folding pathway and, for a fixed number of base-pair moves, performs logarithmically with sequence size. This increase in speed opens up the possibility of studying the kinetics of much longer RNA sequences at single base-pair resolution while also allowing for the RNA folding statistics of smaller RNA sequences to be computed much more quickly.
在本文中,我概述了一种名为KFOLD的快速方法,用于实现吉列斯皮算法,以单碱基对分辨率随机采样RNA分子的折叠动力学。与同样使用吉列斯皮算法预测折叠动力学的KINFOLD算法类似,KFOLD按照吉列斯皮程序,随机选择一个新的RNA二级结构状态,该状态可通过在当前状态下添加/删除单个碱基对来实现。然而,与KINFOLD不同的是,KFOLD算法利用了这样一个事实,即在算法的每一步之间,许多碱基对添加/删除反应及其相应速率不会改变。这使得KFOLD在计算折叠途径预测所需的时间上大幅提速,并且对于固定数量的碱基对移动,计算时间与序列大小呈对数关系。速度的提升使得在单碱基对分辨率下研究更长RNA序列的动力学成为可能,同时也能更快地计算较小RNA序列的RNA折叠统计信息。