University of British Columbia, Canada.
California Institute of Technology, United States of America.
Comput Biol Chem. 2023 Jun;104:107837. doi: 10.1016/j.compbiolchem.2023.107837. Epub 2023 Feb 25.
Predicting the kinetics of reactions involving nucleic acid strands is a fundamental task in biology and biotechnology. Reaction kinetics can be modeled as an elementary step continuous-time Markov chain, where states correspond to secondary structures and transitions correspond to base pair formation and breakage. Since the number of states in the Markov chain could be large, rates are determined by estimating the mean first passage time from sampled trajectories. As a result, the cost of kinetic predictions becomes prohibitively expensive for rare events with extremely long trajectories. Also problematic are scenarios where multiple predictions are needed for the same reaction, e.g., under different environmental conditions, or when calibrating model parameters, because a new set of trajectories is needed multiple times. We propose a new method, called pathway elaboration, to handle these scenarios. Pathway elaboration builds a truncated continuous-time Markov chain through both biased and unbiased sampling. The resulting Markov chain has moderate state space size, so matrix methods can efficiently compute reaction rates, even for rare events. Also the transition rates of the truncated Markov chain can easily be adapted when model or environmental parameters are perturbed, making model calibration feasible. We illustrate the utility of pathway elaboration on toehold-mediated strand displacement reactions, show that it well-approximates trajectory-based predictions of unbiased elementary step models on a wide range of reaction types for which such predictions are feasible, and demonstrate that it performs better than alternative truncation-based approaches that are applicable for mean first passage time estimation. Finally, in a small study, we use pathway elaboration to optimize the Metropolis kinetic model of Multistrand, an elementary step simulator, showing that the optimized parameters greatly improve reaction rate predictions. Our framework and dataset are available at https://github.com/DNA-and-Natural-Algorithms-Group/PathwayElaboration.
预测涉及核酸链的反应动力学是生物学和生物技术中的一项基本任务。反应动力学可以建模为一个基本步骤连续时间马尔可夫链,其中状态对应于二级结构,转换对应于碱基对的形成和断裂。由于马尔可夫链中的状态数量可能很大,因此速率是通过估计从采样轨迹的平均首次通过时间来确定的。因此,对于具有极长轨迹的罕见事件,动力学预测的成本变得非常昂贵。同样成问题的是需要对同一反应进行多次预测的情况,例如在不同的环境条件下,或者在对模型参数进行校准时,因为需要多次使用新的轨迹集。我们提出了一种新的方法,称为途径细化,以处理这些情况。途径细化通过有偏和无偏采样构建截断连续时间马尔可夫链。由此产生的马尔可夫链具有适中的状态空间大小,因此即使对于罕见事件,矩阵方法也可以有效地计算反应速率。此外,当模型或环境参数受到干扰时,截断马尔可夫链的转移速率可以很容易地进行调整,从而使模型校准成为可能。我们在 toehold 介导的链置换反应上说明了途径细化的实用性,表明它很好地近似了基于轨迹的无偏基本步骤模型的预测,这些预测对于可行的反应类型范围广泛,并且表明它比适用于平均首次通过时间估计的替代截断方法表现更好。最后,在一项小型研究中,我们使用途径细化来优化多链的 Metropolis 动力学模型,这是一个基本步骤模拟器,表明优化后的参数大大提高了反应速率预测。我们的框架和数据集可在 https://github.com/DNA-and-Natural-Algorithms-Group/PathwayElaboration 上获得。