Centre for High-Throughput Biology & Department of Computer Science and Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada.
PLoS Comput Biol. 2010 Jun 24;6(6):e1000823. doi: 10.1371/journal.pcbi.1000823.
The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few, experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequence not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular environment.
功能 RNA 结构的预测引起了越来越多的关注,因为它使我们能够研究许多基因的潜在功能作用。然而,RNA 结构预测方法假设存在唯一的功能性 RNA 结构,并且也不能预测体内折叠所需的功能特征。为了了解功能性 RNA 结构如何在体内形成,我们需要复杂的实验或可靠的预测方法。到目前为止,只有少数经过实验验证的瞬态 RNA 结构。在计算方面,有几个计算机程序旨在预测体内共转录折叠途径,但这些程序做了一系列简化假设,并且不能捕捉到所有已知影响体内 RNA 折叠的特征。我们想研究进化相关的 RNA 基因在体内是否以相似的方式折叠。为此,我们开发了一种新的计算方法 Transat,它可以检测具有高统计意义的保守螺旋。我们介绍了该方法,进行了全面的性能评估,并表明 Transat 能够预测已知参考结构的结构特征,包括伪结结构以及已知替代结构构象的结构特征。Transat 还可以识别被其他分子结合的无结构子序列,并提供可能定义折叠途径的新螺旋的证据,支持同源 RNA 序列不仅假设类似的参考 RNA 结构,而且折叠方式也相似的观点。最后,我们表明 Transat 预测的结构特征与假设热力学平衡的结构特征不同。与预测折叠途径的现有方法不同,我们的方法是一种比较方法。这有不能预测随时间变化的特征的缺点,但具有突出保守特征和不需要详细了解细胞环境的相当大的优点。