Fu Yinghan, Sharma Gaurav, Mathews David H
Nucleic Acids Res. 2014 Dec 16;42(22):13939-48. doi: 10.1093/nar/gku1172.
Homologous non-coding RNAs frequently exhibit domain insertions, where a branch of secondary structure is inserted in a sequence with respect to its homologs. Dynamic programming algorithms for common secondary structure prediction of multiple RNA homologs, however, do not account for these domain insertions. This paper introduces a novel dynamic programming algorithm methodology that explicitly accounts for the possibility of inserted domains when predicting common RNA secondary structures. The algorithm is implemented as Dynalign II, an update to the Dynalign software package for predicting the common secondary structure of two RNA homologs. This update is accomplished with negligible increase in computational cost. Benchmarks on ncRNA families with domain insertions validate the method. Over base pairs occurring in inserted domains, Dynalign II improves accuracy over Dynalign, attaining 80.8% sensitivity (compared with 14.4% for Dynalign) and 91.4% positive predictive value (PPV) for tRNA; 66.5% sensitivity (compared with 38.9% for Dynalign) and 57.0% PPV for RNase P RNA; and 50.1% sensitivity (compared with 24.3% for Dynalign) and 58.5% PPV for SRP RNA. Compared with Dynalign, Dynalign II also exhibits statistically significant improvements in overall sensitivity and PPV. Dynalign II is available as a component of RNAstructure, which can be downloaded from http://rna.urmc.rochester.edu/RNAstructure.html.
同源非编码RNA经常表现出结构域插入现象,即相对于其同源物而言,二级结构的一个分支插入到序列中。然而,用于多个RNA同源物共同二级结构预测的动态规划算法并未考虑这些结构域插入。本文介绍了一种新颖的动态规划算法方法,在预测常见RNA二级结构时明确考虑了插入结构域的可能性。该算法以Dynalign II的形式实现,它是用于预测两个RNA同源物共同二级结构的Dynalign软件包的更新版本。这种更新在计算成本增加可忽略不计的情况下完成。对具有结构域插入的非编码RNA家族的基准测试验证了该方法。在插入结构域中出现的碱基对上,Dynalign II比Dynalign提高了准确性,对于tRNA,灵敏度达到80.8%(Dynalign为14.4%),阳性预测值(PPV)为91.4%;对于核糖核酸酶P RNA,灵敏度为66.5%(Dynalign为38.9%),PPV为57.0%;对于信号识别颗粒RNA,灵敏度为50.1%(Dynalign为24.3%),PPV为58.5%。与Dynalign相比,Dynalign II在总体灵敏度和PPV方面也有统计学上的显著提高。Dynalign II作为RNAstructure的一个组件可用,可从http://rna.urmc.rochester.edu/RNAstructure.html下载。