Gorodkin J, Heyer L J, Stormo G D
Center for Biological Sequence Analysis, The Technical University of Denmark, Building 206, DK-2800 Lyngby, Denmark.
Nucleic Acids Res. 1997 Sep 15;25(18):3724-32. doi: 10.1093/nar/25.18.3724.
We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints. In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections. The first part utilizes a simplified version of the Sankoff algorithm for simultaneous folding and alignment of RNA sequences, but maintains tractability by constructing multi-sequence alignments from pairwise comparisons. The algorithm finds the multiple alignments using a greedy approach and has similarities to both CLUSTAL and CONSENSUS, but the core algorithm assures that the pairwise alignments are optimized for both sequence and structure conservation. The choice of scoring system and the method of progressively constructing the final solution are important considerations that are discussed. Example solutions, and comparisons with other approaches, are provided. The solutions include finding consensus structures identical to published ones.
我们提出了一种计算方案,用于利用序列和结构约束对一组RNA序列进行局部比对。此外,该方法在所有可能的集合中搜索具有最显著共同基序的比对结果。第一部分使用了Sankoff算法的简化版本,用于RNA序列的同时折叠和比对,但通过从成对比较构建多序列比对来保持可处理性。该算法使用贪婪方法找到多序列比对,与CLUSTAL和CONSENSUS都有相似之处,但核心算法确保成对比对在序列和结构保守性方面都得到了优化。评分系统的选择和逐步构建最终解决方案的方法是需要讨论的重要考虑因素。文中提供了示例解决方案以及与其他方法的比较。这些解决方案包括找到与已发表的一致结构相同的结构。