Notredame C, Holm L, Higgins D G
EMBL Outstation-The European Bioinformatics Institute, Hinxton Hall, Hinxton, Cambridge CB10 1SD, UK.
Bioinformatics. 1998 Jun;14(5):407-22. doi: 10.1093/bioinformatics/14.5.407.
In order to increase the accuracy of multiple sequence alignments, we designed a new strategy for optimizing multiple sequence alignments by genetic algorithm. We named it COFFEE (Consistency based Objective Function For alignmEnt Evaluation). The COFFEE score reflects the level of consistency between a multiple sequence alignment and a library containing pairwise alignments of the same sequences.
We show that multiple sequence alignments can be optimized for their COFFEE score with the genetic algorithm package SAGA. The COFFEE function is tested on 11 test cases made of structural alignments extracted from 3D_ali. These alignments are compared to those produced using five alternative methods. Results indicate that COFFEE outperforms the other methods when the level of identity between the sequences is low. Accuracy is evaluated by comparison with the structural alignments used as references. We also show that the COFFEE score can be used as a reliability index on multiple sequence alignments. Finally, we show that given a library of structure-based pairwise sequence alignments extracted from FSSP, SAGA can produce high-quality multiple sequence alignments. The main advantage of COFFEE is its flexibility. With COFFEE, any method suitable for making pairwise alignments can be extended to making multiple alignments.
The package is available along with the test cases through the WWW: http://www. ebi.ac.uk/cedric
为了提高多序列比对的准确性,我们设计了一种通过遗传算法优化多序列比对的新策略。我们将其命名为COFFEE(基于一致性的比对评估目标函数)。COFFEE分数反映了多序列比对与包含相同序列两两比对的库之间的一致性水平。
我们表明,可以使用遗传算法包SAGA针对其COFFEE分数优化多序列比对。在由从3D_ali提取的结构比对组成的11个测试案例上测试了COFFEE函数。将这些比对与使用五种替代方法产生的比对进行比较。结果表明,当序列之间的同一性水平较低时,COFFEE优于其他方法。通过与用作参考的结构比对进行比较来评估准确性。我们还表明,COFFEE分数可以用作多序列比对的可靠性指标。最后,我们表明,给定从FSSP提取的基于结构的两两序列比对库,SAGA可以产生高质量的多序列比对。COFFEE的主要优点是其灵活性。使用COFFEE,可以将任何适用于进行两两比对的方法扩展到进行多比对。
该软件包以及测试案例可通过万维网获取:http://www.ebi.ac.uk/cedric