Notredame C, Higgins D G
EMBL outstation, The European Bioinformatics Institute, Cambridge, UK.
Nucleic Acids Res. 1996 Apr 15;24(8):1515-24. doi: 10.1093/nar/24.8.1515.
We describe a new approach to multiple sequence alignment using genetic algorithms and an associated software package called SAGA. The method involves evolving a population of alignments in a quasi evolutionary manner and gradually improving the fitness of the population as measured by an objective function which measures multiple alignment quality. SAGA uses an automatic scheduling scheme to control the usage of 22 different operators for combining alignments or mutating them between generations. When used to optimise the well known sums of pairs objective function, SAGA performs better than some of the widely used alternative packages. This is seen with respect to the ability to achieve an optimal solution and with regard to the accuracy of alignment by comparison with reference alignments based on sequences of known tertiary structure. The general attraction of the approach is the ability to optimise any objective function that one can invent.
我们描述了一种使用遗传算法进行多序列比对的新方法以及一个名为SAGA的相关软件包。该方法涉及以一种准进化方式演化比对群体,并通过一个衡量多序列比对质量的目标函数来逐步提高群体的适应度。SAGA使用一种自动调度方案来控制22种不同操作符的使用,这些操作符用于在各代之间组合比对或使其发生变异。当用于优化著名的双序列和目标函数时,SAGA的表现优于一些广泛使用的其他软件包。这在实现最优解的能力方面以及通过与基于已知三级结构序列的参考比对进行比较的比对准确性方面都有所体现。该方法的普遍吸引力在于能够优化人们可以设想的任何目标函数。