Lemmon Alan R, Milinkovitch Michel C
Laboratory of Evolutionary Genetics, Free University of Brussels (ULB), Cp 300, Institute of Molecular Biology and Medicine, Rue Jeener and Brachet 12, B-6041 Gosselies, Belgium.
Proc Natl Acad Sci U S A. 2002 Aug 6;99(16):10516-21. doi: 10.1073/pnas.162224399. Epub 2002 Jul 25.
Large phylogeny estimation is a combinatorial optimization problem that no future computer will ever be able to solve exactly in practical computing time. The difficulty of the problem is amplified by the need to use complex evolutionary models and large taxon samplings. Hence, many heuristic approaches have been developed, with varying degrees of success. Here, we report on a heuristic approach, the metapopulation genetic algorithm, involving several populations of trees that are forced to cooperate in the search for the optimal tree. Within each population, trees are subjected to evaluation, selection, and mutation events, which are directed by using inter-population consensus information. The method proves to be both very accurate and vastly faster than existing heuristics, such that data sets comprised of hundreds of taxa can be analyzed in practical computing times under complex models of maximum-likelihood evolution. Branch support values produced by the metapopulation genetic algorithm might closely approximate the posterior probabilities of the corresponding branches.
大型系统发育估计是一个组合优化问题,在实际计算时间内,未来的计算机也无法精确求解。由于需要使用复杂的进化模型和大量的分类群样本,该问题的难度进一步加大。因此,人们开发了许多启发式方法,但成功程度各不相同。在此,我们报告一种启发式方法——集合种群遗传算法,该方法涉及多个树种群,这些种群在寻找最优树的过程中被迫合作。在每个种群内部,树会经历评估、选择和变异事件,这些事件通过种群间的共识信息来引导。该方法被证明既非常准确,又比现有的启发式方法快得多,以至于在复杂的最大似然进化模型下,由数百个分类群组成的数据集能够在实际计算时间内得到分析。集合种群遗传算法产生的分支支持值可能与相应分支的后验概率非常接近。