Department of Statistics, University of British Columbia, Vancouver, BC V6T 1Z2, Canada.
Syst Biol. 2012 Jul;61(4):579-93. doi: 10.1093/sysbio/syr131. Epub 2012 Jan 4.
Bayesian inference provides an appealing general framework for phylogenetic analysis, able to incorporate a wide variety of modeling assumptions and to provide a coherent treatment of uncertainty. Existing computational approaches to bayesian inference based on Markov chain Monte Carlo (MCMC) have not, however, kept pace with the scale of the data analysis problems in phylogenetics, and this has hindered the adoption of bayesian methods. In this paper, we present an alternative to MCMC based on Sequential Monte Carlo (SMC). We develop an extension of classical SMC based on partially ordered sets and show how to apply this framework--which we refer to as PosetSMC--to phylogenetic analysis. We provide a theoretical treatment of PosetSMC and also present experimental evaluation of PosetSMC on both synthetic and real data. The empirical results demonstrate that PosetSMC is a very promising alternative to MCMC, providing up to two orders of magnitude faster convergence. We discuss other factors favorable to the adoption of PosetSMC in phylogenetics, including its ability to estimate marginal likelihoods, its ready implementability on parallel and distributed computing platforms, and the possibility of combining with MCMC in hybrid MCMC-SMC schemes. Software for PosetSMC is available at http://www.stat.ubc.ca/ bouchard/PosetSMC.
贝叶斯推断为系统发育分析提供了一个吸引人的通用框架,能够结合各种建模假设,并为不确定性提供一致的处理。然而,现有的基于马尔可夫链蒙特卡罗 (MCMC) 的贝叶斯推断计算方法并没有跟上系统发育学中数据分析问题的规模,这阻碍了贝叶斯方法的采用。在本文中,我们提出了一种基于序贯蒙特卡罗 (SMC) 的替代方法。我们开发了一种基于偏序集的经典 SMC 扩展,并展示了如何将这种框架(我们称之为 PosetSMC)应用于系统发育分析。我们对 PosetSMC 进行了理论处理,并对其在合成和真实数据上的实验评估进行了描述。实证结果表明,PosetSMC 是 MCMC 的一种很有前途的替代方法,它的收敛速度快了两个数量级。我们还讨论了在系统发育学中采用 PosetSMC 的其他有利因素,包括它能够估计边缘似然度、在并行和分布式计算平台上易于实现以及与 MCMC 结合的可能性在混合 MCMC-SMC 方案中。PosetSMC 的软件可在 http://www.stat.ubc.ca/ bouchard/PosetSMC 获得。