Xue Alexander T, Hickerson Michael J
Department of Biology: Subprogram in Ecology, Evolutionary Biology, and Behavior, City College and Graduate Center of City University of New York, 160 Convent Avenue, Marshak Science Building, Room 526, New York, NY, 10031, USA.
Division of Invertebrate Zoology, American Museum of Natural History, New York, NY, 10024, USA.
Mol Ecol. 2015 Dec;24(24):6223-40. doi: 10.1111/mec.13447. Epub 2015 Dec 12.
Understanding how assemblages of species responded to past climate change is a central goal of comparative phylogeography and comparative population genomics, an endeavour that has increasing potential to integrate with community ecology. New sequencing technology now provides the potential to perform complex demographic inference at unprecedented resolution across assemblages of nonmodel species. To this end, we introduce the aggregate site frequency spectrum (aSFS), an expansion of the site frequency spectrum to use single nucleotide polymorphism (SNP) data sets collected from multiple, co-distributed species for assemblage-level demographic inference. We describe how the aSFS is constructed over an arbitrary number of independent population samples and then demonstrate how the aSFS can differentiate various multispecies demographic histories under a wide range of sampling configurations while allowing effective population sizes and expansion magnitudes to vary independently. We subsequently couple the aSFS with a hierarchical approximate Bayesian computation (hABC) framework to estimate degree of temporal synchronicity in expansion times across taxa, including an empirical demonstration with a data set consisting of five populations of the threespine stickleback (Gasterosteus aculeatus). Corroborating what is generally understood about the recent postglacial origins of these populations, the joint aSFS/hABC analysis strongly suggests that the stickleback data are most consistent with synchronous expansion after the Last Glacial Maximum (posterior probability = 0.99). The aSFS will have general application for multilevel statistical frameworks to test models involving assemblages and/or communities, and as large-scale SNP data from nonmodel species become routine, the aSFS expands the potential for powerful next-generation comparative population genomic inference.
了解物种组合如何应对过去的气候变化是比较系统地理学和比较种群基因组学的核心目标,这一努力与群落生态学整合的潜力越来越大。新的测序技术现在提供了以前所未有的分辨率对非模式物种组合进行复杂的种群统计学推断的潜力。为此,我们引入了总体位点频率谱(aSFS),它是位点频率谱的扩展,用于利用从多个共分布物种收集的单核苷酸多态性(SNP)数据集进行组合水平的种群统计学推断。我们描述了如何在任意数量的独立种群样本上构建aSFS,然后展示了aSFS如何在广泛的抽样配置下区分各种多物种的种群历史,同时允许有效种群大小和扩张幅度独立变化。随后,我们将aSFS与分层近似贝叶斯计算(hABC)框架相结合,以估计不同分类群扩张时间的时间同步程度,包括用由五个三刺鱼(Gasterosteus aculeatus)种群组成的数据集进行的实证演示。联合aSFS/hABC分析有力地表明,刺鱼数据与末次盛冰期之后的同步扩张最为一致(后验概率 = 0.99),这证实了人们对这些种群近期冰后期起源的普遍理解。aSFS将广泛应用于多级统计框架,以检验涉及组合和/或群落的模型,并且随着来自非模式物种的大规模SNP数据变得常规化,aSFS扩展了强大的下一代比较种群基因组推断的潜力。