Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA.
Mol Ecol. 2013 Jun;22(11):3124-40. doi: 10.1111/mec.12354. Epub 2013 May 24.
Massively parallel short-read sequencing technologies, coupled with powerful software platforms, are enabling investigators to analyse tens of thousands of genetic markers. This wealth of data is rapidly expanding and allowing biological questions to be addressed with unprecedented scope and precision. The sizes of the data sets now pose significant data processing and analysis challenges. Here we describe an extension of the Stacks software package to efficiently use genotype-by-sequencing data for studies of populations of organisms. Stacks now produces core population genomic summary statistics and SNP-by-SNP statistical tests. These statistics can be analysed across a reference genome using a smoothed sliding window. Stacks also now provides output formats for several commonly used downstream analysis packages. The expanded population genomics functions in Stacks will make it a useful tool to harness the newest generation of massively parallel genotyping data for ecological and evolutionary genetics.
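The smoothed sliding window mentioned in the abstract can be illustrated with a minimal sketch. It assumes a Gaussian kernel that weights each SNP by its distance from the window centre and ignores SNPs beyond three standard deviations. The function names, the default `sigma`, and the three-sigma cutoff are illustrative assumptions, not the Stacks implementation.

```python
import math


def smoothed_window(positions, values, center, sigma=150_000.0):
    """Gaussian-kernel weighted mean of per-SNP values around `center`.

    positions: basepair coordinates of SNPs on one chromosome
    values:    one statistic per SNP (for example a per-SNP diversity value)
    sigma:     kernel standard deviation in basepairs (assumed default)
    """
    limit = 3.0 * sigma  # assumed cutoff: SNPs farther away are ignored
    weighted_sum = 0.0
    weight_total = 0.0
    for pos, val in zip(positions, values):
        dist = abs(pos - center)
        if dist > limit:
            continue
        weight = math.exp(-(dist ** 2) / (2.0 * sigma ** 2))
        weighted_sum += weight * val
        weight_total += weight
    # No SNP inside the window: report a missing value.
    return weighted_sum / weight_total if weight_total > 0 else float("nan")


def smooth_along_chromosome(positions, values, sigma=150_000.0):
    """Centre one window on every SNP and return the smoothed series."""
    return [smoothed_window(positions, values, p, sigma) for p in positions]
```

Centring a window on each SNP, rather than on a fixed grid, keeps the output aligned with the per-SNP statistics; a fixed step size would work equally well in this sketch.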