Sovic Michael G, Fries Anthony C, Gibbs H Lisle
Department of Evolution, Ecology, and Organismal Biology, Aronoff Laboratory, The Ohio State University, 318 W. 12th Ave, Columbus, OH, 43210, USA.
Ohio Biodiversity Conservation Partnership, Aronoff Laboratory, The Ohio State University, 318 W. 12th Ave, Columbus, OH, 43210, USA.
Mol Ecol Resour. 2015 Sep;15(5):1163-71. doi: 10.1111/1755-0998.12378. Epub 2015 Feb 16.
An increase in studies using restriction site-associated DNA sequencing (RADseq) methods has led to a need for both the development and assessment of novel bioinformatic tools that aid in the generation and analysis of these data. Here, we report the availability of AftrRAD, a bioinformatic pipeline that efficiently assembles and genotypes RADseq data, and outputs these data in various formats for downstream analyses. We use simulated and experimental data sets to evaluate AftrRAD's ability to perform accurate de novo assembly of loci, and we compare its performance with two other commonly used programs, stacks and pyrad. We demonstrate that AftrRAD is able to accurately assemble loci, while accounting for indel variation among alleles, in a more computationally efficient manner than currently available programs. AftrRAD run times are not strongly affected by the number of samples in the data set, making this program a useful tool when multicore systems are not available for parallel processing, or when data sets include large numbers of samples.
使用限制性位点关联DNA测序(RADseq)方法的研究不断增加,这就需要开发和评估有助于生成和分析这些数据的新型生物信息学工具。在此,我们报告了AftrRAD的可用性,这是一种生物信息学流程,可有效组装RADseq数据并进行基因分型,并以各种格式输出这些数据以供下游分析。我们使用模拟和实验数据集来评估AftrRAD对基因座进行准确的从头组装的能力,并将其性能与其他两个常用程序stacks和pyrad进行比较。我们证明,AftrRAD能够以比现有程序更高的计算效率准确地组装基因座,同时考虑等位基因之间的插入缺失变异。AftrRAD的运行时间不受数据集中样本数量的强烈影响,当无法使用多核系统进行并行处理或数据集包含大量样本时,该程序是一个有用的工具。