Lepais Olivier, Weir Jason T
INRA, UMR 1224, Ecologie Comportementale et Biologie des Populations de Poissons, INRA, Saint Pée sur Nivelle, France; Univ Pau & Pays Adour, UMR 1224, Ecologie Comportementale et Biologie des Populations de Poissons, UFR Sciences et Techniques de la Côte Basque, Univ Pau and Pays Adour, Anglet, France.
Mol Ecol Resour. 2014 Nov;14(6):1314-21. doi: 10.1111/1755-0998.12273. Epub 2014 May 26.
Application of high-throughput sequencing platforms in the field of ecology and evolutionary biology is developing quickly with the introduction of efficient methods to reduce genome complexity. Numerous approaches for genome complexity reduction have been developed using different combinations of restriction enzymes, library construction strategies and fragment size selection. As a result, the choice of which techniques to use may become cumbersome, because it is difficult to anticipate the number of loci resulting from each method. We developed SimRAD, an R package that performs in silico restriction enzyme digests and fragment size selection as implemented in most restriction site associated DNA polymorphism and genotyping by sequencing methods. In silico digestion is performed on a reference genome or on a randomly generated DNA sequence when no reference genome sequence is available. SimRAD accurately predicts the number of loci under alternative protocols when a reference genome sequence is available for the targeted species (or a close relative) but may be unreliable when no reference genome is available. SimRAD is also useful for fine-tuning a given protocol to adjust the number of targeted loci. Here, we outline the functionality of SimRAD and provide an illustrative example of the use of the package (available on the CRAN at http://cran.r-project.org/web/packages/SimRAD).
随着高效降低基因组复杂性方法的引入,高通量测序平台在生态学和进化生物学领域的应用发展迅速。利用限制性内切酶、文库构建策略和片段大小选择的不同组合,已经开发出了多种降低基因组复杂性的方法。因此,选择使用哪种技术可能会变得麻烦,因为很难预测每种方法产生的位点数量。我们开发了SimRAD,这是一个R包,它能像大多数限制性位点相关DNA多态性和测序基因分型方法那样进行计算机模拟限制性内切酶消化和片段大小选择。当有参考基因组序列时,在参考基因组上进行计算机模拟消化;当没有参考基因组序列时,则在随机生成的DNA序列上进行。当目标物种(或近缘物种)有参考基因组序列时,SimRAD能准确预测不同方案下的位点数量,但在没有参考基因组时可能不可靠。SimRAD对于微调给定方案以调整目标位点数量也很有用。在这里,我们概述了SimRAD的功能,并提供了该软件包使用的示例(可从CRAN获取,网址为http://cran.r-project.org/web/packages/SimRAD)。