Institute of Ecology and Earth Sciences, University of Tartu, Tartu, Estonia.
Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
Mol Ecol Resour. 2017 Nov;17(6):e234-e240. doi: 10.1111/1755-0998.12692. Epub 2017 Jun 21.
High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24 hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.
高通量测序方法已成为环境科学以及公共和私营部门的常规分析工具。这些方法提供了大量的数据,需要经过几个步骤进行分析。虽然可以使用几个公共工具进行生物信息学分析,但许多分析管道为更复杂或定制的设计提供的最佳分析选项太少。在这里,我们介绍了 PipeCraft,这是一个灵活易用的生物信息学管道,具有用户友好的图形界面,可链接用于分析扩增子测序数据的几个公共工具。用户可以通过选择最适合的工具和选项来自行定制管道,以处理来自 Illumina、Pacific Biosciences、Ion Torrent 和 Roche 454 测序平台的原始序列。我们描述了 PipeCraft 的设计和选项,并通过分析来自三个不同测序平台的数据集来评估其性能。我们证明了 PipeCraft 能够在 24 小时内处理大型数据集。图形用户界面和各种生物信息学工具之间的自动链接使工作流程的定制变得更加容易。所有分析步骤和选项都记录在日志文件中,并且易于跟踪。