Plate-forme bio-informatique Genotoul, INRA, Biométrie et Intelligence Artificielle, BP 52627, 31326, Castanet-Tolosan Cedex, France.
BMC Genomics. 2012 Sep 9;13:462. doi: 10.1186/1471-2164-13-462.
Next generation sequencing platforms are now well implanted in sequencing centres and some laboratories. Upcoming smaller scale machines such as the 454 junior from Roche or the MiSeq from Illumina will increase the number of laboratories hosting a sequencer. In such a context, it is important to provide these teams with an easily manageable environment to store and process the produced reads.
We describe a user-friendly information system able to manage large sets of sequencing data. It includes, on one hand, a workflow environment already containing pipelines adapted to different input formats (sff, fasta, fastq and qseq), different sequencers (Roche 454, Illumina HiSeq) and various analyses (quality control, assembly, alignment, diversity studies,…) and, on the other hand, a secured web site giving access to the results. The connected user will be able to download raw and processed data and browse through the analysis result statistics. The provided workflows can easily be modified or extended and new ones can be added. Ergatis is used as a workflow building, running and monitoring system. The analyses can be run locally or in a cluster environment using Sun Grid Engine.
NG6 is a complete information system designed to answer the needs of a sequencing platform. It provides a user-friendly interface to process, store and download high-throughput sequencing data.
下一代测序平台现已广泛应用于测序中心和一些实验室。即将推出的较小规模的仪器,如罗氏的 454 初级版或 Illumina 的 MiSeq,将增加拥有测序仪的实验室数量。在这种情况下,为这些团队提供一个易于管理的环境来存储和处理生成的读取数据非常重要。
我们描述了一个用户友好的信息系统,能够管理大量的测序数据。它包括一方面,一个已经包含了适用于不同输入格式(sff、fasta、fastq 和 qseq)、不同测序仪(罗氏 454、Illumina HiSeq)和各种分析(质量控制、组装、比对、多样性研究等)的工作流程环境;另一方面,一个安全的网站,提供对结果的访问。连接的用户将能够下载原始和处理后的数据,并浏览分析结果的统计信息。提供的工作流程可以轻松修改或扩展,也可以添加新的工作流程。Ergatis 被用作工作流程的构建、运行和监控系统。分析可以在本地或使用 Sun Grid Engine 的集群环境中运行。
NG6 是一个完整的信息系统,旨在满足测序平台的需求。它提供了一个用户友好的界面,用于处理、存储和下载高通量测序数据。