Otto T D, Vasconcellos E A, Gomes L H F, Moreira A S, Degrave W M, Mendonça-Lima L, Alves-Ferreira M
Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brasil.
Genet Mol Res. 2008 Sep 23;7(3):861-71. doi: 10.4238/vol7-3x-meeting04.
Optimizing and monitoring the data flow in high-throughput sequencing facilities is important for managing data input and output, for tracking the status of results for facility users, and for guaranteeing a high-quality service. In a multi-user environment with different throughputs, each user wants to access their own data easily, track their sequencing history, analyze sequences and sequence quality, and apply basic post-sequencing analyses without installing additional software. Recently, Fiocruz established such a core facility as a "technological platform". The infrastructure includes a 48-capillary 3730 DNA Sequence Analyzer (Applied Biosystems) and supporting equipment. The service includes running samples for large-scale users, performing DNA sequencing reactions and runs for medium- and small-scale users, and participating in partial or full genome projects. We implemented a workflow that fulfills these requirements for both low- and high-throughput users. Our implementation also lets the sequencing staff monitor data for continuous quality improvement (reports by plate, month, and user). Users can analyze chromatograms in different ways, such as visualizing good-quality regions, and can process them, for example through comparisons or assemblies. So far, 180 users have used the service, generating 155,000 sequences, 35% of which were produced for the BCG Moreau-RJ genome project. The pipeline, named ChromaPipe (for Chromatogram Pipeline), is available for download by the scientific community at http://bioinfo.pdtis.fiocruz.br/ChromaPipe/. Assembly support is also available as a web service at http://bioinfo.pdtis.fiocruz.br/Assembly/.
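To illustrate what locating a good-quality region of a chromatogram can involve, the sketch below scans per-base Phred quality scores and returns the contiguous span that maximizes the summed excess over a cutoff. This is only an illustration under stated assumptions: the abstract does not describe ChromaPipe's actual method, and the function name `good_quality_region`, the cutoff of 20, and the maximum-sum scan are all hypothetical choices made here.

```python
from typing import List, Tuple


def good_quality_region(quals: List[int], threshold: int = 20) -> Tuple[int, int]:
    """Return the half-open span (start, end) with the largest sum of (q - threshold).

    Hypothetical helper for illustration only; not ChromaPipe's algorithm.
    Returns (0, 0) when no span has a positive score.
    """
    best_sum = 0
    best_span = (0, 0)
    cur_sum = 0
    cur_start = 0
    for i, q in enumerate(quals):
        cur_sum += q - threshold
        if cur_sum <= 0:
            # The running span no longer helps; restart after this base.
            cur_sum = 0
            cur_start = i + 1
        elif cur_sum > best_sum:
            best_sum = cur_sum
            best_span = (cur_start, i + 1)
    return best_span


# Made-up quality values: low-quality ends around a high-quality core.
example_quals = [5, 8, 30, 35, 40, 38, 10, 6]
start, end = good_quality_region(example_quals)
print(f"good-quality region: bases {start}..{end - 1}")
```

With the made-up scores above, the reported span covers the four high-scoring bases in the middle of the read. A facility would choose its own cutoff and might prefer an error-probability-based trimming rule instead.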