Spjuth Ola, Bongcam-Rudloff Erik, Dahlberg Johan, Dahlö Martin, Kallio Aleksi, Pireddu Luca, Vezzi Francesco, Korpelainen Eija
Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Uppsala, P.O. Box 591, SE-75124, Sweden.
SLU-Global Bioinformatics Centre, Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden.
Gigascience. 2016 Jun 7;5:26. doi: 10.1186/s13742-016-0132-7.
With ever-increasing amounts of data being produced by next-generation sequencing (NGS) experiments, the requirements placed on supporting e-infrastructures have grown. In this work, we provide recommendations based on the collective experiences from participants in the EU COST Action SeqAhead for the tasks of data preprocessing, upstream processing, data delivery, and downstream analysis, as well as long-term storage and archiving. We cover demands on computational and storage resources, networks, software stacks, automation of analysis, education, and also discuss emerging trends in the field. E-infrastructures for NGS require substantial effort to set up and maintain over time, and with sequencing technologies and best practices for data analysis evolving rapidly it is important to prioritize both processing capacity and e-infrastructure flexibility when making strategic decisions to support the data analysis demands of tomorrow. Due to increasingly demanding technical requirements we recommend that e-infrastructure development and maintenance be handled by a professional service unit, be it internal or external to the organization, and emphasis should be placed on collaboration between researchers and IT professionals.
随着下一代测序(NGS)实验产生的数据量不断增加,对支持性电子基础设施的要求也在提高。在这项工作中,我们根据欧盟COST行动SeqAhead参与者的集体经验,针对数据预处理、上游处理、数据交付和下游分析以及长期存储和存档任务提供建议。我们涵盖了对计算和存储资源、网络、软件栈、分析自动化、教育的需求,并讨论了该领域的新兴趋势。NGS的电子基础设施需要随着时间的推移投入大量精力来建立和维护,并且随着测序技术和数据分析最佳实践的快速发展,在做出战略决策以支持未来的数据分析需求时,优先考虑处理能力和电子基础设施的灵活性非常重要。由于技术要求日益苛刻,我们建议电子基础设施的开发和维护由专业服务部门负责,无论是组织内部还是外部的,并应强调研究人员与IT专业人员之间的合作。