Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK.
NIHR Oxford Biomedical Research Centre, University of Oxford, UK.
Nucleic Acids Res. 2020 Jul 2;48(W1):W366-W371. doi: 10.1093/nar/gkaa413.
Metagenomic sequencing combined with Oxford Nanopore Technology has the potential to become a point-of-care test for infectious disease in public health and clinical settings, providing rapid diagnosis of infection, guiding individual patient management and treatment strategies, and informing infection prevention and control practices. However, publicly available, streamlined, and reproducible pipelines for analyzing Nanopore metagenomic sequencing data are still lacking. Here we introduce NanoSPC, a scalable, portable and cloud compatible pipeline for analyzing Nanopore sequencing data. NanoSPC can identify potentially pathogenic viruses and bacteria simultaneously to provide comprehensive characterization of individual samples. The pipeline can also detect single nucleotide variants and assemble high quality complete consensus genome sequences, permitting high-resolution inference of transmission. We implement NanoSPC using Nextflow manager within Docker images to allow reproducibility and portability of the analysis. Moreover, we deploy NanoSPC to our scalable pathogen pipeline platform, enabling elastic computing for high throughput Nanopore data on HPC cluster as well as multiple cloud platforms, such as Google Cloud, Amazon Elastic Computing Cloud, Microsoft Azure and OpenStack. Users could either access our web interface (https://nanospc.mmmoxford.uk) to run cloud-based analysis, monitor process, and visualize results, as well as download Docker images and run command line to analyse data locally.
宏基因组测序结合牛津纳米孔技术有可能成为公共卫生和临床环境中传染病的即时检测方法,提供感染的快速诊断,指导个体患者的管理和治疗策略,并为感染预防和控制措施提供信息。然而,用于分析纳米孔宏基因组测序数据的公开、精简和可重复的管道仍然缺乏。在这里,我们介绍了 NanoSPC,这是一种用于分析纳米孔测序数据的可扩展、便携和与云兼容的管道。NanoSPC 可以同时识别潜在的致病性病毒和细菌,从而对个体样本进行全面表征。该管道还可以检测单核苷酸变体并组装高质量的完整一致基因组序列,从而能够对传播进行高分辨率推断。我们使用 Nextflow 管理器在 Docker 映像中实现了 NanoSPC,以实现分析的可重复性和可移植性。此外,我们将 NanoSPC 部署到我们可扩展的病原体管道平台上,从而能够在高性能计算集群以及多个云平台(如谷歌云、亚马逊弹性计算云、微软 Azure 和 OpenStack)上对高通量纳米孔数据进行弹性计算。用户可以访问我们的网络界面(https://nanospc.mmmoxford.uk)来运行基于云的分析、监控流程和可视化结果,以及下载 Docker 映像并在本地运行命令行来分析数据。