BugSeq Bioinformatics Inc, Vancouver, BC, Canada.
BMC Bioinformatics. 2021 Mar 25;22(1):160. doi: 10.1186/s12859-021-04089-5.
As the use of nanopore sequencing for metagenomic analysis increases, tools capable of performing long-read taxonomic classification (ie. determining the composition of a sample) in a fast and accurate manner are needed. Existing tools were either designed for short-read data (eg. Centrifuge), take days to analyse modern sequencer outputs (eg. MetaMaps) or suffer from suboptimal accuracy (eg. CDKAM). Additionally, all tools require command line expertise and do not scale in the cloud.
We present BugSeq, a novel, highly accurate metagenomic classifier for nanopore reads. We evaluate BugSeq on simulated data, mock microbial communities and real clinical samples. On the ZymoBIOMICS Even and Log communities, BugSeq (F1 = 0.95 at species level) offers better read classification than MetaMaps (F1 = 0.89-0.94) in a fraction of the time. BugSeq significantly improves on the accuracy of Centrifuge (F1 = 0.79-0.93) and CDKAM (F1 = 0.91-0.94) while offering competitive run times. When applied to 41 samples from patients with lower respiratory tract infections, BugSeq produces greater concordance with microbiological culture and qPCR compared with "What's In My Pot" analysis.
BugSeq is deployed to the cloud for easy and scalable long-read metagenomic analyses. BugSeq is freely available for non-commercial use at https://bugseq.com/free .
随着纳米孔测序在宏基因组分析中的应用越来越多,需要能够快速准确地进行长读长分类(即确定样本的组成)的工具。现有的工具要么是为短读长数据设计的(例如 Centrifuge),要么需要数天时间来分析现代测序仪的输出(例如 MetaMaps),要么准确性不理想(例如 CDKAM)。此外,所有工具都需要命令行专业知识,并且无法在云中扩展。
我们提出了 BugSeq,这是一种用于纳米孔读长的新型、高度准确的宏基因组分类器。我们在模拟数据、模拟微生物群落和真实临床样本上评估了 BugSeq。在 ZymoBIOMICS Even 和 Log 群落中,BugSeq(在物种水平上的 F1 值为 0.95)在短时间内提供了比 MetaMaps(F1 值为 0.89-0.94)更好的读分类。BugSeq 显著提高了 Centrifuge(F1 值为 0.79-0.93)和 CDKAM(F1 值为 0.91-0.94)的准确性,同时提供了有竞争力的运行时间。当应用于 41 名下呼吸道感染患者的样本时,BugSeq 与微生物培养和 qPCR 的一致性优于“我的锅中有什么”分析。
BugSeq 已部署到云端,可轻松实现可扩展的长读长宏基因组分析。BugSeq 可在 https://bugseq.com/free 免费供非商业用途使用。