Pacific Northwest Laboratory, US Food and Drug Administration, 22201 23rd Drive SE, Bothell, WA, 98021, USA.
Sci Rep. 2021 Mar 11;11(1):5676. doi: 10.1038/s41598-021-84956-6.
The sequencing, assembly, and analysis of bacterial genomes is central to tracking and characterizing foodborne pathogens. The bulk of bacterial genome sequencing at the US Food and Drug Administration is performed using short-read Illumina MiSeq technology, resulting in highly accurate but fragmented genomic sequences. The MinION sequencer from Oxford Nanopore is an evolving technology that produces long-read sequencing data with low equipment cost. The goal of this study was to compare Campylobacter genome assemblies generated from MiSeq and MinION data independently, as well as hybrid genome assemblies combining both data types. Two reference strains and two field isolates of C. jejuni were sequenced using MiSeq and MinION, and the sequence data were assembled using the software programs SPAdes and Canu, respectively. Hybrid genome assembly was performed using the program Unicycler. Comparison of the C. jejuni 81-176 and RM1221 genome assemblies to the PacBio reference genomes revealed that the SPAdes assemblies had the most accurate nucleotide identity, while the hybrid assemblies were the most contiguous. Assemblies generated only from MinION data using Canu were the least accurate, containing many indels and substitutions that affected downstream analyses. The hybrid sequencing approach was the most useful for detecting plasmids, large genome rearrangements, and repetitive elements such as rRNA and tRNA genes. The full genomes of both C. jejuni field isolates were completed and circularized using hybrid sequencing, and a plasmid was detected in one isolate. Continued development of nanopore sequencing technologies will likely enhance the accuracy of hybrid genome assemblies and enable public health laboratories to routinely generate complete circularized bacterial genome sequences.
细菌基因组的测序、组装和分析是追踪和描述食源性病原体的核心。美国食品和药物管理局(FDA)的大部分细菌基因组测序工作都是使用 Illumina MiSeq 短读长技术完成的,这会产生高度准确但碎片化的基因组序列。Oxford Nanopore 的 MinION 测序仪是一种不断发展的技术,可生成具有低设备成本的长读长测序数据。本研究的目的是独立比较 MiSeq 和 MinION 数据生成的弯曲杆菌基因组组装,以及结合两种数据类型的混合基因组组装。使用 MiSeq 和 MinION 对 2 个参考株和 2 个田间分离株的 C. jejuni 进行测序,分别使用软件程序 SPAdes 和 Canu 对序列数据进行组装。使用程序 Unicycler 进行混合基因组组装。将 C. jejuni 81-176 和 RM1221 基因组组装与 PacBio 参考基因组进行比较,结果表明 SPAdes 组装具有最高的核苷酸同一性,而混合组装最连续。仅使用 Canu 从 MinION 数据生成的组装准确性最低,包含许多影响下游分析的插入和替换。混合测序方法最有助于检测质粒、大基因组重排和 rRNA 和 tRNA 等重复元件。使用混合测序完成了两个 C. jejuni 田间分离株的全基因组并使其环状化,并在一个分离株中检测到一个质粒。纳米孔测序技术的持续发展可能会提高混合基因组组装的准确性,并使公共卫生实验室能够常规生成完整的环状细菌基因组序列。