Jeong Haeyoung, Lee Dae-Hee, Ryu Choong-Min, Park Seung-Hwan
Super-Bacteria Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon 34141, Republic of Korea.
Biosystems and Bioengineering Program, Korea University of Science and Technology (UST), Daejeon 34113, Republic of Korea.
J Microbiol Biotechnol. 2016 Jan;26(1):207-12. doi: 10.4014/jmb.1507.07055.
PacBio's long-read sequencing technologies can be successfully used for a complete bacterial genome assembly using recently developed non-hybrid assemblers in the absence of secondgeneration, high-quality short reads. However, standardized procedures that take into account multiple pre-existing second-generation sequencing platforms are scarce. In addition to Illumina HiSeq and Ion Torrent PGM-based genome sequencing results derived from previous studies, we generated further sequencing data, including from the PacBio RS II platform, and applied various bioinformatics tools to obtain complete genome assemblies for five bacterial strains. Our approach revealed that the hierarchical genome assembly process (HGAP) non-hybrid assembler resulted in nearly complete assemblies at a moderate coverage of ~75x, but that different versions produced non-compatible results requiring post processing. The other two platforms further improved the PacBio assembly through scaffolding and a final error correction.
在没有第二代高质量短读长序列的情况下,太平洋生物科学公司(PacBio)的长读长测序技术可以通过最近开发的非杂交组装器成功用于完整细菌基因组组装。然而,考虑到多个现有第二代测序平台的标准化程序却很稀少。除了来自先前研究的基于Illumina HiSeq和Ion Torrent PGM的基因组测序结果外,我们还生成了更多测序数据,包括来自PacBio RS II平台的数据,并应用各种生物信息学工具来获得五种细菌菌株的完整基因组组装。我们的方法表明,分层基因组组装流程(HGAP)非杂交组装器在约75倍的适度覆盖度下可产生近乎完整的组装结果,但不同版本会产生不兼容的结果,需要进行后期处理。另外两个平台通过搭建支架和最终的错误校正进一步改进了PacBio组装结果。