Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada.
PLoS One. 2012;7(12):e52038. doi: 10.1371/journal.pone.0052038. Epub 2012 Dec 21.
Typically, the assembly and closure of a complete bacterial genome requires substantial additional effort spent in a wet lab for gap resolution and genome polishing. Assembly is further confounded by subspecies polymorphism when starting from metagenome sequence data. In this paper, we describe an in silico gap-resolution strategy that can substantially improve assembly. This strategy resolves assembly gaps in scaffolds using pre-assembled contigs, followed by verification with read mapping. It is capable of resolving assembly gaps caused by repetitive elements and subspecies polymorphisms. Using this strategy, we realized the de novo assembly of the first two Dehalobacter genomes from the metagenomes of two anaerobic mixed microbial cultures capable of reductive dechlorination of chlorinated ethanes and chloroform. Only four additional PCR reactions were required even though the initial assembly with Newbler v. 2.5 produced 101 contigs within 9 scaffolds belonging to two Dehalobacter strains. By applying this strategy to the re-assembly of a recently published genome of Bacteroides, we demonstrate its potential utility for other sequencing projects, both metagenomic and genomic.
通常情况下,组装和闭合一个完整的细菌基因组需要在湿实验室中花费大量额外的精力来解决缺口和基因组抛光问题。当从宏基因组序列数据开始时,组装还会因亚种多态性而变得更加复杂。在本文中,我们描述了一种可以大大提高组装质量的计算上的缺口解决策略。该策略使用预组装的 contigs 来解决支架中的组装缺口,然后通过读映射进行验证。它能够解决由重复元件和亚种多态性引起的组装缺口。使用这种策略,我们从能够还原脱氯氯化乙烷和三氯甲烷的两种厌氧混合微生物培养物的宏基因组中实现了前两个 Dehalobacter 基因组的从头组装。即使使用 Newbler v. 2.5 进行初始组装会在属于两个 Dehalobacter 菌株的 9 个支架中产生 101 个 contigs,也只需要另外进行四个 PCR 反应。通过将该策略应用于最近发表的 Bacteroides 基因组的重新组装,我们证明了它在其他测序项目中的潜力,包括宏基因组和基因组。