Bishara Alex, Moss Eli L, Kolmogorov Mikhail, Parada Alma E, Weng Ziming, Sidow Arend, Dekas Anne E, Batzoglou Serafim, Bhatt Ami S
Department of Computer Science, Stanford University, Stanford, California, USA.
Department of Medicine (Hematology, Blood and Marrow Transplantation) and Department of Genetics, Stanford University, Stanford, California, USA.
Nat Biotechnol. 2018 Oct 15. doi: 10.1038/nbt.4266.
Although shotgun metagenomic sequencing of microbiome samples enables partial reconstruction of strain-level community structure, obtaining high-quality microbial genome drafts without isolation and culture remains difficult. Here, we present an application of read clouds, short-read sequences tagged with long-range information, to microbiome samples. We present Athena, a de novo assembler that uses read clouds to improve metagenomic assemblies. We applied this approach to sequence stool samples from two healthy individuals and compared it with existing short-read and synthetic long-read metagenomic sequencing techniques. Read-cloud metagenomic sequencing and Athena assembly produced the most comprehensive individual genome drafts with high contiguity (>200-kb N50, fewer than ten contigs), even for bacteria with relatively low (20×) raw short-read-sequence coverage. We also sequenced a complex marine-sediment sample and generated 24 intermediate-quality genome drafts (>70% complete, <10% contaminated), nine of which were complete (>90% complete, <5% contaminated). Our approach allows for culture-free generation of high-quality microbial genome drafts by using a single shotgun experiment.
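The contiguity and quality thresholds quoted in the abstract can be made concrete with a small sketch. The function names below (`n50`, `is_high_contiguity`, `classify_draft`) are hypothetical helpers written for illustration; they are not part of Athena, and the completeness and contamination percentages are assumed to come from a separate estimation tool. Only the numeric cutoffs are taken from the abstract.

```python
from typing import Sequence


def n50(contig_lengths: Sequence[int]) -> int:
    """Return the N50: the largest length L such that contigs of
    length >= L together cover at least half of the assembly."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty assembly


def is_high_contiguity(contig_lengths: Sequence[int]) -> bool:
    """Abstract's contiguity criterion: N50 > 200 kb and fewer than ten contigs."""
    return n50(contig_lengths) > 200_000 and len(contig_lengths) < 10


def classify_draft(completeness: float, contamination: float) -> str:
    """Label a genome draft using the percentage cutoffs from the abstract.

    completeness and contamination are percentages (0-100), assumed to be
    estimated elsewhere; the label strings are illustrative.
    """
    if completeness > 90 and contamination < 5:
        return "complete"
    if completeness > 70 and contamination < 10:
        return "intermediate"
    return "below threshold"


if __name__ == "__main__":
    draft = [900_000, 600_000, 250_000, 50_000]  # made-up contig lengths
    print(n50(draft), is_high_contiguity(draft))
    print(classify_draft(95.0, 2.0))
```

Note that under these cutoffs every "complete" draft also satisfies the intermediate criteria, which is why the abstract reports the nine complete drafts as a subset of the 24.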