Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 6997801, Israel.
Department of Life Sciences, Ben-Gurion University of the Negev and the National Institute for Biotechnology in the Negev, Beer-Sheva, 8410501, Israel.
Microbiome. 2021 Jun 25;9(1):144. doi: 10.1186/s40168-021-01068-z.
Metagenomic sequencing has led to the identification and assembly of many new bacterial genome sequences. These bacteria often contain plasmids: usually small, circular double-stranded DNA molecules that may transfer across bacterial species and confer antibiotic resistance. These plasmids are generally less studied and understood than their bacterial hosts. Part of the reason for this is insufficient computational tools enabling the analysis of plasmids in metagenomic samples.
We developed SCAPP (Sequence Contents-Aware Plasmid Peeler)-an algorithm and tool to assemble plasmid sequences from metagenomic sequencing. SCAPP builds on some key ideas from the Recycler algorithm while improving plasmid assemblies by integrating biological knowledge about plasmids. We compared the performance of SCAPP to Recycler and metaplasmidSPAdes on simulated metagenomes, real human gut microbiome samples, and a human gut plasmidome dataset that we generated. We also created plasmidome and metagenome data from the same cow rumen sample and used the parallel sequencing data to create a novel assessment procedure. Overall, SCAPP outperformed Recycler and metaplasmidSPAdes across this wide range of datasets.
SCAPP is an easy to use Python package that enables the assembly of full plasmid sequences from metagenomic samples. It outperformed existing metagenomic plasmid assemblers in most cases and assembled novel and clinically relevant plasmids in samples we generated such as a human gut plasmidome. SCAPP is open-source software available from: https://github.com/Shamir-Lab/SCAPP . Video abstract.
宏基因组测序导致了许多新细菌基因组序列的鉴定和组装。这些细菌通常含有质粒:通常是小型的、圆形的双链 DNA 分子,可以在细菌物种之间转移,并赋予抗生素抗性。这些质粒通常比它们的细菌宿主研究和理解得更少。部分原因是缺乏能够分析宏基因组样本中质粒的计算工具。
我们开发了 SCAPP(序列内容感知质粒削皮器)-一种从宏基因组测序中组装质粒序列的算法和工具。SCAPP 建立在 Recycler 算法的一些关键思想之上,同时通过整合关于质粒的生物学知识来改进质粒组装。我们将 SCAPP 的性能与 Recycler 和 metaplasmidSPAdes 在模拟宏基因组、真实人类肠道微生物组样本和我们生成的人类肠道质粒组数据集上进行了比较。我们还从同一牛瘤胃样本创建了质粒组和宏基因组数据,并使用并行测序数据创建了一种新的评估程序。总体而言,SCAPP 在广泛的数据集上都优于 Recycler 和 metaplasmidSPAdes。
SCAPP 是一个易于使用的 Python 包,它能够从宏基因组样本中组装完整的质粒序列。它在大多数情况下都优于现有的宏基因组质粒组装器,并在我们生成的样本中组装了新的和临床相关的质粒,例如人类肠道质粒组。SCAPP 是一个开源软件,可从以下网址获得:https://github.com/Shamir-Lab/SCAPP 。视频摘要。