利用空间图分类器（spacegraphcats）探索大型宏基因组组装图中的群落，揭示隐藏的序列多样性。

Exploring neighborhoods in large metagenome assembly graphs using spacegraphcats reveals hidden sequence diversity.

机构信息

Department of Population Health and Reproduction, University of California Davis, Davis, USA.

Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, USA.

出版信息

Genome Biol. 2020 Jul 6;21(1):164. doi: 10.1186/s13059-020-02066-4.

Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/spacegraphcats under the 3-Clause BSD License.

Exploring neighborhoods in large metagenome assembly graphs using spacegraphcats reveals hidden sequence diversity.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献