Suppr超能文献

使用 MetaMaps 对长读进行菌株水平宏基因组分配和组成估计。

Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps.

机构信息

Institute of Medical Microbiology and Hospital Hygiene, Heinrich-Heine-University Düsseldorf, Düsseldorf, North Rhine-Westphalia, Germany.

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, 20892, USA.

出版信息

Nat Commun. 2019 Jul 11;10(1):3066. doi: 10.1038/s41467-019-10934-2.

Abstract

Metagenomic sequence classification should be fast, accurate and information-rich. Emerging long-read sequencing technologies promise to improve the balance between these factors but most existing methods were designed for short reads. MetaMaps is a new method, specifically developed for long reads, capable of mapping a long-read metagenome to a comprehensive RefSeq database with >12,000 genomes in <16 GB or RAM on a laptop computer. Integrating approximate mapping with probabilistic scoring and EM-based estimation of sample composition, MetaMaps achieves >94% accuracy for species-level read assignment and r > 0.97 for the estimation of sample composition on both simulated and real data when the sample genomes or close relatives are present in the classification database. To address novel species and genera, which are comparatively harder to predict, MetaMaps outputs mapping locations and qualities for all classified reads, enabling functional studies (e.g. gene presence/absence) and detection of incongruities between sample and reference genomes.

摘要

宏基因组序列分类应快速、准确且信息丰富。新兴的长读测序技术有望改善这些因素之间的平衡,但大多数现有方法都是为短读长设计的。MetaMaps 是一种新的方法,专门为长读长设计,能够在笔记本电脑上的 <16GB 或 RAM 中,将长读长宏基因组映射到包含 >12000 个基因组的综合 RefSeq 数据库中。MetaMaps 将近似映射与概率评分和基于 EM 的样本组成估计相结合,在分类数据库中存在样本基因组或近亲时,实现了 >94%的物种级读分配准确率和 r > 0.97 的样本组成估计准确率。为了解决新型物种和属,这些物种和属相对较难预测,MetaMaps 为所有分类读长输出映射位置和质量,从而能够进行功能研究(例如基因存在/缺失)和检测样本与参考基因组之间的不一致性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56af/6624308/ae4842d34eea/41467_2019_10934_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验