• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MGmapper:宏基因组序列 reads 的基于参考的映射和分类注释

MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads.

作者信息

Petersen Thomas Nordahl, Lukjancenko Oksana, Thomsen Martin Christen Frølund, Maddalena Sperotto Maria, Lund Ole, Møller Aarestrup Frank, Sicheritz-Pontén Thomas

机构信息

Department of Bio and Health Informatics, Technical University of Denmark, Kongens Lyngby, Denmark.

National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark.

出版信息

PLoS One. 2017 May 3;12(5):e0176469. doi: 10.1371/journal.pone.0176469. eCollection 2017.

DOI:10.1371/journal.pone.0176469
PMID:28467460
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5415185/
Abstract

An increasing amount of species and gene identification studies rely on the use of next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprised of 8 genuses, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. A sensitivity and precision at 75% was obtained for strain level annotations. A comparison between MGmapper and Kraken at species level, shows MGmapper assigns taxonomy at species level using 84.8% of the sequence reads, compared to 70.5% for Kraken and both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a bitbucked package (https://bitbucket.org/genomicepidemiology/mgmapper). A web-version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets.

摘要

越来越多的物种和基因鉴定研究依赖于对单个分离株或宏基因组样本进行下一代序列分析。有几种方法可用于进行分类注释,之前的一项宏基因组基准研究表明,除非应用阈值或后处理来区分正确和错误的注释,否则大量的假阳性物种注释是一个问题。MGmapper是一个用于处理原始下一代序列数据并执行基于参考的序列分配的软件包,随后进行后处理分析,以在物种和菌株水平分辨率上产生可靠的分类注释。先前使用由8个属、11个物种和12个菌株组成的体外细菌模拟群落样本对宏基因组分类方法进行基准测试。应用后处理过滤器后,我们在物种和属水平上获得了100%正确的分类分配。在菌株水平注释方面,灵敏度和精确度达到了75%。MGmapper和Kraken在物种水平上的比较表明,MGmapper使用84.8%的序列读数在物种水平上分配分类,而Kraken为70.5%,两种方法都识别出所有物种且无假阳性。对于被拒绝和被接受的分类注释,以纯文本和电子表格形式提供了广泛的读数计数统计信息。MGmapper的命令行版本可以使用自定义数据库,完整的流程作为一个bitbucked包(https://bitbucket.org/genomicepidemiology/mgmapper)免费提供。网络版本(https://cge.cbs.dtu.dk/services/MGmapper)提供了分析小型fastq数据集的基本功能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06d9/5415185/b76f10b41f13/pone.0176469.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06d9/5415185/b76f10b41f13/pone.0176469.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06d9/5415185/b76f10b41f13/pone.0176469.g001.jpg

相似文献

1
MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads.MGmapper:宏基因组序列 reads 的基于参考的映射和分类注释
PLoS One. 2017 May 3;12(5):e0176469. doi: 10.1371/journal.pone.0176469. eCollection 2017.
2
MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and Function.MG-RAST,一种用于分析微生物群落结构和功能的宏基因组学服务。
Methods Mol Biol. 2016;1399:207-33. doi: 10.1007/978-1-4939-3369-3_13.
3
Joining Illumina paired-end reads for classifying phylogenetic marker sequences.将 Illumina 配对末端读取用于分类系统发育标记序列。
BMC Bioinformatics. 2020 Mar 14;21(1):105. doi: 10.1186/s12859-020-3445-6.
4
Scalable metagenomics alignment research tool (SMART): a scalable, rapid, and complete search heuristic for the classification of metagenomic sequences from complex sequence populations.可扩展宏基因组比对研究工具(SMART):一种用于对复杂序列群体中的宏基因组序列进行分类的可扩展、快速且完整的搜索启发式方法。
BMC Bioinformatics. 2016 Jul 28;17:292. doi: 10.1186/s12859-016-1159-6.
5
Correction: MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads.更正:MGmapper:宏基因组序列读数的基于参考的映射和分类注释。
PLoS One. 2017 Jun 12;12(6):e0179778. doi: 10.1371/journal.pone.0179778. eCollection 2017.
6
Further steps in TANGO: improved taxonomic assignment in metagenomics.TANGO 的进一步发展:提高宏基因组学中的分类学分配。
Bioinformatics. 2014 Jan 1;30(1):17-23. doi: 10.1093/bioinformatics/btt256. Epub 2013 May 3.
7
Accurate taxonomic assignment of short pyrosequencing reads.对短焦磷酸测序读段进行准确的分类学归属
Pac Symp Biocomput. 2010:3-9. doi: 10.1142/9789814295291_0002.
8
MetaObtainer: A Tool for Obtaining Specified Species from Metagenomic Reads of Next-generation Sequencing.MetaObtainer:一种从下一代测序宏基因组读数中获取特定物种的工具。
Interdiscip Sci. 2015 Dec;7(4):405-13. doi: 10.1007/s12539-015-0281-x. Epub 2015 Aug 21.
9
TaxAss: Leveraging a Custom Freshwater Database Achieves Fine-Scale Taxonomic Resolution.TaxAss:利用自定义淡水数据库实现精细分类学分辨率。
mSphere. 2018 Sep 5;3(5):e00327-18. doi: 10.1128/mSphere.00327-18.
10
Species classifier choice is a key consideration when analysing low-complexity food microbiome data.在分析低复杂度食品微生物组数据时,物种分类器的选择是一个关键考虑因素。
Microbiome. 2018 Mar 20;6(1):50. doi: 10.1186/s40168-018-0437-0.

引用本文的文献

1
Differences in antimicrobial resistance gene abundance and microbial diversity of the gut microbiome in patients on antibiotics enrolled in a clinical trial.参与一项临床试验的使用抗生素患者肠道微生物群中抗菌药物耐药基因丰度和微生物多样性的差异。
Ther Adv Infect Dis. 2025 Jun 5;12:20499361251337597. doi: 10.1177/20499361251337597. eCollection 2025 Jan-Dec.
2
Gut heavy metal and antibiotic resistome of humans living in the high Arctic.生活在北极地区的人类肠道中的重金属和抗生素抗性组
Front Microbiol. 2024 Oct 30;15:1493803. doi: 10.3389/fmicb.2024.1493803. eCollection 2024.
3
: a novel immune modulator in breast cancer?

本文引用的文献

1
Evaluation of shotgun metagenomics sequence classification methods using in silico and in vitro simulated communities.使用计算机模拟和体外模拟群落评估鸟枪法宏基因组学序列分类方法
BMC Bioinformatics. 2015 Nov 4;16:363. doi: 10.1186/s12859-015-0788-5.
2
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes.在不使用参考基因组的情况下,对复杂宏基因组样本中的基因组和遗传元件进行鉴定和组装。
Nat Biotechnol. 2014 Aug;32(8):822-8. doi: 10.1038/nbt.2939. Epub 2014 Jul 6.
3
Bacterial diversity in snow on North Pole ice floes.
: 一种新型乳腺癌免疫调节剂?
Expert Rev Mol Med. 2023 Apr 3;25:e15. doi: 10.1017/erm.2023.9.
4
Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters.在蛋白质家族水平上探索微生物功能多样性——从宏基因组序列 reads 到注释的蛋白质簇。
Front Bioinform. 2023 Mar 3;3:1157956. doi: 10.3389/fbinf.2023.1157956. eCollection 2023.
5
The human microbiome: A promising target for lung cancer treatment.人类微生物组:肺癌治疗的有前途靶点。
Front Immunol. 2023 Jan 30;14:1091165. doi: 10.3389/fimmu.2023.1091165. eCollection 2023.
6
Co-diversification of an intestinal Mycoplasma and its salmonid host.肠道支原体与其鲑科宿主的共同多样化。
ISME J. 2023 May;17(5):682-692. doi: 10.1038/s41396-023-01379-z. Epub 2023 Feb 17.
7
A bat MERS-like coronavirus circulates in pangolins and utilizes human DPP4 and host proteases for cell entry.一种类似中东呼吸综合征的蝙蝠冠状病毒在穿山甲中传播,并利用人类 DPP4 和宿主蛋白酶进入细胞。
Cell. 2023 Feb 16;186(4):850-863.e16. doi: 10.1016/j.cell.2023.01.019.
8
Library Preparation and Sequencing Platform Introduce Bias in Metagenomic-Based Characterizations of Microbiomes.文库制备和测序平台会导致基于宏基因组学的微生物组特征分析产生偏差。
Microbiol Spectr. 2022 Apr 27;10(2):e0009022. doi: 10.1128/spectrum.00090-22. Epub 2022 Mar 15.
9
Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.宏基因组学音乐——应用、分析流程及其相关工具的综述。
Funct Integr Genomics. 2022 Feb;22(1):3-26. doi: 10.1007/s10142-021-00810-y. Epub 2021 Oct 18.
10
Standard Sample Storage Conditions Have an Impact on Inferred Microbiome Composition and Antimicrobial Resistance Patterns.标准样本储存条件对推断的微生物群落组成和抗生素耐药模式有影响。
Microbiol Spectr. 2021 Oct 31;9(2):e0138721. doi: 10.1128/Spectrum.01387-21. Epub 2021 Oct 6.
北极浮冰上雪中的细菌多样性。
Extremophiles. 2014 Nov;18(6):945-51. doi: 10.1007/s00792-014-0660-y. Epub 2014 Jun 21.
4
Kraken: ultrafast metagenomic sequence classification using exact alignments.克拉肯:使用精确比对的超快速宏基因组序列分类
Genome Biol. 2014 Mar 3;15(3):R46. doi: 10.1186/gb-2014-15-3-r46.
5
Benchmarking of methods for genomic taxonomy.基因组分类方法的基准测试
J Clin Microbiol. 2014 May;52(5):1529-39. doi: 10.1128/JCM.02981-13. Epub 2014 Feb 26.
6
Metagenomics reveals sediment microbial community response to Deepwater Horizon oil spill.宏基因组学揭示了沉积物微生物群落对深水地平线石油泄漏的响应。
ISME J. 2014 Jul;8(7):1464-75. doi: 10.1038/ismej.2013.254. Epub 2014 Jan 23.
7
Metagenomic analysis of a permafrost microbial community reveals a rapid response to thaw.对永久冻土微生物群落的宏基因组分析揭示了对解冻的快速响应。
Nature. 2011 Nov 6;480(7377):368-71. doi: 10.1038/nature10576.
8
RAPSearch2: a fast and memory-efficient protein similarity search tool for next-generation sequencing data.RAPSearch2:一种快速且内存高效的用于下一代测序数据的蛋白质相似性搜索工具。
Bioinformatics. 2012 Jan 1;28(1):125-6. doi: 10.1093/bioinformatics/btr595. Epub 2011 Oct 28.
9
Integrative analysis of environmental sequences using MEGAN4.使用 MEGAN4 进行环境序列的综合分析。
Genome Res. 2011 Sep;21(9):1552-60. doi: 10.1101/gr.120618.111. Epub 2011 Jun 20.
10
Taxonomic classification of metagenomic shotgun sequences with CARMA3.基于 CARMA3 的宏基因组鸟枪法测序的分类学分类
Nucleic Acids Res. 2011 Aug;39(14):e91. doi: 10.1093/nar/gkr225. Epub 2011 May 17.