NGS-eval：下一代测序错误分析与新序列变异检测工具

NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL.

作者信息

May Ali, Abeln Sanne, Buijs Mark J, Heringa Jaap, Crielaard Wim, Brandt Bernd W

机构信息

Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam (ACTA), University of Amsterdam and VU University Amsterdam, Amsterdam, The Netherlands Centre for Integrative Bioinformatics (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands.

Centre for Integrative Bioinformatics (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands AIMMS Amsterdam Institute for Molecules Medicines and Systems, VU University Amsterdam, Amsterdam, The Netherlands.

出版信息

Nucleic Acids Res. 2015 Jul 1;43(W1):W301-5. doi: 10.1093/nar/gkv346. Epub 2015 Apr 15.

DOI:10.1093/nar/gkv346

PMID:25878034

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4489229/

Abstract

Massively parallel sequencing of microbial genetic markers (MGMs) is used to uncover the species composition in a multitude of ecological niches. These sequencing runs often contain a sample with known composition that can be used to evaluate the sequencing quality or to detect novel sequence variants. With NGS-eval, the reads from such (mock) samples can be used to (i) explore the differences between the reads and their references and to (ii) estimate the sequencing error rate. This tool maps these reads to references and calculates as well as visualizes the different types of sequencing errors. Clearly, sequencing errors can only be accurately calculated if the reference sequences are correct. However, even with known strains, it is not straightforward to select the correct references from databases. We previously analysed a pyrosequencing dataset from a mock sample to estimate sequencing error rates and detected sequence variants in our mock community, allowing us to obtain an accurate error estimation. Here, we demonstrate the variant detection and error analysis capability of NGS-eval with Illumina MiSeq reads from the same mock community. While tailored towards the field of metagenomics, this server can be used for any type of MGM-based reads. NGS-eval is available at http://www.ibi.vu.nl/programs/ngsevalwww/.

摘要

微生物遗传标记（MGM）的大规模平行测序用于揭示众多生态位中的物种组成。这些测序运行通常包含一个已知组成的样本，可用于评估测序质量或检测新的序列变体。使用NGS-eval，来自此类（模拟）样本的读数可用于（i）探索读数与其参考序列之间的差异，以及（ii）估计测序错误率。该工具将这些读数映射到参考序列，并计算和可视化不同类型的测序错误。显然，只有当参考序列正确时，才能准确计算测序错误。然而，即使对于已知菌株，从数据库中选择正确的参考序列也并非易事。我们之前分析了一个来自模拟样本的焦磷酸测序数据集，以估计测序错误率，并在我们的模拟群落中检测到序列变体，从而使我们能够获得准确的错误估计。在这里，我们用来自同一模拟群落的Illumina MiSeq读数展示了NGS-eval的变体检测和错误分析能力。虽然该服务器是针对宏基因组学领域定制的，但可用于任何基于MGM的读数。可在http://www.ibi.vu.nl/programs/ngsevalwww/获取NGS-eval。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6420/4489229/46064ed3aca3/gkv346fig1.jpg

相似文献

NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL.

Nucleic Acids Res. 2015 Jul 1;43(W1):W301-5. doi: 10.1093/nar/gkv346. Epub 2015 Apr 15.

MetaObtainer: A Tool for Obtaining Specified Species from Metagenomic Reads of Next-generation Sequencing.

Interdiscip Sci. 2015 Dec;7(4):405-13. doi: 10.1007/s12539-015-0281-x. Epub 2015 Aug 21.

IPED: a highly efficient denoising tool for Illumina MiSeq Paired-end 16S rRNA gene amplicon sequencing data.

BMC Bioinformatics. 2016 Apr 29;17(1):192. doi: 10.1186/s12859-016-1061-2.

Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples.

BMC Genomics. 2017 May 15;18(1):378. doi: 10.1186/s12864-017-3721-7.

Gencore: an efficient tool to generate consensus reads for error suppressing and duplicate removing of NGS data.

BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):606. doi: 10.1186/s12859-019-3280-9.

Next-generation sequencing to inventory taxonomic diversity in eukaryotic communities: a test for freshwater diatoms.

Mol Ecol Resour. 2013 Jul;13(4):607-19. doi: 10.1111/1755-0998.12105. Epub 2013 Apr 17.

Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations.

J Virol. 2015 Aug;89(16):8540-55. doi: 10.1128/JVI.00522-15. Epub 2015 Jun 3.

Barcode-free next-generation sequencing error validation for ultra-rare variant detection.

Nat Commun. 2019 Feb 28;10(1):977. doi: 10.1038/s41467-019-08941-4.

NGS for Sequence Variants.

Adv Exp Med Biol. 2016;939:1-20. doi: 10.1007/978-981-10-1503-8_1.

Human papillomavirus genotyping by 454 next generation sequencing technology.

J Clin Virol. 2011 Oct;52(2):93-7. doi: 10.1016/j.jcv.2011.07.006. Epub 2011 Jul 29.

引用本文的文献

Accurate phenotype-to-genotype mapping of high-diversity yeast libraries by heat-shock-electroporation (HEEL).

mBio. 2025 Feb 5;16(2):e0319724. doi: 10.1128/mbio.03197-24. Epub 2024 Dec 20.

Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Funct Integr Genomics. 2022 Feb;22(1):3-26. doi: 10.1007/s10142-021-00810-y. Epub 2021 Oct 18.

OTUs and ASVs Produce Comparable Taxonomic and Diversity from Shrimp Microbiota 16S Profiles Using Tailored Abundance Filters.

Genes (Basel). 2021 Apr 13;12(4):564. doi: 10.3390/genes12040564.

Sequencing error profiles of Illumina sequencing instruments.

NAR Genom Bioinform. 2021 Mar 27;3(1):lqab019. doi: 10.1093/nargab/lqab019. eCollection 2021 Mar.

Sequencing-based microsatellite instability testing using as few as six markers for high-throughput clinical diagnostics.

Hum Mutat. 2020 Jan;41(1):332-341. doi: 10.1002/humu.23906. Epub 2019 Sep 15.

Systematic evaluation of error rates and causes in short samples in next-generation sequencing.

Sci Rep. 2018 Jul 19;8(1):10950. doi: 10.1038/s41598-018-29325-6.

Deep-Coverage MPS Analysis of Heteroplasmic Variants within the mtGenome Allows for Frequent Differentiation of Maternal Relatives.

Genes (Basel). 2018 Feb 26;9(3):124. doi: 10.3390/genes9030124.

Comparative analyses of the major royal jelly protein gene cluster in three Apis species with long amplicon sequencing.

DNA Res. 2017 Jun 1;24(3):279-287. doi: 10.1093/dnares/dsw064.

Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity.

Nat Commun. 2016 Dec 20;7:13642. doi: 10.1038/ncomms13642.

本文引用的文献

Unraveling the outcome of 16S rDNA-based taxonomy analysis through mock data and simulations.

Bioinformatics. 2014 Jun 1;30(11):1530-8. doi: 10.1093/bioinformatics/btu085. Epub 2014 Feb 10.

Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.

Appl Environ Microbiol. 2013 Sep;79(17):5112-20. doi: 10.1128/AEM.01043-13. Epub 2013 Jun 21.

QualitySNPng: a user-friendly SNP detection and visualization tool.

Nucleic Acids Res. 2013 Jul;41(Web Server issue):W587-90. doi: 10.1093/nar/gkt333. Epub 2013 Apr 30.

Updating benchtop sequencing performance comparison.

Nat Biotechnol. 2013 Apr;31(4):294-6. doi: 10.1038/nbt.2522.

Comparing clustering and pre-processing in taxonomy analysis.

Bioinformatics. 2012 Nov 15;28(22):2891-7. doi: 10.1093/bioinformatics/bts552. Epub 2012 Sep 8.

A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.

BMC Genomics. 2012 Jul 24;13:341. doi: 10.1186/1471-2164-13-341.

Structure, function and diversity of the healthy human microbiome.

Nature. 2012 Jun 13;486(7402):207-14. doi: 10.1038/nature11234.

A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE.

PLoS Comput Biol. 2012;8(6):e1002541. doi: 10.1371/journal.pcbi.1002541. Epub 2012 Jun 7.

Performance comparison of benchtop high-throughput sequencing platforms.

Nat Biotechnol. 2012 May;30(5):434-9. doi: 10.1038/nbt.2198.

A survey of error-correction methods for next-generation sequencing.

Brief Bioinform. 2013 Jan;14(1):56-66. doi: 10.1093/bib/bbs015. Epub 2012 Apr 6.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

NGS-eval：下一代测序错误分析与新序列变异检测工具

NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL.

作者信息

May Ali, Abeln Sanne, Buijs Mark J, Heringa Jaap, Crielaard Wim, Brandt Bernd W

机构信息

出版信息

Nucleic Acids Res. 2015 Jul 1;43(W1):W301-5. doi: 10.1093/nar/gkv346. Epub 2015 Apr 15.

DOI:10.1093/nar/gkv346

PMID:25878034

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4489229/

Abstract

摘要

NGS-eval：下一代测序错误分析与新序列变异检测工具

NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

NGS-eval：下一代测序错误分析与新序列变异检测工具

NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献