• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SEQ2MGS:一种从现有测序数据生成逼真的人工宏基因组的有效工具。

SEQ2MGS: an effective tool for generating realistic artificial metagenomes from the existing sequencing data.

作者信息

Van Camp Pieter-Jan, Porollo Aleksey

机构信息

Department of Biomedical Informatics, University of Cincinnati, Cincinnati, OH, USA.

Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.

出版信息

NAR Genom Bioinform. 2022 Jul 25;4(3):lqac050. doi: 10.1093/nargab/lqac050. eCollection 2022 Sep.

DOI:10.1093/nargab/lqac050
PMID:35899079
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9310082/
Abstract

Assessment of bioinformatics tools for the metagenomics analysis from the whole genome sequencing data requires realistic benchmark sets. We developed an effective and simple generator of artificial metagenomes from real sequencing experiments. The tool (SEQ2MGS) analyzes the input FASTQ files, precomputes genomic content, and blends shotgun reads from different sequenced isolates, or spike isolate(s) in real metagenome, in desired proportions. SEQ2MGS eliminates the need for simulation of sequencing platform variations, reads distributions, presence of plasmids, viruses, and contamination. The tool is especially useful for a quick generation of multiple complex samples that include new or understudied organisms, even without assembled genomes. For illustration, we first demonstrated the ease of SEQ2MGS use for the simulation of altered Schaedler flora (ASF) in comparison with metagenomics generators Grinder and CAMISIM. Next, we emulated the emergence of a pathogen in the human gut microbiome and observed that Kraken, Centrifuge, and MetaPhlAn, while correctly identified , produced inconsistent results for the rest of real metagenome. Finally, using the MG-RAST platform, we affirmed that SEQ2MGS properly transfers genomic information from an isolate into the simulated metagenome by the correct identification of antimicrobial resistance genes anticipated to appear compared to the original metagenome.

摘要

从全基因组测序数据进行宏基因组学分析的生物信息学工具评估需要现实的基准数据集。我们从真实测序实验中开发了一种有效且简单的人工宏基因组生成器。该工具(SEQ2MGS)分析输入的FASTQ文件,预先计算基因组内容,并按所需比例混合来自不同测序分离株或真实宏基因组中的加标分离株的鸟枪法 reads。SEQ2MGS无需模拟测序平台变异、reads分布、质粒、病毒和污染情况。该工具对于快速生成多个复杂样本特别有用,这些样本包括新的或研究不足的生物体,即使没有组装好的基因组。为了说明这一点,我们首先展示了与宏基因组学生成器Grinder和CAMISIM相比,SEQ2MGS用于模拟改变的舍德勒菌群(ASF)的简便性。接下来,我们模拟了人类肠道微生物群中病原体的出现,并观察到Kraken、Centrifuge和MetaPhlAn虽然能正确识别,但对其余真实宏基因组产生了不一致的结果。最后,使用MG-RAST平台,我们确认SEQ2MGS通过与原始宏基因组相比正确识别预期出现的抗菌抗性基因,将基因组信息从分离株正确转移到模拟宏基因组中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/c974d954d75c/lqac050fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/e5ab3c3c5f68/lqac050fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/1483bc517eec/lqac050fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/9a5ce8db88a8/lqac050fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/c974d954d75c/lqac050fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/e5ab3c3c5f68/lqac050fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/1483bc517eec/lqac050fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/9a5ce8db88a8/lqac050fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4390/9310082/c974d954d75c/lqac050fig4.jpg

相似文献

1
SEQ2MGS: an effective tool for generating realistic artificial metagenomes from the existing sequencing data.SEQ2MGS:一种从现有测序数据生成逼真的人工宏基因组的有效工具。
NAR Genom Bioinform. 2022 Jul 25;4(3):lqac050. doi: 10.1093/nargab/lqac050. eCollection 2022 Sep.
2
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
3
CAMISIM: simulating metagenomes and microbial communities.CAMISIM:模拟宏基因组和微生物群落。
Microbiome. 2019 Feb 8;7(1):17. doi: 10.1186/s40168-019-0633-6.
4
Human reference gut microbiome catalog including newly assembled genomes from under-represented Asian metagenomes.人类参考肠道微生物组目录,包括来自代表性不足的亚洲宏基因组的新组装基因组。
Genome Med. 2021 Aug 27;13(1):134. doi: 10.1186/s13073-021-00950-7.
5
Assembly and Annotation of Viral Metagenomes from Short-Read Sequencing Data.病毒宏基因组的短读测序数据组装与注释。
Methods Mol Biol. 2023;2649:317-337. doi: 10.1007/978-1-0716-3072-3_17.
6
Multiomics Characterization of the Canada Goose Fecal Microbiome Reveals Selective Efficacy of Simulated Metagenomes.多组学分析加拿大鹅粪便微生物组揭示了模拟宏基因组的选择性功效。
Microbiol Spectr. 2022 Dec 21;10(6):e0238422. doi: 10.1128/spectrum.02384-22. Epub 2022 Nov 1.
7
Deduplication Improves Cost-Efficiency and Yields of Assembly and Binning of Shotgun Metagenomes in Microbiome Research.重复数据删除提高了微生物组研究中鸟枪法宏基因组组装和分箱的成本效益及产量。
Microbiol Spectr. 2023 Feb 6;11(2):e0428222. doi: 10.1128/spectrum.04282-22.
8
Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis.用于宏基因组差异分析的k-mer谱适用性评估。
BMC Bioinformatics. 2016 Jan 16;17:38. doi: 10.1186/s12859-015-0875-7.
9
Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes.人类粪便宏基因组功能注释的测序前和测序后建议。
BMC Bioinformatics. 2020 Feb 24;21(1):74. doi: 10.1186/s12859-020-3416-y.
10
Modeling the limits of detection for antimicrobial resistance genes in agri-food samples: a comparative analysis of bioinformatics tools.模拟农业食品样本中抗微生物药物耐药基因的检测极限:生物信息学工具的比较分析。
BMC Microbiol. 2024 Jan 20;24(1):31. doi: 10.1186/s12866-023-03148-6.

引用本文的文献

1
MeSS and assembly_finder: a toolkit for in silico metagenomic sample generation.MeSS和assembly_finder:一个用于计算机模拟宏基因组样本生成的工具包。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae760.
2
MGS2AMR: a gene-centric mining of metagenomic sequencing data for pathogens and their antimicrobial resistance profile.MGS2AMR:一种基于基因的宏基因组测序数据分析方法,用于病原体及其抗菌药物耐药性特征的挖掘。
Microbiome. 2023 Oct 13;11(1):223. doi: 10.1186/s40168-023-01674-z.

本文引用的文献

1
Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3.利用 bioBakery 3 整合具有分类学、功能和菌株水平特征的多样化微生物群落。
Elife. 2021 May 4;10:e65088. doi: 10.7554/eLife.65088.
2
Peat-based gnotobiotic plant growth systems for Arabidopsis microbiome research.用于拟南芥微生物组研究的基于泥炭的悉生植物生长系统。
Nat Protoc. 2021 May;16(5):2450-2470. doi: 10.1038/s41596-021-00504-6. Epub 2021 Apr 28.
3
High-throughput molecular analyses of microbiomes as a tool to monitor the wellbeing of aquatic environments.
高通量微生物组分子分析作为监测水生环境健康状况的工具。
Microb Biotechnol. 2021 May;14(3):870-885. doi: 10.1111/1751-7915.13763. Epub 2021 Feb 9.
4
Evaluation of computational methods for human microbiome analysis using simulated data.使用模拟数据评估人类微生物组分析的计算方法。
PeerJ. 2020 Aug 11;8:e9688. doi: 10.7717/peerj.9688. eCollection 2020.
5
Transcriptome reconstruction and functional analysis of eukaryotic marine plankton communities via high-throughput metagenomics and metatranscriptomics.通过高通量宏基因组学和宏转录组学对真核海洋浮游生物群落进行转录组重建和功能分析。
Genome Res. 2020 Apr;30(4):647-659. doi: 10.1101/gr.253070.119. Epub 2020 Mar 23.
6
Improved metagenomic analysis with Kraken 2.Kraken 2 提升宏基因组分析。
Genome Biol. 2019 Nov 28;20(1):257. doi: 10.1186/s13059-019-1891-0.
7
Pavian: interactive analysis of metagenomics data for microbiome studies and pathogen identification.Pavian:微生物组研究和病原体鉴定的宏基因组数据分析的交互式分析。
Bioinformatics. 2020 Feb 15;36(4):1303-1304. doi: 10.1093/bioinformatics/btz715.
8
Benchmarking Metagenomics Tools for Taxonomic Classification.基于元基因组工具的分类学基准测试。
Cell. 2019 Aug 8;178(4):779-794. doi: 10.1016/j.cell.2019.07.010.
9
The Integrative Human Microbiome Project.整合人类微生物组计划。
Nature. 2019 May;569(7758):641-648. doi: 10.1038/s41586-019-1238-8. Epub 2019 May 29.
10
Human contamination in bacterial genomes has created thousands of spurious proteins.人类污染的细菌基因组中创造了数千个虚假蛋白质。
Genome Res. 2019 Jun;29(6):954-960. doi: 10.1101/gr.245373.118. Epub 2019 May 7.