• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用模拟数据集评估宏基因组学处理方法的保真度。

Use of simulated data sets to evaluate the fidelity of metagenomic processing methods.

作者信息

Mavromatis Konstantinos, Ivanova Natalia, Barry Kerrie, Shapiro Harris, Goltsman Eugene, McHardy Alice C, Rigoutsos Isidore, Salamov Asaf, Korzeniewski Frank, Land Miriam, Lapidus Alla, Grigoriev Igor, Richardson Paul, Hugenholtz Philip, Kyrpides Nikos C

机构信息

Department of Energy Joint Genome Institute (DOE-JGI), 2800 Mitchell Drive, Walnut Creek, California 94598, USA.

出版信息

Nat Methods. 2007 Jun;4(6):495-500. doi: 10.1038/nmeth1043. Epub 2007 Apr 29.

DOI:10.1038/nmeth1043
PMID:17468765
Abstract

Metagenomics is a rapidly emerging field of research for studying microbial communities. To evaluate methods presently used to process metagenomic sequences, we constructed three simulated data sets of varying complexity by combining sequencing reads randomly selected from 113 isolate genomes. These data sets were designed to model real metagenomes in terms of complexity and phylogenetic composition. We assembled sampled reads using three commonly used genome assemblers (Phrap, Arachne and JAZZ), and predicted genes using two popular gene-finding pipelines (fgenesb and CRITICA/GLIMMER). The phylogenetic origins of the assembled contigs were predicted using one sequence similarity-based (blast hit distribution) and two sequence composition-based (PhyloPythia, oligonucleotide frequencies) binning methods. We explored the effects of the simulated community structure and method combinations on the fidelity of each processing step by comparison to the corresponding isolate genomes. The simulated data sets are available online to facilitate standardized benchmarking of tools for metagenomic analysis.

摘要

宏基因组学是一个快速兴起的研究微生物群落的领域。为了评估目前用于处理宏基因组序列的方法,我们通过组合从113个分离基因组中随机选择的测序读数构建了三个不同复杂度的模拟数据集。这些数据集旨在从复杂度和系统发育组成方面模拟真实的宏基因组。我们使用三种常用的基因组组装器(Phrap、Arachne和JAZZ)组装抽样读数,并使用两种流行的基因发现流程(fgenesb和CRITICA/GLIMMER)预测基因。使用一种基于序列相似性(blast命中分布)和两种基于序列组成(PhyloPythia、寡核苷酸频率)的分箱方法预测组装重叠群的系统发育起源。通过与相应的分离基因组进行比较,我们探索了模拟群落结构和方法组合对每个处理步骤保真度的影响。模拟数据集可在线获取,以促进宏基因组分析工具的标准化基准测试。

相似文献

1
Use of simulated data sets to evaluate the fidelity of metagenomic processing methods.使用模拟数据集评估宏基因组学处理方法的保真度。
Nat Methods. 2007 Jun;4(6):495-500. doi: 10.1038/nmeth1043. Epub 2007 Apr 29.
2
SOrt-ITEMS: Sequence orthology based approach for improved taxonomic estimation of metagenomic sequences.SOrt-ITEMS:基于序列直系同源性的方法,用于改进宏基因组序列的分类学估计。
Bioinformatics. 2009 Jul 15;25(14):1722-30. doi: 10.1093/bioinformatics/btp317. Epub 2009 May 13.
3
Metagenomics: read length matters.宏基因组学:读长很重要。
Appl Environ Microbiol. 2008 Mar;74(5):1453-63. doi: 10.1128/AEM.02181-07. Epub 2008 Jan 11.
4
nWayComp: a genome-wide sequence comparison tool for multiple strains/species of phylogenetically related microorganisms.nWayComp:一种用于系统发育相关微生物的多个菌株/物种的全基因组序列比较工具。
In Silico Biol. 2007;7(2):195-200.
5
MEGAN analysis of metagenomic data.宏基因组数据的MEGAN分析
Genome Res. 2007 Mar;17(3):377-86. doi: 10.1101/gr.5969107. Epub 2007 Jan 25.
6
Genome phylogenetic analysis based on extended gene contents.基于扩展基因内容的基因组系统发育分析。
Mol Biol Evol. 2004 Jul;21(7):1401-8. doi: 10.1093/molbev/msh138. Epub 2004 Apr 14.
7
Metagenomic signatures of 86 microbial and viral metagenomes.86 个微生物和病毒宏基因组的宏基因组特征。
Environ Microbiol. 2009 Jul;11(7):1752-66. doi: 10.1111/j.1462-2920.2009.01901.x. Epub 2009 Mar 18.
8
Megx.net--database resources for marine ecological genomics.Megx.net——海洋生态基因组学的数据库资源。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D390-3. doi: 10.1093/nar/gkj070.
9
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut.比较不同的组装和注释工具在分析肠道中模拟病毒宏基因组群落中的应用。
BMC Genomics. 2014 Jan 18;15:37. doi: 10.1186/1471-2164-15-37.
10
Annotation, comparison and databases for hundreds of bacterial genomes.数百种细菌基因组的注释、比较及数据库
Res Microbiol. 2007 Dec;158(10):724-36. doi: 10.1016/j.resmic.2007.09.009. Epub 2007 Oct 6.

引用本文的文献

1
Full-Length Sequencing of Circular DNA Viruses Using CIDER-Seq.使用CIDER-Seq对环状DNA病毒进行全长测序。
Methods Mol Biol. 2025;2912:191-204. doi: 10.1007/978-1-0716-4454-6_17.
2
Cataloging metagenome-assembled genomes and microbial genes from the athlete gut microbiome.对运动员肠道微生物组中的宏基因组组装基因组和微生物基因进行编目。
Microbiome Res Rep. 2024 Jul 22;3(4):41. doi: 10.20517/mrr.2023.69. eCollection 2024.
3
The choice of 16S rRNA gene sequence analysis impacted characterization of highly variable surface microbiota in dairy processing environments.
16S rRNA 基因序列分析的选择影响了乳品加工环境中高度可变表面微生物群落的特征分析。
mSystems. 2024 Nov 19;9(11):e0062024. doi: 10.1128/msystems.00620-24. Epub 2024 Oct 21.
4
MAGICIAN: MAG simulation for investigating criteria for bioinformatic analysis.魔术师:用于研究生物信息学分析标准的 MAG 模拟。
BMC Genomics. 2024 Jan 12;25(1):55. doi: 10.1186/s12864-023-09912-2.
5
Crop rotation and native microbiome inoculation restore soil capacity to suppress a root disease.轮作和原生微生物组接种恢复土壤抑制根病的能力。
Nat Commun. 2023 Dec 8;14(1):8126. doi: 10.1038/s41467-023-43926-4.
6
An Improved Machine Learning-Based Approach to Assess the Microbial Diversity in Major North Indian River Ecosystems.基于改进的机器学习方法评估印度主要北方河流生态系统中的微生物多样性。
Genes (Basel). 2023 May 14;14(5):1082. doi: 10.3390/genes14051082.
7
From defaults to databases: parameter and database choice dramatically impact the performance of metagenomic taxonomic classification tools.从默认值到数据库:参数和数据库的选择极大地影响了宏基因组分类工具的性能。
Microb Genom. 2023 Mar;9(3). doi: 10.1099/mgen.0.000949.
8
Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking.为真实细菌群落中的几乎所有组件构建宏基因组组装基因组,用于分箱基准测试。
BMC Genomics. 2022 Nov 10;23(1):746. doi: 10.1186/s12864-022-08967-x.
9
Benchmarking taxonomic classifiers with Illumina and Nanopore sequence data for clinical metagenomic diagnostic applications.使用 Illumina 和 Nanopore 测序数据对临床宏基因组诊断应用进行分类器的基准测试。
Microb Genom. 2022 Oct;8(10). doi: 10.1099/mgen.0.000886.
10
Deep-Sea Sediments from the Southern Gulf of Mexico Harbor a Wide Diversity of PKS I Genes.墨西哥湾南部的深海沉积物蕴藏着多种聚酮合酶I基因。
Antibiotics (Basel). 2022 Jul 4;11(7):887. doi: 10.3390/antibiotics11070887.